|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
17-152 |
4.70e-81 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 253.89 E-value: 4.70e-81
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 17 FKFSVLEICDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMaaphqcpqggtsyphwprlsplqYYEMSYGLNIE 96
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVM-----------------------YYEMSYGLNIE 57
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1720361216 97 MHKQAEIVKRLSAICAQMVPFLTQEHQQQVLQAVDRAKQVTVGELNSLLGQQNQLQ 152
Cdd:pfam03920 58 MHKQTEIAKRLNAICAQVIPFLSQEHQQQVAQAVERAKQVTMAELNAIIGQQQQLQ 113
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
351-753 |
5.89e-42 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 157.77 E-value: 5.89e-42
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 351 LAAQPAAPTDSIALRSPLTLSSPFTSSFSLGSHSTLNGDLSMPGSYVGLHLSPQQMAFESHPHLRGSSVSLPGIPVAKPA 430
Cdd:COG2319 4 ADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLS 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 431 YSFhvSADGQMQPVPFPSDALVGTGIPRHARQLHTLAHGEVVCAVTISSSTQHVYTGGK-GCVKVWDVGqpgSKTPVAQL 509
Cdd:COG2319 84 VAF--SPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSAdGTVRLWDLA---TGKLLRTL 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 510 DclNRDNYIRSCKLLPDGQSLIVGGEASTLSIWDLAapTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQ 589
Cdd:COG2319 159 T--GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLA--TGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLA 234
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 590 NQAMVRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREGRQLQQ-HDFSSQIFSLGHCPNQDWLAVGMESSHVEV 668
Cdd:COG2319 235 TGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRL 314
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 669 LHVRKPEK-YQLRLHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYGASIFQSKE-SSSVLSCDISRNNKYIVTGSGDK 746
Cdd:COG2319 315 WDLATGKLlRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADG 394
|
....*..
gi 1720361216 747 KATVYEV 753
Cdd:COG2319 395 TVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
468-752 |
1.22e-36 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 139.39 E-value: 1.22e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 468 HGEVVCAVTISSSTQHVYTGGK-GCVKVWDV-GQPGSKTPVAQLDClnrdnyIRSCKLLPDGQSLIVGGEASTLSIWDLa 545
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGdGTIKVWDLeTGELLRTLKGHTGP------VRDVAASADGTYLASGSSDKTIRLWDL- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 546 aPTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCIDISDYGTRLWTGGLDNTV 625
Cdd:cd00200 81 -ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTI 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 626 RCWDLREGR---QLQQHDfsSQIFSLGHCPNQDWLAVGMESSHVEVLHVRKPE-KYQLRLHESCVLSLKFASCGRWFVST 701
Cdd:cd00200 160 KLWDLRTGKcvaTLTGHT--GEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASG 237
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 1720361216 702 GKDNLLNAWRTPYGASIFQ-SKESSSVLSCDISRNNKYIVTGSGDKKATVYE 752
Cdd:cd00200 238 SEDGTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
269-447 |
1.53e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 55.33 E-value: 1.53e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 269 PSEPPSPVT--TPCGKAPLCIPARRDLTDSPASLASSLGSPLPRSKDIALNDLPTGTPASRSCGTSPPQDSSTPGPSSAS 346
Cdd:PHA03247 2739 PAPPAVPAGpaTPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL 2818
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 347 HLCQLAAQPAAPTDSIALRSPLTLSSPFTSSFSLGSHSTLNGDLSMPGSYvglHLSPQQMAFESHPHLRgssvSLPGIPV 426
Cdd:PHA03247 2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPS---RSPAAKPAAPARPPVR----RLARPAV 2891
|
170 180
....*....|....*....|.
gi 1720361216 427 AKPAYSFHVSADGQmQPVPFP 447
Cdd:PHA03247 2892 SRSTESFALPPDQP-ERPPQP 2911
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
594-629 |
6.36e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 46.15 E-value: 6.36e-07
10 20 30
....*....|....*....|....*....|....*.
gi 1720361216 594 VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWD 629
Cdd:smart00320 5 LKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
594-629 |
9.29e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 43.10 E-value: 9.29e-06
10 20 30
....*....|....*....|....*....|....*.
gi 1720361216 594 VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWD 629
Cdd:pfam00400 4 LKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
222-430 |
1.48e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 42.08 E-value: 1.48e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 222 ESLVEEDHPSSRGGSGKQQRAEDkDLSGP----YDSEEDKSDyNLVVDEDQPSEPPSPVTTPCGKAPLCIPARRDLTDSP 297
Cdd:PHA03307 12 EAAAEGGEFFPRPPATPGDAADD-LLSGSqgqlVSDSAELAA-VTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPT 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 298 ASLAS-SLGSPLPRSKDIALNDLPTGTPASR----SCGTSPPQDSSTPGPSSASHLCQLAAQPAAPTDSIAlRSPLTLSS 372
Cdd:PHA03307 90 WSLSTlAPASPAREGSPTPPGPSSPDPPPPTpppaSPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPA-AVASDAAS 168
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1720361216 373 PFTSSFSLGSHSTLNGDLSMPGSYVGlhLSPQQMAFESHPHLRGSSVSLP-GIPVAKPA 430
Cdd:PHA03307 169 SRQAALPLSSPEETARAPSSPPAEPP--PSTPPAAASPRPPRRSSPISASaSSPAPAPG 225
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
17-152 |
4.70e-81 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 253.89 E-value: 4.70e-81
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 17 FKFSVLEICDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMaaphqcpqggtsyphwprlsplqYYEMSYGLNIE 96
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVM-----------------------YYEMSYGLNIE 57
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1720361216 97 MHKQAEIVKRLSAICAQMVPFLTQEHQQQVLQAVDRAKQVTVGELNSLLGQQNQLQ 152
Cdd:pfam03920 58 MHKQTEIAKRLNAICAQVIPFLSQEHQQQVAQAVERAKQVTMAELNAIIGQQQQLQ 113
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
351-753 |
5.89e-42 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 157.77 E-value: 5.89e-42
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 351 LAAQPAAPTDSIALRSPLTLSSPFTSSFSLGSHSTLNGDLSMPGSYVGLHLSPQQMAFESHPHLRGSSVSLPGIPVAKPA 430
Cdd:COG2319 4 ADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLS 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 431 YSFhvSADGQMQPVPFPSDALVGTGIPRHARQLHTLAHGEVVCAVTISSSTQHVYTGGK-GCVKVWDVGqpgSKTPVAQL 509
Cdd:COG2319 84 VAF--SPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSAdGTVRLWDLA---TGKLLRTL 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 510 DclNRDNYIRSCKLLPDGQSLIVGGEASTLSIWDLAapTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQ 589
Cdd:COG2319 159 T--GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLA--TGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLA 234
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 590 NQAMVRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREGRQLQQ-HDFSSQIFSLGHCPNQDWLAVGMESSHVEV 668
Cdd:COG2319 235 TGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRL 314
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 669 LHVRKPEK-YQLRLHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYGASIFQSKE-SSSVLSCDISRNNKYIVTGSGDK 746
Cdd:COG2319 315 WDLATGKLlRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADG 394
|
....*..
gi 1720361216 747 KATVYEV 753
Cdd:COG2319 395 TVRLWDL 401
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
461-713 |
8.20e-37 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 143.13 E-value: 8.20e-37
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 461 RQLHTL-AHGEVVCAVTISSSTQHVYTGGK-GCVKVWDVGqpgSKTPVAQLDclNRDNYIRSCKLLPDGQSLIVGGEAST 538
Cdd:COG2319 153 KLLRTLtGHSGAVTSVAFSPDGKLLASGSDdGTVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKLLASGSADGT 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 539 LSIWDLAapTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCIDISDYGTRLWT 618
Cdd:COG2319 228 VRLWDLA--TGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 619 GGLDNTVRCWDLREGRQLQQHD-FSSQIFSLGHCPNQDWLAVGMESSHVEVLHVR-KPEKYQLRLHESCVLSLKFASCGR 696
Cdd:COG2319 306 GSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLAtGELLRTLTGHTGAVTSVAFSPDGR 385
|
250
....*....|....*..
gi 1720361216 697 WFVSTGKDNLLNAWRTP 713
Cdd:COG2319 386 TLASGSADGTVRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
468-752 |
1.22e-36 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 139.39 E-value: 1.22e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 468 HGEVVCAVTISSSTQHVYTGGK-GCVKVWDV-GQPGSKTPVAQLDClnrdnyIRSCKLLPDGQSLIVGGEASTLSIWDLa 545
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGdGTIKVWDLeTGELLRTLKGHTGP------VRDVAASADGTYLASGSSDKTIRLWDL- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 546 aPTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCIDISDYGTRLWTGGLDNTV 625
Cdd:cd00200 81 -ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTI 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 626 RCWDLREGR---QLQQHDfsSQIFSLGHCPNQDWLAVGMESSHVEVLHVRKPE-KYQLRLHESCVLSLKFASCGRWFVST 701
Cdd:cd00200 160 KLWDLRTGKcvaTLTGHT--GEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASG 237
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 1720361216 702 GKDNLLNAWRTPYGASIFQ-SKESSSVLSCDISRNNKYIVTGSGDKKATVYE 752
Cdd:cd00200 238 SEDGTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
454-711 |
6.06e-34 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 131.69 E-value: 6.06e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 454 TGIPRHARQLHTlahGEVVCAVTISSSTQhVYTGGK-GCVKVWDVGQPgsktpvaqlDCLNR----DNYIRSCKLLPDGQ 528
Cdd:cd00200 40 TGELLRTLKGHT---GPVRDVAASADGTY-LASGSSdKTIRLWDLETG---------ECVRTltghTSYVSSVAFSPDGR 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 529 SLIVGGEASTLSIWDLaaPTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCID 608
Cdd:cd00200 107 ILSSSSRDKTIKVWDV--ETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVA 184
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 609 ISDYGTRLWTGGLDNTVRCWDLREGRQLQQHD-FSSQIFSLGHCPNQDWLAVGMESSHVEVLHVRKPE-KYQLRLHESCV 686
Cdd:cd00200 185 FSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRgHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGEcVQTLSGHTNSV 264
|
250 260
....*....|....*....|....*
gi 1720361216 687 LSLKFASCGRWFVSTGKDNLLNAWR 711
Cdd:cd00200 265 TSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
562-753 |
7.55e-24 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 102.41 E-value: 7.55e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 562 CYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREGRQLQQ-HD 640
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTlTG 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 641 FSSQIFSLGHCPNQDWLAVGMESSHVEVLHVRKPE-KYQLRLHESCVLSLKFASCGRwFVSTGK-DNLLNAWRTPYGASI 718
Cdd:cd00200 92 HTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKcLTTLRGHTDWVNSVAFSPDGT-FVASSSqDGTIKLWDLRTGKCV 170
|
170 180 190
....*....|....*....|....*....|....*..
gi 1720361216 719 --FQSkESSSVLSCDISRNNKYIVTGSGDKKATVYEV 753
Cdd:cd00200 171 atLTG-HTGEVNSVAFSPDGEKLLSSSSDGTIKLWDL 206
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
523-753 |
1.06e-17 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 86.12 E-value: 1.06e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 523 LLPDGQSLIVGGEASTLSIWDLAAPTPRIKAELTSSAPAcyALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTD 602
Cdd:COG2319 2 LSADGAALAAASADLALALLAAALGALLLLLLGLAAAVA--SLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTA 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 603 GASCIDISDYGTRLWTGGLDNTVRCWDLREGRQLQQ-HDFSSQIFSLGHCPNQDWLAVGMESSHVEVLHVRKPEK-YQLR 680
Cdd:COG2319 80 AVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTlTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLlRTLT 159
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720361216 681 LHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYGASIFQ-SKESSSVLSCDISRNNKYIVTGSGDKKATVYEV 753
Cdd:COG2319 160 GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTlTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDL 233
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
593-753 |
1.12e-17 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 84.31 E-value: 1.12e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 593 MVRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREG---RQLQQHdfSSQIFSLGHCPNQDWLAVGMESSHVEVL 669
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellRTLKGH--TGPVRDVAASADGTYLASGSSDKTIRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 670 HVRKPEK-YQLRLHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYGASI--FQSKEsSSVLSCDISRNNKYIVTGSGDK 746
Cdd:cd00200 79 DLETGECvRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLttLRGHT-DWVNSVAFSPDGTFVASSSQDG 157
|
....*..
gi 1720361216 747 KATVYEV 753
Cdd:cd00200 158 TIKLWDL 164
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
461-590 |
8.47e-13 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 70.71 E-value: 8.47e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 461 RQLHTLA-HGEVVCAVTISSSTQHVYTGGKGC-VKVWDVGqpgSKTPVAQLDclNRDNYIRSCKLLPDGQSLIVGGEAST 538
Cdd:COG2319 279 ELLRTLTgHSGGVNSVAFSPDGKLLASGSDDGtVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGT 353
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 1720361216 539 LSIWDLAapTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQN 590
Cdd:COG2319 354 VRLWDLA--TGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
269-447 |
1.53e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 55.33 E-value: 1.53e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 269 PSEPPSPVT--TPCGKAPLCIPARRDLTDSPASLASSLGSPLPRSKDIALNDLPTGTPASRSCGTSPPQDSSTPGPSSAS 346
Cdd:PHA03247 2739 PAPPAVPAGpaTPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL 2818
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 347 HLCQLAAQPAAPTDSIALRSPLTLSSPFTSSFSLGSHSTLNGDLSMPGSYvglHLSPQQMAFESHPHLRgssvSLPGIPV 426
Cdd:PHA03247 2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPS---RSPAAKPAAPARPPVR----RLARPAV 2891
|
170 180
....*....|....*....|.
gi 1720361216 427 AKPAYSFHVSADGQmQPVPFP 447
Cdd:PHA03247 2892 SRSTESFALPPDQP-ERPPQP 2911
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
594-629 |
6.36e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 46.15 E-value: 6.36e-07
10 20 30
....*....|....*....|....*....|....*.
gi 1720361216 594 VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWD 629
Cdd:smart00320 5 LKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
594-629 |
9.29e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 43.10 E-value: 9.29e-06
10 20 30
....*....|....*....|....*....|....*.
gi 1720361216 594 VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWD 629
Cdd:pfam00400 4 LKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
523-630 |
1.25e-03 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 41.22 E-value: 1.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 523 LLPDGQSLIVGGEAS-TLSIWDLAapTPRIKAEL-TSSAPacYALAVSPDAKVCFSCCSDGN-----IVVWDLQNQAMVR 595
Cdd:COG3391 117 VDPDGGRLYVADSGNgRVSVIDTA--TGKVVATIpVGAGP--HGIAVDPDGKRLYVANSGSNtvsviVSVIDTATGKVVA 192
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 1720361216 596 QFQGHtDGASCIDISDYGTRLW--------TGGLDNTVRCWDL 630
Cdd:COG3391 193 TIPVG-GGPVGVAVSPDGRRLYvanrgsntSNGGSNTVSVIDL 234
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
222-430 |
1.48e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 42.08 E-value: 1.48e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 222 ESLVEEDHPSSRGGSGKQQRAEDkDLSGP----YDSEEDKSDyNLVVDEDQPSEPPSPVTTPCGKAPLCIPARRDLTDSP 297
Cdd:PHA03307 12 EAAAEGGEFFPRPPATPGDAADD-LLSGSqgqlVSDSAELAA-VTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPT 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720361216 298 ASLAS-SLGSPLPRSKDIALNDLPTGTPASR----SCGTSPPQDSSTPGPSSASHLCQLAAQPAAPTDSIAlRSPLTLSS 372
Cdd:PHA03307 90 WSLSTlAPASPAREGSPTPPGPSSPDPPPPTpppaSPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPA-AVASDAAS 168
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1720361216 373 PFTSSFSLGSHSTLNGDLSMPGSYVGlhLSPQQMAFESHPHLRGSSVSLP-GIPVAKPA 430
Cdd:PHA03307 169 SRQAALPLSSPEETARAPSSPPAEPP--PSTPPAAASPRPPRRSSPISASaSSPAPAPG 225
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
548-587 |
1.58e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.91 E-value: 1.58e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1720361216 548 TPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWD 587
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
|