| Name | Accession | Description | Interval | E-value |
| TLE_N | pfam03920 | Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Groucho/TLE co-repressor ... | 17-129 | 1.30e-85 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Groucho/TLE co-repressor proteins is involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 266.98 E-value: 1.30e-85
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 17 FKFSVLEICDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQAEIVKRLSAICAQMVPFLT 96
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100 110
....*....|....*....|....*....|...
gi 1907074849 97 QEHQQQVLQAVDRAKQVTVGELNSLLGQQNQLQ 129
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGQQQQLQ 113
|
|
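As a rough check on the pfam03920 alignment above, the fraction of identical residues can be counted column by column. The following is a minimal sketch, not part of the CD-Search output: the function name `identity` is hypothetical, and it assumes that `-` marks a gap and that lowercase letters mark unaligned residues, so both are skipped. It is applied here to the second alignment block only (query 97-129 against model positions 81-113).

```python
def identity(query_row, subject_row):
    """Return (matches, aligned_columns) for two equal-length alignment rows."""
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have equal length")
    matches = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        # Assumption: '-' is a gap and lowercase letters are unaligned residues,
        # so only columns with two uppercase letters are counted.
        if not (q.isupper() and s.isupper()):
            continue
        aligned += 1
        if q == s:
            matches += 1
    return matches, aligned


# Second block of the pfam03920 alignment, copied from the rows above.
query = "QEHQQQVLQAVDRAKQVTVGELNSLLGQQNQLQ"
subject = "QEHQQQVAQAVERAKQVTMAELNAIIGQQQQLQ"

matches, aligned = identity(query, subject)
print(f"{matches}/{aligned} identical ({100 * matches / aligned:.1f}%)")
```

The same function can be run on any query/Cdd row pair in this listing, one block at a time or on the concatenated blocks.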
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 437-689 | 3.91e-36 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 141.20 E-value: 3.91e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 437 RQLHTL-AHGEVVCAVTISSSTQHVYTGGK-GCVKVWDVGqpgSKTPVAQLDclNRDNYIRSCKLLPDGQSLIVGGEAST 514
Cdd:COG2319 153 KLLRTLtGHSGAVTSVAFSPDGKLLASGSDdGTVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKLLASGSADGT 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 515 LSIWDLAapTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCIDISDYGTRLWT 594
Cdd:COG2319 228 VRLWDLA--TGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 595 GGLDNTVRCWDLREGRQLQQHD-FSSQIFSLGHCPNQDWLAVGMESSHVEVLHVR-KPEKYQLRLHESCVLSLKFASCGR 672
Cdd:COG2319 306 GSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLAtGELLRTLTGHTGAVTSVAFSPDGR 385
|
250
....*....|....*..
gi 1907074849 673 WFVSTGKDNLLNAWRTP 689
Cdd:COG2319 386 TLASGSADGTVRLWDLA 402
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 327-691 | 1.26e-35 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 139.66 E-value: 1.26e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 327 QLAAQPAAPTDSIALRSPLTLSSPFTSSFSLGSHSTLNGDLSMPGSYVGLHLSPQMAFESHPHLRGSSVSLPGIPVAKPA 406
Cdd:COG2319 4 ADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLS 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 407 YSFhvSADGQMQPVPFPSDALVGTGIPRHARQLHTLAHGEVVCAVTISSSTQHVYTGGK-GCVKVWDVGqpgSKTPVAQL 485
Cdd:COG2319 84 VAF--SPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSAdGTVRLWDLA---TGKLLRTL 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 486 DclNRDNYIRSCKLLPDGQSLIVGGEASTLSIWDLAapTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQ 565
Cdd:COG2319 159 T--GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLA--TGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLA 234
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 566 NQAMVRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREGRQLQQ-HDFSSQIFSLGHCPNQDWLAVGMESSHVEV 644
Cdd:COG2319 235 TGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRL 314
|
330 340 350 360
....*....|....*....|....*....|....*....|....*...
gi 1907074849 645 LHVRKPEK-YQLRLHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYG 691
Cdd:COG2319 315 WDLATGKLlRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATG 362
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 430-687 | 2.24e-33 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 130.15 E-value: 2.24e-33
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 430 TGIPRHARQLHTlahGEVVCAVTISSSTQhVYTGGK-GCVKVWDVGQPgsktpvaqlDCLNR----DNYIRSCKLLPDGQ 504
Cdd:cd00200 40 TGELLRTLKGHT---GPVRDVAASADGTY-LASGSSdKTIRLWDLETG---------ECVRTltghTSYVSSVAFSPDGR 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 505 SLIVGGEASTLSIWDLaaPTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCID 584
Cdd:cd00200 107 ILSSSSRDKTIKVWDV--ETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVA 184
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 585 ISDYGTRLWTGGLDNTVRCWDLREGRQLQQHD-FSSQIFSLGHCPNQDWLAVGMESSHVEVLHVRKPE-KYQLRLHESCV 662
Cdd:cd00200 185 FSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRgHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGEcVQTLSGHTNSV 264
|
250 260
....*....|....*....|....*
gi 1907074849 663 LSLKFASCGRWFVSTGKDNLLNAWR 687
Cdd:cd00200 265 TSLAWSPDGKRLASGSADGTIRIWD 289
|
|
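The cd00200 description above characterizes a WD40 repeat by a GH dipeptide toward its N-terminus and a WD dipeptide at its C-terminus. A naive way to look for such candidates in a sequence is a regular-expression scan. This is only an illustrative sketch: the function name `find_gh_wd_candidates` and the gap bounds `min_gap` and `max_gap` are assumptions chosen loosely, not values taken from CDD, and many genuine WD40 repeats lack an exact GH or WD, so this is not a substitute for the profile-based hits listed here.

```python
import re


def find_gh_wd_candidates(seq, min_gap=20, max_gap=40):
    """Return (start, end) spans of GH...WD stretches, 0-based, end exclusive.

    min_gap and max_gap bound the number of residues between GH and WD;
    the defaults are illustrative assumptions, not CDD parameters.
    """
    # The lookahead allows overlapping candidates; the non-greedy
    # quantifier picks the nearest WD that satisfies the gap bounds.
    pattern = re.compile(r"(?=(GH.{%d,%d}?WD))" % (min_gap, max_gap))
    return [(m.start(1), m.end(1)) for m in pattern.finditer(seq.upper())]


# Query residues 570-605, the segment matched by smart00320 and pfam00400.
segment = "VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWD"
for start, end in find_gh_wd_candidates(segment):
    print(start, end, segment[start:end])
```

Adding 570 to the reported offsets converts them to query coordinates for this segment.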
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 390-698 | 2.57e-32 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 130.03 E-value: 2.57e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 390 LRGSSVSLPGIPVAKPAYSFHVSADGQMQPVPFPSDALVGTGIPRHARQLHTLAHGEVVCAVTISSSTQHVYTGGK-GCV 468
Cdd:COG2319 23 AALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASAdGTV 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 469 KVWDVgqpgsKTPVAQLDCLNRDNYIRSCKLLPDGQSLIVGGEASTLSIWDLAapTPRIKAELTSSAPACYALAVSPDAK 548
Cdd:COG2319 103 RLWDL-----ATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLA--TGKLLRTLTGHSGAVTSVAFSPDGK 175
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 549 VCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREGRQLQQ-HDFSSQIFSLGHC 627
Cdd:COG2319 176 LLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTlTGHSGSVRSVAFS 255
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907074849 628 PNQDWLAVGMESSHVEVLHVR-KPEKYQLRLHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYGASIFQVT 698
Cdd:COG2319 256 PDGRLLASGSADGTVRLWDLAtGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT 327
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 444-688 | 3.26e-31 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 123.98 E-value: 3.26e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 444 HGEVVCAVTISSSTQHVYTGGK-GCVKVWDV-GQPGSKTPVAQLDClnrdnyIRSCKLLPDGQSLIVGGEASTLSIWDLa 521
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGdGTIKVWDLeTGELLRTLKGHTGP------VRDVAASADGTYLASGSSDKTIRLWDL- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 522 aPTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCIDISDYGTRLWTGGLDNTV 601
Cdd:cd00200 81 -ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTI 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 602 RCWDLREGR---QLQQHDfsSQIFSLGHCPNQDWLAVGMESSHVEVLHVRKPE-KYQLRLHESCVLSLKFASCGRWFVST 677
Cdd:cd00200 160 KLWDLRTGKcvaTLTGHT--GEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASG 237
|
250
....*....|.
gi 1907074849 678 GKDNLLNAWRT 688
Cdd:cd00200 238 SEDGTIRVWDL 248
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 437-566 | 1.92e-12 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 69.94 E-value: 1.92e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 437 RQLHTLA-HGEVVCAVTISSSTQHVYTGGKGC-VKVWDVGqpgSKTPVAQLDclNRDNYIRSCKLLPDGQSLIVGGEAST 514
Cdd:COG2319 279 ELLRTLTgHSGGVNSVAFSPDGKLLASGSDDGtVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGT 353
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 1907074849 515 LSIWDLAapTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQN 566
Cdd:COG2319 354 VRLWDLA--TGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 569-691 | 5.12e-11 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 64.28 E-value: 5.12e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 569 MVRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREG---RQLQQHdfSSQIFSLGHCPNQDWLAVGMESSHVEVL 645
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellRTLKGH--TGPVRDVAASADGTYLASGSSDKTIRLW 78
|
90 100 110 120
....*....|....*....|....*....|....*....|....*..
gi 1907074849 646 HVRKPEK-YQLRLHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYG 691
Cdd:cd00200 79 DLETGECvRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETG 125
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 499-698 | 5.60e-11 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 65.32 E-value: 5.60e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 499 LLPDGQSLIVGGEASTLSIWDLAAPTPRIKAELTSSAPAcyALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTD 578
Cdd:COG2319 2 LSADGAALAAASADLALALLAAALGALLLLLLGLAAAVA--SLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTA 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 579 GASCIDISDYGTRLWTGGLDNTVRCWDLREGRQLQQ-HDFSSQIFSLGHCPNQDWLAVGMESSHVEVLHVRKPEK-YQLR 656
Cdd:COG2319 80 AVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTlTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLlRTLT 159
|
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 1907074849 657 LHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYGASIFQVT 698
Cdd:COG2319 160 GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT 201
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 246-423 | 1.57e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 55.33 E-value: 1.57e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 246 PSEPPSPVT--TPCGKAPLCIPARRDLTDSPASLASSLGSPLPRSKDIALNDLPTGTPASRSCGTSPPQDSSTPGPSSAS 323
Cdd:PHA03247 2739 PAPPAVPAGpaTPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL 2818
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 324 HLCQLAAQPAAPTDSIALRSPLTLSSPFTSSFSLGSHSTLNGDLSMPGSyvglHLSPQMAFESHPHLRGSSVSLPgiPVA 403
Cdd:PHA03247 2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPP----SRSPAAKPAAPARPPVRRLARP--AVS 2892
|
170 180
....*....|....*....|
gi 1907074849 404 KPAYSFHVSADGQmQPVPFP 423
Cdd:PHA03247 2893 RSTESFALPPDQP-ERPPQP 2911
|
|
| WD40 | smart00320 | WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... | 570-605 | 9.20e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 46.15 E-value: 9.20e-07
10 20 30
....*....|....*....|....*....|....*.
gi 1907074849 570 VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWD 605
Cdd:smart00320 5 LKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 | pfam00400 | WD domain, G-beta repeat; | 570-605 | 1.23e-05 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 42.72 E-value: 1.23e-05
10 20 30
....*....|....*....|....*....|....*.
gi 1907074849 570 VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWD 605
Cdd:pfam00400 4 LKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| PHA03307 | PHA03307 | transcriptional regulator ICP4; Provisional | 199-406 | 8.35e-04 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 43.24 E-value: 8.35e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 199 ESLVEEDHPSSRGGSGKQQRAEDkDLSGP----YDSEEDKSDyNLVVDEDQPSEPPSPVTTPCGKAPLCIPARRDLTDSP 274
Cdd:PHA03307 12 EAAAEGGEFFPRPPATPGDAADD-LLSGSqgqlVSDSAELAA-VTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPT 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 275 ASLAS-SLGSPLPRSKDIALNDLPTGTPASR----SCGTSPPQDSSTPGPSSASHLCQLAAQPAAPTDSIAlRSPLTLSS 349
Cdd:PHA03307 90 WSLSTlAPASPAREGSPTPPGPSSPDPPPPTpppaSPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPA-AVASDAAS 168
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907074849 350 PFTSSFSLGSHSTLNGDLSMPGSYVGLhLSPQMAFESHPHLRGSSVSLP-GIPVAKPA 406
Cdd:PHA03307 169 SRQAALPLSSPEETARAPSSPPAEPPP-STPPAAASPRPPRRSSPISASaSSPAPAPG 225
|
|
| YncE | COG3391 | DNA-binding beta-propeller fold protein YncE [General function prediction only]; | 499-606 | 1.35e-03 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 41.22 E-value: 1.35e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907074849 499 LLPDGQSLIVGGEAS-TLSIWDLAapTPRIKAEL-TSSAPacYALAVSPDAKVCFSCCSDGN-----IVVWDLQNQAMVR 571
Cdd:COG3391 117 VDPDGGRLYVADSGNgRVSVIDTA--TGKVVATIpVGAGP--HGIAVDPDGKRLYVANSGSNtvsviVSVIDTATGKVVA 192
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 1907074849 572 QFQGHtDGASCIDISDYGTRLW--------TGGLDNTVRCWDL 606
Cdd:COG3391 193 TIPVG-GGPVGVAVSPDGRRLYvanrgsntSNGGSNTVSVIDL 234
|
|
| WD40 | smart00320 | WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... | 524-563 | 2.08e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.52 E-value: 2.08e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1907074849 524 TPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWD 563
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
|
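Each hit above carries a statistics line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", optionally with a "[Multi-domain]" flag. When working with a text dump like this one, those lines can be pulled into structured records for sorting or filtering. The following is a minimal sketch under that assumption; the function name `parse_stats` and the dictionary keys are hypothetical and not part of any NCBI tool.

```python
import re

# Pattern for the per-hit statistics line as it appears in this listing.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>\S+)"
)


def parse_stats(line):
    """Parse one statistics line into a dict, or return None if it does not match."""
    m = STATS_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("multi") is not None,
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


lines = [
    "Pssm-ID: 461094 Cd Length: 117 Bit Score: 266.98 E-value: 1.30e-85",
    "Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 141.20 E-value: 3.91e-36",
]
records = [parse_stats(line) for line in lines]
# Sort hits from most to least significant by E-value.
for rec in sorted(records, key=lambda r: r["e_value"]):
    print(rec)
```

Filtering on the parsed `e_value` field, for example keeping only hits below a chosen threshold, is one way to separate the strong TLE_N and WD40 matches from the weak provisional ones.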