|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
17-132 |
3.74e-84 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 263.52 E-value: 3.74e-84
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 17 FKFSVLEICDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQAEIVKRLSAICAQMVPFLT 96
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100 110
....*....|....*....|....*....|....*.
gi 1039732843 97 QEHQQQVLQAVDRAKQVTVGELNSLLGQNQLQPLSH 132
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGQQQQLQAQH 116
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
449-701 |
5.23e-36 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 140.82 E-value: 5.23e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 449 RQLHTL-AHGEVVCAVTISSSTQHVYTGGK-GCVKVWDVGqpgSKTPVAQLDclNRDNYIRSCKLLPDGQSLIVGGEAST 526
Cdd:COG2319 153 KLLRTLtGHSGAVTSVAFSPDGKLLASGSDdGTVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKLLASGSADGT 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 527 LSIWDLAapTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCIDISDYGTRLWT 606
Cdd:COG2319 228 VRLWDLA--TGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 607 GGLDNTVRCWDLREGRQLQQHD-FSSQIFSLGHCPNQDWLAVGMESSHVEVLHVR-KPEKYQLRLHESCVLSLKFASCGR 684
Cdd:COG2319 306 GSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLAtGELLRTLTGHTGAVTSVAFSPDGR 385
|
250
....*....|....*..
gi 1039732843 685 WFVSTGKDNLLNAWRTP 701
Cdd:COG2319 386 TLASGSADGTVRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
442-699 |
2.78e-33 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 130.15 E-value: 2.78e-33
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 442 TGIPRHARQLHTlahGEVVCAVTISSSTQhVYTGGK-GCVKVWDVGQPgsktpvaqlDCLNR----DNYIRSCKLLPDGQ 516
Cdd:cd00200 40 TGELLRTLKGHT---GPVRDVAASADGTY-LASGSSdKTIRLWDLETG---------ECVRTltghTSYVSSVAFSPDGR 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 517 SLIVGGEASTLSIWDLaaPTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCID 596
Cdd:cd00200 107 ILSSSSRDKTIKVWDV--ETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVA 184
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 597 ISDYGTRLWTGGLDNTVRCWDLREGRQLQQHD-FSSQIFSLGHCPNQDWLAVGMESSHVEVLHVRKPE-KYQLRLHESCV 674
Cdd:cd00200 185 FSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRgHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGEcVQTLSGHTNSV 264
|
250 260
....*....|....*....|....*
gi 1039732843 675 LSLKFASCGRWFVSTGKDNLLNAWR 699
Cdd:cd00200 265 TSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
582-617 |
9.91e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 45.77 E-value: 9.91e-07
10 20 30
....*....|....*....|....*....|....*.
gi 1039732843 582 VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWD 617
Cdd:smart00320 5 LKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
245-435 |
1.11e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 52.63 E-value: 1.11e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 245 PSEPPSPVT--TPCGKAPLCIPARRDLTDSPASLASSLGSPLPRSKDIALNDLPTGTPASRSCGTSPPQDSSTPGPSSAS 322
Cdd:PHA03247 2739 PAPPAVPAGpaTPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL 2818
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 323 HLCQLAAQPAAPTDSIALRSPLTLSSPFTSSFSLGSHSTLNGDLSMPGSyvglhlspqvsssvvYGRSPLQMAFESHPHL 402
Cdd:PHA03247 2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPP---------------SRSPAAKPAAPARPPV 2883
|
170 180 190
....*....|....*....|....*....|...
gi 1039732843 403 RgssvSLPGIPVAKPAYSFHVSADGQmQPVPFP 435
Cdd:PHA03247 2884 R----RLARPAVSRSTESFALPPDQP-ERPPQP 2911
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
582-617 |
1.33e-05 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 42.72 E-value: 1.33e-05
10 20 30
....*....|....*....|....*....|....*.
gi 1039732843 582 VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWD 617
Cdd:pfam00400 4 LKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
17-132 |
3.74e-84 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 263.52 E-value: 3.74e-84
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 17 FKFSVLEICDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQAEIVKRLSAICAQMVPFLT 96
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100 110
....*....|....*....|....*....|....*.
gi 1039732843 97 QEHQQQVLQAVDRAKQVTVGELNSLLGQNQLQPLSH 132
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGQQQQLQAQH 116
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
449-701 |
5.23e-36 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 140.82 E-value: 5.23e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 449 RQLHTL-AHGEVVCAVTISSSTQHVYTGGK-GCVKVWDVGqpgSKTPVAQLDclNRDNYIRSCKLLPDGQSLIVGGEAST 526
Cdd:COG2319 153 KLLRTLtGHSGAVTSVAFSPDGKLLASGSDdGTVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKLLASGSADGT 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 527 LSIWDLAapTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCIDISDYGTRLWT 606
Cdd:COG2319 228 VRLWDLA--TGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 607 GGLDNTVRCWDLREGRQLQQHD-FSSQIFSLGHCPNQDWLAVGMESSHVEVLHVR-KPEKYQLRLHESCVLSLKFASCGR 684
Cdd:COG2319 306 GSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLAtGELLRTLTGHTGAVTSVAFSPDGR 385
|
250
....*....|....*..
gi 1039732843 685 WFVSTGKDNLLNAWRTP 701
Cdd:COG2319 386 TLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
402-703 |
4.42e-35 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 138.12 E-value: 4.42e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 402 LRGSSVSLPGIPVAKPAYSFHVSADGQMQPVPFPSDALVGTGIPRHARQLHTLAHGEVVCAVTISSSTQHVYTGGK-GCV 480
Cdd:COG2319 65 AAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSAdGTV 144
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 481 KVWDVGqpgSKTPVAQLDclNRDNYIRSCKLLPDGQSLIVGGEASTLSIWDLAapTPRIKAELTSSAPACYALAVSPDAK 560
Cdd:COG2319 145 RLWDLA---TGKLLRTLT--GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLA--TGKLLRTLTGHTGAVRSVAFSPDGK 217
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 561 VCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREGRQLQQ-HDFSSQIFSLGHC 639
Cdd:COG2319 218 LLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTlTGHSGGVNSVAFS 297
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1039732843 640 PNQDWLAVGMESSHVEVLHVRKPEK-YQLRLHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYG 703
Cdd:COG2319 298 PDGKLLASGSDDGTVRLWDLATGKLlRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATG 362
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
442-699 |
2.78e-33 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 130.15 E-value: 2.78e-33
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 442 TGIPRHARQLHTlahGEVVCAVTISSSTQhVYTGGK-GCVKVWDVGQPgsktpvaqlDCLNR----DNYIRSCKLLPDGQ 516
Cdd:cd00200 40 TGELLRTLKGHT---GPVRDVAASADGTY-LASGSSdKTIRLWDLETG---------ECVRTltghTSYVSSVAFSPDGR 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 517 SLIVGGEASTLSIWDLaaPTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCID 596
Cdd:cd00200 107 ILSSSSRDKTIKVWDV--ETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVA 184
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 597 ISDYGTRLWTGGLDNTVRCWDLREGRQLQQHD-FSSQIFSLGHCPNQDWLAVGMESSHVEVLHVRKPE-KYQLRLHESCV 674
Cdd:cd00200 185 FSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRgHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGEcVQTLSGHTNSV 264
|
250 260
....*....|....*....|....*
gi 1039732843 675 LSLKFASCGRWFVSTGKDNLLNAWR 699
Cdd:cd00200 265 TSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
423-710 |
3.37e-32 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 129.65 E-value: 3.37e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 423 VSADGQMQPVPFPSDALVGTGIPRHARQLHTLAHGEVVCAVTISSSTQHVYTGGK-GCVKVWDVgqpgsKTPVAQLDCLN 501
Cdd:COG2319 44 ASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASAdGTVRLWDL-----ATGLLLRTLTG 118
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 502 RDNYIRSCKLLPDGQSLIVGGEASTLSIWDLAapTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQAM 581
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSADGTVRLWDLA--TGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKL 196
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 582 VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREGRQLQQ-HDFSSQIFSLGHCPNQDWLAVGMESSHVEVLHVR 660
Cdd:COG2319 197 LRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTlTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLA 276
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 1039732843 661 -KPEKYQLRLHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYGASIFQVT 710
Cdd:COG2319 277 tGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT 327
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
456-700 |
3.84e-31 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 123.60 E-value: 3.84e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 456 HGEVVCAVTISSSTQHVYTGGK-GCVKVWDV-GQPGSKTPVAQLDClnrdnyIRSCKLLPDGQSLIVGGEASTLSIWDLa 533
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGdGTIKVWDLeTGELLRTLKGHTGP------VRDVAASADGTYLASGSSDKTIRLWDL- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 534 aPTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCIDISDYGTRLWTGGLDNTV 613
Cdd:cd00200 81 -ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTI 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 614 RCWDLREGR---QLQQHDfsSQIFSLGHCPNQDWLAVGMESSHVEVLHVRKPE-KYQLRLHESCVLSLKFASCGRWFVST 689
Cdd:cd00200 160 KLWDLRTGKcvaTLTGHT--GEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASG 237
|
250
....*....|.
gi 1039732843 690 GKDNLLNAWRT 700
Cdd:cd00200 238 SEDGTIRVWDL 248
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
506-700 |
1.32e-19 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 91.90 E-value: 1.32e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 506 IRSCKLLPDGQSLIVGGEASTLSIWDLAAPTPRIKAELTSSAPacYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQF 585
Cdd:COG2319 39 VASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAV--LSVAFSPDGRLLASASADGTVRLWDLATGLLLRTL 116
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 586 QGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREGRQLQQ-HDFSSQIFSLGHCPNQDWLAVGMESSHVEVLHVRKPEK 664
Cdd:COG2319 117 TGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTlTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKL 196
|
170 180 190
....*....|....*....|....*....|....*..
gi 1039732843 665 YQ-LRLHESCVLSLKFASCGRWFVSTGKDNLLNAWRT 700
Cdd:COG2319 197 LRtLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDL 233
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
449-578 |
2.36e-12 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 69.55 E-value: 2.36e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 449 RQLHTLA-HGEVVCAVTISSSTQHVYTGGKGC-VKVWDVGqpgSKTPVAQLDclNRDNYIRSCKLLPDGQSLIVGGEAST 526
Cdd:COG2319 279 ELLRTLTgHSGGVNSVAFSPDGKLLASGSDDGtVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGT 353
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 1039732843 527 LSIWDLAapTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQN 578
Cdd:COG2319 354 VRLWDLA--TGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
581-703 |
5.66e-11 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 64.28 E-value: 5.66e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 581 MVRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREG---RQLQQHdfSSQIFSLGHCPNQDWLAVGMESSHVEVL 657
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellRTLKGH--TGPVRDVAASADGTYLASGSSDKTIRLW 78
|
90 100 110 120
....*....|....*....|....*....|....*....|....*..
gi 1039732843 658 HVRKPEK-YQLRLHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYG 703
Cdd:cd00200 79 DLETGECvRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETG 125
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
511-710 |
6.25e-11 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 65.32 E-value: 6.25e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 511 LLPDGQSLIVGGEASTLSIWDLAAPTPRIKAELTSSAPAcyALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTD 590
Cdd:COG2319 2 LSADGAALAAASADLALALLAAALGALLLLLLGLAAAVA--SLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTA 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 591 GASCIDISDYGTRLWTGGLDNTVRCWDLREGRQLQQ-HDFSSQIFSLGHCPNQDWLAVGMESSHVEVLHVRKPEK-YQLR 668
Cdd:COG2319 80 AVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTlTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLlRTLT 159
|
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 1039732843 669 LHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYGASIFQVT 710
Cdd:COG2319 160 GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT 201
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
582-617 |
9.91e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 45.77 E-value: 9.91e-07
10 20 30
....*....|....*....|....*....|....*.
gi 1039732843 582 VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWD 617
Cdd:smart00320 5 LKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
245-435 |
1.11e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 52.63 E-value: 1.11e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 245 PSEPPSPVT--TPCGKAPLCIPARRDLTDSPASLASSLGSPLPRSKDIALNDLPTGTPASRSCGTSPPQDSSTPGPSSAS 322
Cdd:PHA03247 2739 PAPPAVPAGpaTPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL 2818
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 323 HLCQLAAQPAAPTDSIALRSPLTLSSPFTSSFSLGSHSTLNGDLSMPGSyvglhlspqvsssvvYGRSPLQMAFESHPHL 402
Cdd:PHA03247 2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPP---------------SRSPAAKPAAPARPPV 2883
|
170 180 190
....*....|....*....|....*....|...
gi 1039732843 403 RgssvSLPGIPVAKPAYSFHVSADGQmQPVPFP 435
Cdd:PHA03247 2884 R----RLARPAVSRSTESFALPPDQP-ERPPQP 2911
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
582-617 |
1.33e-05 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 42.72 E-value: 1.33e-05
10 20 30
....*....|....*....|....*....|....*.
gi 1039732843 582 VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWD 617
Cdd:pfam00400 4 LKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
511-618 |
1.38e-03 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 41.22 E-value: 1.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732843 511 LLPDGQSLIVGGEAS-TLSIWDLAapTPRIKAEL-TSSAPacYALAVSPDAKVCFSCCSDGN-----IVVWDLQNQAMVR 583
Cdd:COG3391 117 VDPDGGRLYVADSGNgRVSVIDTA--TGKVVATIpVGAGP--HGIAVDPDGKRLYVANSGSNtvsviVSVIDTATGKVVA 192
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 1039732843 584 QFQGHtDGASCIDISDYGTRLW--------TGGLDNTVRCWDL 618
Cdd:COG3391 193 TIPVG-GGPVGVAVSPDGRRLYvanrgsntSNGGSNTVSVIDL 234
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
536-575 |
2.22e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.52 E-value: 2.22e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1039732843 536 TPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWD 575
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
|