|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
17-152 |
2.12e-80 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 253.89 E-value: 2.12e-80
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 17 FKFSVLEICDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMaaphqcpqggtsyphwprlsplqYYEMSYGLNIE 96
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVM-----------------------YYEMSYGLNIE 57
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 755534466 97 MHKQAEIVKRLSAICAQMVPFLTQEHQQQVLQAVDRAKQVTVGELNSLLGQQNQLQ 152
Cdd:pfam03920 58 MHKQTEIAKRLNAICAQVIPFLSQEHQQQVAQAVERAKQVTMAELNAIIGQQQQLQ 113
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
472-724 |
5.34e-36 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 141.20 E-value: 5.34e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 472 RQLHTL-AHGEVVCAVTISSSTQHVYTGGK-GCVKVWDVGqpgSKTPVAQLDclNRDNYIRSCKLLPDGQSLIVGGEAST 549
Cdd:COG2319 153 KLLRTLtGHSGAVTSVAFSPDGKLLASGSDdGTVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKLLASGSADGT 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 550 LSIWDLAapTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCIDISDYGTRLWT 629
Cdd:COG2319 228 VRLWDLA--TGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 630 GGLDNTVRCWDLREGRQLQQHD-FSSQIFSLGHCPNQDWLAVGMESSHVEVLHVR-KPEKYQLRLHESCVLSLKFASCGR 707
Cdd:COG2319 306 GSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLAtGELLRTLTGHTGAVTSVAFSPDGR 385
|
250
....*....|....*..
gi 755534466 708 WFVSTGKDNLLNAWRTP 724
Cdd:COG2319 386 TLASGSADGTVRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
465-722 |
2.40e-33 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 130.15 E-value: 2.40e-33
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 465 TGIPRHARQLHTlahGEVVCAVTISSSTQhVYTGGK-GCVKVWDVGQPgsktpvaqlDCLNR----DNYIRSCKLLPDGQ 539
Cdd:cd00200 40 TGELLRTLKGHT---GPVRDVAASADGTY-LASGSSdKTIRLWDLETG---------ECVRTltghTSYVSSVAFSPDGR 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 540 SLIVGGEASTLSIWDLaaPTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCID 619
Cdd:cd00200 107 ILSSSSRDKTIKVWDV--ETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVA 184
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 620 ISDYGTRLWTGGLDNTVRCWDLREGRQLQQHD-FSSQIFSLGHCPNQDWLAVGMESSHVEVLHVRKPE-KYQLRLHESCV 697
Cdd:cd00200 185 FSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRgHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGEcVQTLSGHTNSV 264
|
250 260
....*....|....*....|....*
gi 755534466 698 LSLKFASCGRWFVSTGKDNLLNAWR 722
Cdd:cd00200 265 TSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
269-458 |
5.50e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.79 E-value: 5.50e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 269 PSEPPSPVT--TPCGKAPLCIPARRDLTDSPASLASSLGSPLPRSKDIALNDLPTGTPASRSCGTSPPQDSSTPGPSSAS 346
Cdd:PHA03247 2739 PAPPAVPAGpaTPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL 2818
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 347 HLCQLAAQPAAPTDSIALRSPLTLSSPFTSSFSLGSHSTLNGDLSMPGSyvglhlspqvsssvvyGRSPLMAFESHPHLR 426
Cdd:PHA03247 2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPP----------------SRSPAAKPAAPARPP 2882
|
170 180 190
....*....|....*....|....*....|..
gi 755534466 427 GSSVSLPgiPVAKPAYSFHVSADGQmQPVPFP 458
Cdd:PHA03247 2883 VRRLARP--AVSRSTESFALPPDQP-ERPPQP 2911
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
605-640 |
9.99e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 45.77 E-value: 9.99e-07
10 20 30
....*....|....*....|....*....|....*.
gi 755534466 605 VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWD 640
Cdd:smart00320 5 LKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
605-640 |
1.30e-05 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 42.72 E-value: 1.30e-05
10 20 30
....*....|....*....|....*....|....*.
gi 755534466 605 VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWD 640
Cdd:pfam00400 4 LKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
17-152 |
2.12e-80 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 253.89 E-value: 2.12e-80
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 17 FKFSVLEICDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMaaphqcpqggtsyphwprlsplqYYEMSYGLNIE 96
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVM-----------------------YYEMSYGLNIE 57
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 755534466 97 MHKQAEIVKRLSAICAQMVPFLTQEHQQQVLQAVDRAKQVTVGELNSLLGQQNQLQ 152
Cdd:pfam03920 58 MHKQTEIAKRLNAICAQVIPFLSQEHQQQVAQAVERAKQVTMAELNAIIGQQQQLQ 113
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
472-724 |
5.34e-36 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 141.20 E-value: 5.34e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 472 RQLHTL-AHGEVVCAVTISSSTQHVYTGGK-GCVKVWDVGqpgSKTPVAQLDclNRDNYIRSCKLLPDGQSLIVGGEAST 549
Cdd:COG2319 153 KLLRTLtGHSGAVTSVAFSPDGKLLASGSDdGTVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKLLASGSADGT 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 550 LSIWDLAapTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCIDISDYGTRLWT 629
Cdd:COG2319 228 VRLWDLA--TGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 630 GGLDNTVRCWDLREGRQLQQHD-FSSQIFSLGHCPNQDWLAVGMESSHVEVLHVR-KPEKYQLRLHESCVLSLKFASCGR 707
Cdd:COG2319 306 GSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLAtGELLRTLTGHTGAVTSVAFSPDGR 385
|
250
....*....|....*..
gi 755534466 708 WFVSTGKDNLLNAWRTP 724
Cdd:COG2319 386 TLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
425-726 |
4.59e-35 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 138.12 E-value: 4.59e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 425 LRGSSVSLPGIPVAKPAYSFHVSADGQMQPVPFPSDALVGTGIPRHARQLHTLAHGEVVCAVTISSSTQHVYTGGK-GCV 503
Cdd:COG2319 65 AAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSAdGTV 144
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 504 KVWDVGqpgSKTPVAQLDclNRDNYIRSCKLLPDGQSLIVGGEASTLSIWDLAapTPRIKAELTSSAPACYALAVSPDAK 583
Cdd:COG2319 145 RLWDLA---TGKLLRTLT--GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLA--TGKLLRTLTGHTGAVRSVAFSPDGK 217
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 584 VCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREGRQLQQ-HDFSSQIFSLGHC 662
Cdd:COG2319 218 LLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTlTGHSGGVNSVAFS 297
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 755534466 663 PNQDWLAVGMESSHVEVLHVRKPEK-YQLRLHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYG 726
Cdd:COG2319 298 PDGKLLASGSDDGTVRLWDLATGKLlRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATG 362
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
465-722 |
2.40e-33 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 130.15 E-value: 2.40e-33
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 465 TGIPRHARQLHTlahGEVVCAVTISSSTQhVYTGGK-GCVKVWDVGQPgsktpvaqlDCLNR----DNYIRSCKLLPDGQ 539
Cdd:cd00200 40 TGELLRTLKGHT---GPVRDVAASADGTY-LASGSSdKTIRLWDLETG---------ECVRTltghTSYVSSVAFSPDGR 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 540 SLIVGGEASTLSIWDLaaPTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCID 619
Cdd:cd00200 107 ILSSSSRDKTIKVWDV--ETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVA 184
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 620 ISDYGTRLWTGGLDNTVRCWDLREGRQLQQHD-FSSQIFSLGHCPNQDWLAVGMESSHVEVLHVRKPE-KYQLRLHESCV 697
Cdd:cd00200 185 FSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRgHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGEcVQTLSGHTNSV 264
|
250 260
....*....|....*....|....*
gi 755534466 698 LSLKFASCGRWFVSTGKDNLLNAWR 722
Cdd:cd00200 265 TSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
446-733 |
3.50e-32 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 129.65 E-value: 3.50e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 446 VSADGQMQPVPFPSDALVGTGIPRHARQLHTLAHGEVVCAVTISSSTQHVYTGGK-GCVKVWDVgqpgsKTPVAQLDCLN 524
Cdd:COG2319 44 ASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASAdGTVRLWDL-----ATGLLLRTLTG 118
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 525 RDNYIRSCKLLPDGQSLIVGGEASTLSIWDLAapTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQAM 604
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSADGTVRLWDLA--TGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKL 196
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 605 VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREGRQLQQ-HDFSSQIFSLGHCPNQDWLAVGMESSHVEVLHVR 683
Cdd:COG2319 197 LRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTlTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLA 276
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 755534466 684 -KPEKYQLRLHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYGASIFQVT 733
Cdd:COG2319 277 tGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT 327
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
479-723 |
3.47e-31 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 123.98 E-value: 3.47e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 479 HGEVVCAVTISSSTQHVYTGGK-GCVKVWDV-GQPGSKTPVAQLDClnrdnyIRSCKLLPDGQSLIVGGEASTLSIWDLa 556
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGdGTIKVWDLeTGELLRTLKGHTGP------VRDVAASADGTYLASGSSDKTIRLWDL- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 557 aPTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTDGASCIDISDYGTRLWTGGLDNTV 636
Cdd:cd00200 81 -ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTI 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 637 RCWDLREGR---QLQQHDfsSQIFSLGHCPNQDWLAVGMESSHVEVLHVRKPE-KYQLRLHESCVLSLKFASCGRWFVST 712
Cdd:cd00200 160 KLWDLRTGKcvaTLTGHT--GEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASG 237
|
250
....*....|.
gi 755534466 713 GKDNLLNAWRT 723
Cdd:cd00200 238 SEDGTIRVWDL 248
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
529-723 |
1.38e-19 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 91.90 E-value: 1.38e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 529 IRSCKLLPDGQSLIVGGEASTLSIWDLAAPTPRIKAELTSSAPacYALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQF 608
Cdd:COG2319 39 VASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAV--LSVAFSPDGRLLASASADGTVRLWDLATGLLLRTL 116
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 609 QGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREGRQLQQ-HDFSSQIFSLGHCPNQDWLAVGMESSHVEVLHVRKPEK 687
Cdd:COG2319 117 TGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTlTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKL 196
|
170 180 190
....*....|....*....|....*....|....*..
gi 755534466 688 YQ-LRLHESCVLSLKFASCGRWFVSTGKDNLLNAWRT 723
Cdd:COG2319 197 LRtLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDL 233
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
472-601 |
2.39e-12 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 69.55 E-value: 2.39e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 472 RQLHTLA-HGEVVCAVTISSSTQHVYTGGKGC-VKVWDVGqpgSKTPVAQLDclNRDNYIRSCKLLPDGQSLIVGGEAST 549
Cdd:COG2319 279 ELLRTLTgHSGGVNSVAFSPDGKLLASGSDDGtVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGT 353
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 755534466 550 LSIWDLAapTPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWDLQN 601
Cdd:COG2319 354 VRLWDLA--TGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
604-726 |
5.40e-11 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 64.28 E-value: 5.40e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 604 MVRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWDLREG---RQLQQHdfSSQIFSLGHCPNQDWLAVGMESSHVEVL 680
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellRTLKGH--TGPVRDVAASADGTYLASGSSDKTIRLW 78
|
90 100 110 120
....*....|....*....|....*....|....*....|....*..
gi 755534466 681 HVRKPEK-YQLRLHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYG 726
Cdd:cd00200 79 DLETGECvRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETG 125
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
534-733 |
6.55e-11 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 65.32 E-value: 6.55e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 534 LLPDGQSLIVGGEASTLSIWDLAAPTPRIKAELTSSAPAcyALAVSPDAKVCFSCCSDGNIVVWDLQNQAMVRQFQGHTD 613
Cdd:COG2319 2 LSADGAALAAASADLALALLAAALGALLLLLLGLAAAVA--SLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTA 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 614 GASCIDISDYGTRLWTGGLDNTVRCWDLREGRQLQQ-HDFSSQIFSLGHCPNQDWLAVGMESSHVEVLHVRKPEK-YQLR 691
Cdd:COG2319 80 AVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTlTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLlRTLT 159
|
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 755534466 692 LHESCVLSLKFASCGRWFVSTGKDNLLNAWRTPYGASIFQVT 733
Cdd:COG2319 160 GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT 201
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
269-458 |
5.50e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.79 E-value: 5.50e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 269 PSEPPSPVT--TPCGKAPLCIPARRDLTDSPASLASSLGSPLPRSKDIALNDLPTGTPASRSCGTSPPQDSSTPGPSSAS 346
Cdd:PHA03247 2739 PAPPAVPAGpaTPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL 2818
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 347 HLCQLAAQPAAPTDSIALRSPLTLSSPFTSSFSLGSHSTLNGDLSMPGSyvglhlspqvsssvvyGRSPLMAFESHPHLR 426
Cdd:PHA03247 2819 PPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPP----------------SRSPAAKPAAPARPP 2882
|
170 180 190
....*....|....*....|....*....|..
gi 755534466 427 GSSVSLPgiPVAKPAYSFHVSADGQmQPVPFP 458
Cdd:PHA03247 2883 VRRLARP--AVSRSTESFALPPDQP-ERPPQP 2911
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
605-640 |
9.99e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 45.77 E-value: 9.99e-07
10 20 30
....*....|....*....|....*....|....*.
gi 755534466 605 VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWD 640
Cdd:smart00320 5 LKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
605-640 |
1.30e-05 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 42.72 E-value: 1.30e-05
10 20 30
....*....|....*....|....*....|....*.
gi 755534466 605 VRQFQGHTDGASCIDISDYGTRLWTGGLDNTVRCWD 640
Cdd:pfam00400 4 LKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
534-641 |
1.42e-03 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 41.22 E-value: 1.42e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 534 LLPDGQSLIVGGEAS-TLSIWDLAapTPRIKAEL-TSSAPacYALAVSPDAKVCFSCCSDGN-----IVVWDLQNQAMVR 606
Cdd:COG3391 117 VDPDGGRLYVADSGNgRVSVIDTA--TGKVVATIpVGAGP--HGIAVDPDGKRLYVANSGSNtvsviVSVIDTATGKVVA 192
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 755534466 607 QFQGHtDGASCIDISDYGTRLW--------TGGLDNTVRCWDL 641
Cdd:COG3391 193 TIPVG-GGPVGVAVSPDGRRLYvanrgsntSNGGSNTVSVIDL 234
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
559-598 |
2.26e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.52 E-value: 2.26e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 755534466 559 TPRIKAELTSSAPACYALAVSPDAKVCFSCCSDGNIVVWD 598
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
222-441 |
5.05e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 40.54 E-value: 5.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 222 ESLVEEDHPSSRGGSGKQQRAEDkDLSGP----YDSEEDKSDyNLVVDEDQPSEPPSPVTTPCGKAPLCIPARRDLTDSP 297
Cdd:PHA03307 12 EAAAEGGEFFPRPPATPGDAADD-LLSGSqgqlVSDSAELAA-VTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPT 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 298 ASLAS-SLGSPLPRSKDIALNDLPTGTPASR----SCGTSPPQDSSTPGPSSASHLCQLAAQPAAPTDSIAlRSPLTLSS 372
Cdd:PHA03307 90 WSLSTlAPASPAREGSPTPPGPSSPDPPPPTpppaSPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPA-AVASDAAS 168
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755534466 373 PFTSSFSlgshstlngdLSMPGSYVglHLSPQVSSSVVyGRSPLMAFESHPHLRGSSVSLP-GIPVAKPA 441
Cdd:PHA03307 169 SRQAALP----------LSSPEETA--RAPSSPPAEPP-PSTPPAAASPRPPRRSSPISASaSSPAPAPG 225
|
|
|