| Name | Accession | Description | Interval | E-value |
| WD40 super family | cl29593 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 725-1063 | 1.74e-34 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment. The actual alignment was detected with superfamily member cd00200:
Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 134.77 E-value: 1.74e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 725 GHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWDWK 804
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKGHTG-PVRDVAASADGTYLASGSSD--KTIRLWDLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 805 KGEKLSIARGSKDKIFVVKMNPYvpDKLITAGIKH--MKFWRKAGGGLIGRkgyigTLGKNDTMMCAVYGWTEEMAFSGT 882
Cdd:cd00200 82 TGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVETGKCLTT-----LRGHTDWVNSVAFSPDGTFVASSS 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 883 STGDVCIW--RDIFLVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYaikraalapgskgllledn 958
Cdd:cd00200 155 QDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTL------------------- 215
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 959 psiraislghghilvgtkngeilevdksgpitllvQGHmEGEVWGLATHPYLPICATVSDDKTLRIWDLSPSHCMLAVRK 1038
Cdd:cd00200 216 -----------------------------------RGH-ENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG 259
|
330 340
....*....|....*....|....*
gi 1867163826 1039 LKKGGRCCCFSPDGKALAVGLNDGS 1063
Cdd:cd00200 260 HTNSVTSLAWSPDGKRLASGSADGT 284
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 58-438 | 9.65e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 126.95 E-value: 9.65e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 58 RGHSDDIISLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKDvHTHGIACLAFDLDGQRLVSVGLDskNAVCVWDW 137
Cdd:COG2319 75 LGHTAAVLSVAFSPDGRLLASASADGT--VRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGSAD--GTVRLWDL 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 138 KRGKMLSMAPGHTDRIFDISWDlyqPNklvscgvkhikfwslcgnaltpkrgvfGKTgdlqtilcLAcardeltySGALN 217
Cdd:COG2319 150 ATGKLLRTLTGHSGAVTSVAFS---PD---------------------------GKL--------LA--------SGSDD 183
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 218 GDIYVW--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIDLRETdqgykglSVRSVCWR 292
Cdd:COG2319 184 GTVRLWdlATGKLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLaTGKLLRTLTGHSG-------SVRSVAFS 255
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 293 --GDHILVGTQDSEIFeIVVQERNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCN-MEEPI 369
Cdd:COG2319 256 pdGRLLASGSADGTVR-LWDLATGELLRTLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTgHTGAV 333
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1867163826 370 RCAAVNADGIHLALGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSPDGTYLAVGCNDSSVDIYGVA 438
Cdd:COG2319 334 RSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 1425-1812 | 1.28e-30 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 126.56 E-value: 1.28e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1425 LTVNQHPKFINIVATGQVGDSADMSATAPSIHIWDAMNKQTLSILRcYHSKGVCSVSFSATGKLLLSVGLDpeHTITIWR 1504
Cdd:COG2319 72 ATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLT-GHTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1505 WQEGAKIASRAGHNQRIFVAEFRPDSdTQFVSVGV-KHVKFWTLAGRALLSkkgllsTLEdARMQTMLAIAFGANNLTF- 1582
Cdd:COG2319 149 LATGKLLRTLTGHSGAVTSVAFSPDG-KLLASGSDdGTVRLWDLATGKLLR------TLT-GHTGAVRSVAFSPDGKLLa 220
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1583 TGTISGDVCVWK-DHILCRIVARAHNGPVFAMyTTLRDG-LIVTGGkerpskEGGAVKLWDqelrrcrafrLETGQatdC 1660
Cdd:COG2319 221 SGSADGTVRLWDlATGKLLRTLTGHSGSVRSV-AFSPDGrLLASGS------ADGTVRLWD----------LATGE---L 280
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1661 VRSVcrgkgkilvgtrnaeiievgeknaacnilvnGHVDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlG 1740
Cdd:COG2319 281 LRTL-------------------------------TGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT-G 328
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1867163826 1741 HAA--RTVCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCAIHDIRFSPDSRYLAVGSSENSVDFYDLT 1812
Cdd:COG2319 329 HTGavRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 895-1266 | 1.93e-23 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 104.99 E-value: 1.93e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 895 LVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTY-----AIKRAALAPGSKglllednpsiraislg 967
Cdd:COG2319 70 LLATLLGHTAAVLSVAFSPDGrlLASASADGTVRLWDLATGLLLRTLtghtgAVRSVAFSPDGK---------------- 133
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 968 hgHILVGTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHP---YLpicATVSDDKTLRIWDLSPSHCMLAVRKLKKGG 1043
Cdd:COG2319 134 --TLASGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGKLLRTLTGHTGAV 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1044 RCCCFSPDGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFIDIYNVMSSKRVGICKG 1123
Cdd:COG2319 208 RSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSP-DGRLLASGSADGTVRLWDLATGELLRTLTG 286
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1124 ATSYITHIDWDIRGKLLqvntgakeqlffeAPRGKKQTIpsveveKIawasWTSVLGLCcegIWPVIGEVTDVTASCLTS 1203
Cdd:COG2319 287 HSGGVNSVAFSPDGKLL-------------ASGSDDGTV------RL----WDLATGKL---LRTLTGHTGAVRSVAFSP 340
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1867163826 1204 DKMVLATGDDLGFVKLFRYPTKGKFGKFKryvAHSTHVTNVRWTYDDSMLVTlGGTDMSLMVW 1266
Cdd:COG2319 341 DGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
|
|
| HELP | pfam03451 | HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... | 2-49 | 7.07e-19 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons. :
Pssm-ID: 460922 Cd Length: 72 Bit Score: 82.60 E-value: 7.07e-19
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 1867163826 2 AARSAPSCHLRLEWVYGYRGHQCRNNLYYTAAKEIVYFVAGVGVVYSP 49
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| WD40 super family | cl29593 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 1694-1967 | 2.87e-18 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment. The actual alignment was detected with superfamily member cd00200:
Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 87.39 E-value: 2.87e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1694 VNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLnKVNLGHAA--RTVCYSPEGDMVAIGMKNGefiillvsS 1771
Cdd:cd00200 5 LKGH-TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELL-RTLKGHTGpvRDVAASADGTYLASGSSDK--------T 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1772 LKIWgKKRDRRC---------AIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRISYCKDipsFVIQMDFSADSSYl 1842
Cdd:cd00200 75 IRLW-DLETGECvrtltghtsYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD---WVNSVAFSPDGTF- 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1843 qVSSGCYKR--HVYEVPSGKHLMDHaaidritwatwtsilgdevlgiwsrHAEKADVNCACVSHSGISLVTGDDFGMVKL 1920
Cdd:cd00200 150 -VASSSQDGtiKLWDLRTGKCVATL-------------------------TGHTGEVNSVAFSPDGEKLLSSSSDGTIKL 203
|
250 260 270 280
....*....|....*....|....*....|....*....|....*..
gi 1867163826 1921 FDfpcPEKFAKHKRFLGHSPHVTNIRFtSGDRHVVSAGGDDCSLFVW 1967
Cdd:cd00200 204 WD---LSTGKCLGTLRGHENGVNSVAF-SPDGYLLASGSEDGTIRVW 246
|
|
| HELP | pfam03451 | HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... | 672-715 | 9.08e-18 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons. :
Pssm-ID: 460922 Cd Length: 72 Bit Score: 79.13 E-value: 9.08e-18
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 1867163826 672 APGNSIRLHFVHGYRGYDCRSNLFYTQIGEIVYHVAAVGVIYNR 715
Cdd:pfam03451 29 PPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| HELP | pfam03451 | HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... | 1335-1407 | 2.67e-15 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons. :
Pssm-ID: 460922 Cd Length: 72 Bit Score: 72.20 E-value: 2.67e-15
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1867163826 1335 RQGVVRPPVSRAPPqpeklqTNNVGKKKRPIEDLVLELIFGYRGRDCRNNVHYLNDGdDIIYHTASVGILHNV 1407
Cdd:pfam03451 7 RPGAVYPPSNYYPK------DDLDQKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYDV 72
|
|
|
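The hits in the first table above are listed by E-value rather than by position on the query. The following is a minimal Python sketch, assuming the intervals and E-values are copied by hand from that table; the `Hit` type and the helper names `by_position` and `overlapping_pairs` are illustrative and not part of any NCBI tool. It orders the hits along the sequence and reports which intervals overlap.

```python
from itertools import combinations
from typing import List, NamedTuple, Tuple


class Hit(NamedTuple):
    """One row of the hit table (hypothetical container, for illustration only)."""
    name: str
    accession: str
    start: int
    end: int
    evalue: float


# Values transcribed from the first table above.
HITS = [
    Hit("WD40 super family", "cl29593", 725, 1063, 1.74e-34),
    Hit("WD40", "COG2319", 58, 438, 9.65e-31),
    Hit("WD40", "COG2319", 1425, 1812, 1.28e-30),
    Hit("WD40", "COG2319", 895, 1266, 1.93e-23),
    Hit("HELP", "pfam03451", 2, 49, 7.07e-19),
    Hit("WD40 super family", "cl29593", 1694, 1967, 2.87e-18),
    Hit("HELP", "pfam03451", 672, 715, 9.08e-18),
    Hit("HELP", "pfam03451", 1335, 1407, 2.67e-15),
]


def by_position(hits: List[Hit]) -> List[Hit]:
    """Return the hits ordered by start coordinate on the query."""
    return sorted(hits, key=lambda h: h.start)


def overlapping_pairs(hits: List[Hit]) -> List[Tuple[Hit, Hit]]:
    """Return every pair of hits whose inclusive intervals share a residue."""
    ordered = by_position(hits)
    return [
        (a, b)
        for a, b in combinations(ordered, 2)
        if a.start <= b.end and b.start <= a.end
    ]


if __name__ == "__main__":
    for h in by_position(HITS):
        print(f"{h.start:>5}-{h.end:<5} {h.name:<18} {h.accession:<10} {h.evalue:.2e}")
    for a, b in overlapping_pairs(HITS):
        print(f"overlap: {a.start}-{a.end} ({a.accession}) / {b.start}-{b.end} ({b.accession})")
```

Sorted this way, each HELP hit (2-49, 672-715, 1335-1407) is immediately followed by a WD40 hit, which is consistent with the pfam03451 description of EMAP-like proteins as an amino-terminal HELP motif followed by a WD domain.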
| Name | Accession | Description | Interval | E-value |
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 725-1063 | 1.74e-34 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 134.77 E-value: 1.74e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 725 GHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWDWK 804
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKGHTG-PVRDVAASADGTYLASGSSD--KTIRLWDLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 805 KGEKLSIARGSKDKIFVVKMNPYvpDKLITAGIKH--MKFWRKAGGGLIGRkgyigTLGKNDTMMCAVYGWTEEMAFSGT 882
Cdd:cd00200 82 TGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVETGKCLTT-----LRGHTDWVNSVAFSPDGTFVASSS 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 883 STGDVCIW--RDIFLVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYaikraalapgskgllledn 958
Cdd:cd00200 155 QDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTL------------------- 215
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 959 psiraislghghilvgtkngeilevdksgpitllvQGHmEGEVWGLATHPYLPICATVSDDKTLRIWDLSPSHCMLAVRK 1038
Cdd:cd00200 216 -----------------------------------RGH-ENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG 259
|
330 340
....*....|....*....|....*
gi 1867163826 1039 LKKGGRCCCFSPDGKALAVGLNDGS 1063
Cdd:cd00200 260 HTNSVTSLAWSPDGKRLASGSADGT 284
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 720-1112 | 6.02e-34 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 136.58 E-value: 6.02e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 720 QRFYLGHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQYgVSAVDFSADGKRLASVGIDdsHTVV 799
Cdd:COG2319 71 LATLLGHTAAVLSVAFSPDGRLLASA--SADGTVRLWDLATGLLLRTLTGHTGA-VRSVAFSPDGKTLASGSAD--GTVR 145
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 800 LWDWKKGEKLSIARGSKDKIFVVKMNPyvpD--KLITAGI-KHMKFWRKAGGGLIgrkgyigtlgkndtmmcavygwtee 876
Cdd:COG2319 146 LWDLATGKLLRTLTGHSGAVTSVAFSP---DgkLLASGSDdGTVRLWDLATGKLL------------------------- 197
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 877 mafsgtstgdvciwrdiflvKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYAikraalapgskgll 954
Cdd:COG2319 198 --------------------RTLTGHTGAVRSVAFSPDGklLASGSADGTVRLWDLATGKLLRTLT-------------- 243
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 955 lEDNPSIRAISLGH-GHILV-GTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHP---YLpicATVSDDKTLRIWDLS 1028
Cdd:COG2319 244 -GHSGSVRSVAFSPdGRLLAsGSADGTVRLWDlATGELLRTLTGH-SGGVNSVAFSPdgkLL---ASGSDDGTVRLWDLA 318
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1029 PSHCMLAVRKLKKGGRCCCFSPDGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFID 1108
Cdd:COG2319 319 TGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSP-DGRTLASGSADGTVR 397
|
....
gi 1867163826 1109 IYNV 1112
Cdd:COG2319 398 LWDL 401
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 58-438 | 9.65e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 126.95 E-value: 9.65e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 58 RGHSDDIISLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKDvHTHGIACLAFDLDGQRLVSVGLDskNAVCVWDW 137
Cdd:COG2319 75 LGHTAAVLSVAFSPDGRLLASASADGT--VRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGSAD--GTVRLWDL 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 138 KRGKMLSMAPGHTDRIFDISWDlyqPNklvscgvkhikfwslcgnaltpkrgvfGKTgdlqtilcLAcardeltySGALN 217
Cdd:COG2319 150 ATGKLLRTLTGHSGAVTSVAFS---PD---------------------------GKL--------LA--------SGSDD 183
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 218 GDIYVW--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIDLRETdqgykglSVRSVCWR 292
Cdd:COG2319 184 GTVRLWdlATGKLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLaTGKLLRTLTGHSG-------SVRSVAFS 255
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 293 --GDHILVGTQDSEIFeIVVQERNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCN-MEEPI 369
Cdd:COG2319 256 pdGRLLASGSADGTVR-LWDLATGELLRTLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTgHTGAV 333
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1867163826 370 RCAAVNADGIHLALGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSPDGTYLAVGCNDSSVDIYGVA 438
Cdd:COG2319 334 RSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 1425-1812 | 1.28e-30 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 126.56 E-value: 1.28e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1425 LTVNQHPKFINIVATGQVGDSADMSATAPSIHIWDAMNKQTLSILRcYHSKGVCSVSFSATGKLLLSVGLDpeHTITIWR 1504
Cdd:COG2319 72 ATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLT-GHTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1505 WQEGAKIASRAGHNQRIFVAEFRPDSdTQFVSVGV-KHVKFWTLAGRALLSkkgllsTLEdARMQTMLAIAFGANNLTF- 1582
Cdd:COG2319 149 LATGKLLRTLTGHSGAVTSVAFSPDG-KLLASGSDdGTVRLWDLATGKLLR------TLT-GHTGAVRSVAFSPDGKLLa 220
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1583 TGTISGDVCVWK-DHILCRIVARAHNGPVFAMyTTLRDG-LIVTGGkerpskEGGAVKLWDqelrrcrafrLETGQatdC 1660
Cdd:COG2319 221 SGSADGTVRLWDlATGKLLRTLTGHSGSVRSV-AFSPDGrLLASGS------ADGTVRLWD----------LATGE---L 280
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1661 VRSVcrgkgkilvgtrnaeiievgeknaacnilvnGHVDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlG 1740
Cdd:COG2319 281 LRTL-------------------------------TGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT-G 328
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1867163826 1741 HAA--RTVCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCAIHDIRFSPDSRYLAVGSSENSVDFYDLT 1812
Cdd:COG2319 329 HTGavRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 57-353 | 4.87e-30 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 122.06 E-value: 4.87e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 57 YRGHSDDIISLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKdVHTHGIACLAFDLDGQRLVSVGLDskNAVCVWD 136
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSSD--KTIRLWD 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 137 WKRGKMLSMAPGHTDRIFDISWDlyqPNK--LVSCGV-KHIKFWSLcgNALTPKRGVFGKTGDlqtILCLACARDE-LTY 212
Cdd:cd00200 80 LETGECVRTLTGHTSYVSSVAFS---PDGriLSSSSRdKTIKVWDV--ETGKCLTTLRGHTDW---VNSVAFSPDGtFVA 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 213 SGALNGDIYVW--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIDlretdqgYKGLSVR 287
Cdd:cd00200 152 SSSQDGTIKLWdlRTGKCVATLTG-HTGEVNSVAFSPDGekLLSSSSDGTIKLWDLsTGKCLGTLR-------GHENGVN 223
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 288 SVCW--RGDHILVGTQDS--EIFEIVVQERNKPFlimQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 353
Cdd:cd00200 224 SVAFspDGYLLASGSEDGtiRVWDLRTGECVQTL---SGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 895-1266 | 1.93e-23 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 104.99 E-value: 1.93e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 895 LVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTY-----AIKRAALAPGSKglllednpsiraislg 967
Cdd:COG2319 70 LLATLLGHTAAVLSVAFSPDGrlLASASADGTVRLWDLATGLLLRTLtghtgAVRSVAFSPDGK---------------- 133
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 968 hgHILVGTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHP---YLpicATVSDDKTLRIWDLSPSHCMLAVRKLKKGG 1043
Cdd:COG2319 134 --TLASGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGKLLRTLTGHTGAV 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1044 RCCCFSPDGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFIDIYNVMSSKRVGICKG 1123
Cdd:COG2319 208 RSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSP-DGRLLASGSADGTVRLWDLATGELLRTLTG 286
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1124 ATSYITHIDWDIRGKLLqvntgakeqlffeAPRGKKQTIpsveveKIawasWTSVLGLCcegIWPVIGEVTDVTASCLTS 1203
Cdd:COG2319 287 HSGGVNSVAFSPDGKLL-------------ASGSDDGTV------RL----WDLATGKL---LRTLTGHTGAVRSVAFSP 340
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1867163826 1204 DKMVLATGDDLGFVKLFRYPTKGKFGKFKryvAHSTHVTNVRWTYDDSMLVTlGGTDMSLMVW 1266
Cdd:COG2319 341 DGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 1473-1775 | 7.65e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 95.09 E-value: 7.65e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1473 HSKGVCSVSFSATGKLLLSVGLDpeHTITIWRWQEGAKIASRAGHNQRIFVAEFRPDSdTQFVSVGV-KHVKFWTLagra 1551
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADG-TYLASGSSdKTIRLWDL---- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1552 llSKKGLLSTLEDARmQTMLAIAFGANNLTFTGTIS-GDVCVWK-DHILCRIVARAHNGPVFAMyTTLRDGLIVTGgker 1629
Cdd:cd00200 81 --ETGECVRTLTGHT-SYVSSVAFSPDGRILSSSSRdKTIKVWDvETGKCLTTLRGHTDWVNSV-AFSPDGTFVAS---- 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1630 pSKEGGAVKLWD-QELRRCRAFRLETGQatdcVRSVC--RGKGKILVGTRNAEIIEVGEKNAACNILVNGHvDGPIWGLA 1706
Cdd:cd00200 153 -SSQDGTIKLWDlRTGKCVATLTGHTGE----VNSVAfsPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGH-ENGVNSVA 226
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1867163826 1707 THPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlGHAAR--TVCYSPEGDMVAIGMKNGefiillvsSLKIW 1775
Cdd:cd00200 227 FSPDGYLLASGSEDGTIRVWDLRTGECVQTLS-GHTNSvtSLAWSPDGKRLASGSADG--------TIRIW 288
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 895-1140 | 4.70e-20 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 92.78 E-value: 4.70e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 895 LVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYAIKRAALApgskglllednpsiRAISLGHGH-I 971
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGklLATGSGDGTIKVWDLETGELLRTLKGHTGPVR--------------DVAASADGTyL 66
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 972 LVGTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHPYLPICATVSDDKTLRIWDLSPSHCMLAVRKLKKGGRCCCFSP 1050
Cdd:cd00200 67 ASGSSDKTIRLWDlETGECVRTLTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSP 145
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1051 DGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFIDIYNVMSSKRVGICKGATSYITH 1130
Cdd:cd00200 146 DGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSP-DGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNS 224
|
250
....*....|
gi 1867163826 1131 IDWDIRGKLL 1140
Cdd:cd00200 225 VAFSPDGYLL 234
|
|
| HELP | pfam03451 | HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... | 2-49 | 7.07e-19 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 82.60 E-value: 7.07e-19
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 1867163826 2 AARSAPSCHLRLEWVYGYRGHQCRNNLYYTAAKEIVYFVAGVGVVYSP 49
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 1694-1967 | 2.87e-18 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 87.39 E-value: 2.87e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1694 VNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLnKVNLGHAA--RTVCYSPEGDMVAIGMKNGefiillvsS 1771
Cdd:cd00200 5 LKGH-TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELL-RTLKGHTGpvRDVAASADGTYLASGSSDK--------T 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1772 LKIWgKKRDRRC---------AIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRISYCKDipsFVIQMDFSADSSYl 1842
Cdd:cd00200 75 IRLW-DLETGECvrtltghtsYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD---WVNSVAFSPDGTF- 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1843 qVSSGCYKR--HVYEVPSGKHLMDHaaidritwatwtsilgdevlgiwsrHAEKADVNCACVSHSGISLVTGDDFGMVKL 1920
Cdd:cd00200 150 -VASSSQDGtiKLWDLRTGKCVATL-------------------------TGHTGEVNSVAFSPDGEKLLSSSSDGTIKL 203
|
250 260 270 280
....*....|....*....|....*....|....*....|....*..
gi 1867163826 1921 FDfpcPEKFAKHKRFLGHSPHVTNIRFtSGDRHVVSAGGDDCSLFVW 1967
Cdd:cd00200 204 WD---LSTGKCLGTLRGHENGVNSVAF-SPDGYLLASGSEDGTIRVW 246
|
|
| HELP | pfam03451 | HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... | 672-715 | 9.08e-18 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 79.13 E-value: 9.08e-18
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 1867163826 672 APGNSIRLHFVHGYRGYDCRSNLFYTQIGEIVYHVAAVGVIYNR 715
Cdd:pfam03451 29 PPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| HELP | pfam03451 | HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... | 1335-1407 | 2.67e-15 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 72.20 E-value: 2.67e-15
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1867163826 1335 RQGVVRPPVSRAPPqpeklqTNNVGKKKRPIEDLVLELIFGYRGRDCRNNVHYLNDGdDIIYHTASVGILHNV 1407
Cdd:pfam03451 7 RPGAVYPPSNYYPK------DDLDQKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYDV 72
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 1706-1972 | 2.79e-09 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 61.47 E-value: 2.79e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1706 ATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNLGHAARTVC-YSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCA 1784
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLaASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1785 IHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRIsycKDIPSFVIQMDFSADSSYLqVSSGCYKR-HVYEVPSGKhlm 1863
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTL---TGHTGAVRSVAFSPDGKTL-ASGSADGTvRLWDLATGK--- 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1864 dhaaidritwatwtsilgdeVLGIWSRHAEkaDVNCACVSHSGISLVTGDDFGMVKLFDfpcPEKFAKHKRFLGHSPHVT 1943
Cdd:COG2319 154 --------------------LLRTLTGHSG--AVTSVAFSPDGKLLASGSDDGTVRLWD---LATGKLLRTLTGHTGAVR 208
|
250 260 270
....*....|....*....|....*....|....
gi 1867163826 1944 NIRFTSGDRHVVSaGGDDCSLFVW-----KCVHT 1972
Cdd:COG2319 209 SVAFSPDGKLLAS-GSADGTVRLWdlatgKLLRT 241
|
|
| ANAPC4_WD40 | pfam12894 | Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ... | 1746-1828 | 1.66e-05 |
|
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain. The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,
Pssm-ID: 403945 [Multi-domain] Cd Length: 91 Bit Score: 44.96 E-value: 1.66e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1746 VCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRR-CAIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRISYCK 1824
Cdd:pfam12894 1 MSWCPTMDLIALATEDGELLLHRLNWQRVWTLSPDKEdLEVTSLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFSAGS 80
|
....
gi 1867163826 1825 DIPS 1828
Cdd:pfam12894 81 DLIT 84
|
|
| WD40 | smart00320 | WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... | 760-802 | 4.13e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.60 E-value: 4.13e-04
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 1867163826 760 TIKPLSILKGHHQYgVSAVDFSADGKRLASVGIDdsHTVVLWD 802
Cdd:smart00320 1 SGELLKTLKGHTGP-VTSVAFSPDGKYLASGSDD--GTIKLWD 40
|
|
| WD40 | smart00320 | WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... | 1695-1727 | 4.65e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.60 E-value: 4.65e-04
10 20 30
....*....|....*....|....*....|...
gi 1867163826 1695 NGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWD 1727
Cdd:smart00320 9 KGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 | pfam00400 | WD domain, G-beta repeat; | 314-353 | 1.42e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 38.10 E-value: 1.42e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1867163826 314 NKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 353
Cdd:pfam00400 1 GKLLKTLEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 | smart00320 | WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... | 315-353 | 2.43e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 37.29 E-value: 2.43e-03
10 20 30
....*....|....*....|....*....|....*....
gi 1867163826 315 KPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 353
Cdd:smart00320 3 ELLKTLKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 | pfam00400 | WD domain, G-beta repeat; | 762-802 | 5.47e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 36.55 E-value: 5.47e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1867163826 762 KPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWD 802
Cdd:pfam00400 2 KLLKTLEGHTG-SVTSLAFSPDGKLLASGSDD--GTVKVWD 39
|
|
|
| Name | Accession | Description | Interval | E-value |
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 725-1063 | 1.74e-34 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 134.77 E-value: 1.74e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 725 GHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWDWK 804
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKGHTG-PVRDVAASADGTYLASGSSD--KTIRLWDLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 805 KGEKLSIARGSKDKIFVVKMNPYvpDKLITAGIKH--MKFWRKAGGGLIGRkgyigTLGKNDTMMCAVYGWTEEMAFSGT 882
Cdd:cd00200 82 TGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVETGKCLTT-----LRGHTDWVNSVAFSPDGTFVASSS 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 883 STGDVCIW--RDIFLVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYaikraalapgskgllledn 958
Cdd:cd00200 155 QDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTL------------------- 215
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 959 psiraislghghilvgtkngeilevdksgpitllvQGHmEGEVWGLATHPYLPICATVSDDKTLRIWDLSPSHCMLAVRK 1038
Cdd:cd00200 216 -----------------------------------RGH-ENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG 259
|
330 340
....*....|....*....|....*
gi 1867163826 1039 LKKGGRCCCFSPDGKALAVGLNDGS 1063
Cdd:cd00200 260 HTNSVTSLAWSPDGKRLASGSADGT 284
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 720-1112 | 6.02e-34 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 136.58 E-value: 6.02e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 720 QRFYLGHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQYgVSAVDFSADGKRLASVGIDdsHTVV 799
Cdd:COG2319 71 LATLLGHTAAVLSVAFSPDGRLLASA--SADGTVRLWDLATGLLLRTLTGHTGA-VRSVAFSPDGKTLASGSAD--GTVR 145
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 800 LWDWKKGEKLSIARGSKDKIFVVKMNPyvpD--KLITAGI-KHMKFWRKAGGGLIgrkgyigtlgkndtmmcavygwtee 876
Cdd:COG2319 146 LWDLATGKLLRTLTGHSGAVTSVAFSP---DgkLLASGSDdGTVRLWDLATGKLL------------------------- 197
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 877 mafsgtstgdvciwrdiflvKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYAikraalapgskgll 954
Cdd:COG2319 198 --------------------RTLTGHTGAVRSVAFSPDGklLASGSADGTVRLWDLATGKLLRTLT-------------- 243
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 955 lEDNPSIRAISLGH-GHILV-GTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHP---YLpicATVSDDKTLRIWDLS 1028
Cdd:COG2319 244 -GHSGSVRSVAFSPdGRLLAsGSADGTVRLWDlATGELLRTLTGH-SGGVNSVAFSPdgkLL---ASGSDDGTVRLWDLA 318
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1029 PSHCMLAVRKLKKGGRCCCFSPDGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFID 1108
Cdd:COG2319 319 TGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSP-DGRTLASGSADGTVR 397
|
....
gi 1867163826 1109 IYNV 1112
Cdd:COG2319 398 LWDL 401
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 58-438 | 9.65e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 126.95 E-value: 9.65e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 58 RGHSDDIISLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKDvHTHGIACLAFDLDGQRLVSVGLDskNAVCVWDW 137
Cdd:COG2319 75 LGHTAAVLSVAFSPDGRLLASASADGT--VRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGSAD--GTVRLWDL 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 138 KRGKMLSMAPGHTDRIFDISWDlyqPNklvscgvkhikfwslcgnaltpkrgvfGKTgdlqtilcLAcardeltySGALN 217
Cdd:COG2319 150 ATGKLLRTLTGHSGAVTSVAFS---PD---------------------------GKL--------LA--------SGSDD 183
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 218 GDIYVW--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIDLRETdqgykglSVRSVCWR 292
Cdd:COG2319 184 GTVRLWdlATGKLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLaTGKLLRTLTGHSG-------SVRSVAFS 255
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 293 --GDHILVGTQDSEIFeIVVQERNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCN-MEEPI 369
Cdd:COG2319 256 pdGRLLASGSADGTVR-LWDLATGELLRTLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTgHTGAV 333
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1867163826 370 RCAAVNADGIHLALGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSPDGTYLAVGCNDSSVDIYGVA 438
Cdd:COG2319 334 RSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 1425-1812 | 1.28e-30 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 126.56 E-value: 1.28e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1425 LTVNQHPKFINIVATGQVGDSADMSATAPSIHIWDAMNKQTLSILRcYHSKGVCSVSFSATGKLLLSVGLDpeHTITIWR 1504
Cdd:COG2319 72 ATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLT-GHTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1505 WQEGAKIASRAGHNQRIFVAEFRPDSdTQFVSVGV-KHVKFWTLAGRALLSkkgllsTLEdARMQTMLAIAFGANNLTF- 1582
Cdd:COG2319 149 LATGKLLRTLTGHSGAVTSVAFSPDG-KLLASGSDdGTVRLWDLATGKLLR------TLT-GHTGAVRSVAFSPDGKLLa 220
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1583 TGTISGDVCVWK-DHILCRIVARAHNGPVFAMyTTLRDG-LIVTGGkerpskEGGAVKLWDqelrrcrafrLETGQatdC 1660
Cdd:COG2319 221 SGSADGTVRLWDlATGKLLRTLTGHSGSVRSV-AFSPDGrLLASGS------ADGTVRLWD----------LATGE---L 280
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1661 VRSVcrgkgkilvgtrnaeiievgeknaacnilvnGHVDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlG 1740
Cdd:COG2319 281 LRTL-------------------------------TGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT-G 328
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1867163826 1741 HAA--RTVCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCAIHDIRFSPDSRYLAVGSSENSVDFYDLT 1812
Cdd:COG2319 329 HTGavRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 57-353 | 4.87e-30 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 122.06 E-value: 4.87e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 57 YRGHSDDIISLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKdVHTHGIACLAFDLDGQRLVSVGLDskNAVCVWD 136
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSSD--KTIRLWD 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 137 WKRGKMLSMAPGHTDRIFDISWDlyqPNK--LVSCGV-KHIKFWSLcgNALTPKRGVFGKTGDlqtILCLACARDE-LTY 212
Cdd:cd00200 80 LETGECVRTLTGHTSYVSSVAFS---PDGriLSSSSRdKTIKVWDV--ETGKCLTTLRGHTDW---VNSVAFSPDGtFVA 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 213 SGALNGDIYVW--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIDlretdqgYKGLSVR 287
Cdd:cd00200 152 SSSQDGTIKLWdlRTGKCVATLTG-HTGEVNSVAFSPDGekLLSSSSDGTIKLWDLsTGKCLGTLR-------GHENGVN 223
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 288 SVCW--RGDHILVGTQDS--EIFEIVVQERNKPFlimQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 353
Cdd:cd00200 224 SVAFspDGYLLASGSEDGtiRVWDLRTGECVQTL---SGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 52-354 | 5.08e-30 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 125.02 E-value: 5.08e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 52 HRQKFYRGHSDDIISLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKDvHTHGIACLAFDLDGQRLVSVGLDskNA 131
Cdd:COG2319 111 LLLRTLTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GT 185
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 132 VCVWDWKRGKMLSMAPGHTDRIFDISWDlyqPN--KLVSCGV-KHIKFWSLcgnaltpKRGVFGKT--GDLQTILCLACA 206
Cdd:COG2319 186 VRLWDLATGKLLRTLTGHTGAVRSVAFS---PDgkLLASGSAdGTVRLWDL-------ATGKLLRTltGHSGSVRSVAFS 255
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 207 RDELT-YSGALNGDIYVW--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIdlretdQG 280
Cdd:COG2319 256 PDGRLlASGSADGTVRLWdlATGELLRTLTG-HSGGVNSVAFSPDGklLASGSDDGTVRLWDLaTGKLLRTL------TG 328
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1867163826 281 YKGlSVRSVCWR--GDHILVGTQDSEI--FEIvvqERNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSL 354
Cdd:COG2319 329 HTG-AVRSVAFSpdGKTLASGSDDGTVrlWDL---ATGELLRTLTGH-TGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 719-1063 | 6.18e-30 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 124.64 E-value: 6.18e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 719 TQRFYLGHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTV 798
Cdd:COG2319 112 LLRTLTGHTGAVRSVAFSPDGKTLASG--SADGTVRLWDLATGKLLRTLTGHSG-AVTSVAFSPDGKLLASGSDD--GTV 186
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 799 VLWDWKKGEKLSIARGSKDKIFVVKMNPyvpD--KLITAGI-KHMKFWRKAGGGLIGrkgyigTLGKNDtmmcavyGWTE 875
Cdd:COG2319 187 RLWDLATGKLLRTLTGHTGAVRSVAFSP---DgkLLASGSAdGTVRLWDLATGKLLR------TLTGHS-------GSVR 250
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 876 EMAFS--------GTSTGDVCIWR--DIFLVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYaikr 943
Cdd:COG2319 251 SVAFSpdgrllasGSADGTVRLWDlaTGELLRTLTGHSGGVNSVAFSPDGklLASGSDDGTVRLWDLATGKLLRTL---- 326
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 944 aalapgskglllednpsiraislghghilvgtkngeilevdksgpitllvQGHmEGEVWGLATHPYLPICATVSDDKTLR 1023
Cdd:COG2319 327 --------------------------------------------------TGH-TGAVRSVAFSPDGKTLASGSDDGTVR 355
|
330 340 350 360
....*....|....*....|....*....|....*....|
gi 1867163826 1024 IWDLSPSHCMLAVRKLKKGGRCCCFSPDGKALAVGLNDGS 1063
Cdd:COG2319 356 LWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGT 395
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 260-802 | 1.62e-27 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 117.32 E-value: 1.62e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 260 RLWDLTFKPITVIDLRETDQGYKGLSVRSVCWRGDHILVGTQDSEIFEIVVQERNKPFLIMQGHcEGELWALAVHPTKPL 339
Cdd:COG2319 14 ADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGH-TAAVLSVAFSPDGRL 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 340 AVTGSDDRSVRIWSLVDHALIARCNM-EEPIRCAAVNADGIHLALGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSP 418
Cdd:COG2319 93 LASASADGTVRLWDLATGLLLRTLTGhTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSP 172
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 419 DGTYLAVGCNDSSVDIYGVAQrykkvGECLGSL----SFITHLDWSSDSRYLQTNDGNGK-RLfYRMPGGKEVTSTEEik 493
Cdd:COG2319 173 DGKLLASGSDDGTVRLWDLAT-----GKLLRTLtghtGAVRSVAFSPDGKLLASGSADGTvRL-WDLATGKLLRTLTG-- 244
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 494 gvHWASWTCVSglevngiWpkysdindinSVDGnyigQVLVTADDYGIIKLFRypcLRKGAKFRKYIGHSAHVTNVRWSH 573
Cdd:COG2319 245 --HSGSVRSVA-------F----------SPDG----RLLASGSADGTVRLWD---LATGELLRTLTGHSGGVNSVAFSP 298
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 574 DYQWVISiGGADHSVFQWkfiperklkdavhiapqesladshsdesdsdlsdvpeldseieqetqltyrrqvykedlpql 653
Cdd:COG2319 299 DGKLLAS-GSDDGTVRLW-------------------------------------------------------------- 315
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 654 keqckekqksatskrrERAPGNSIRLHfvhgyrgydcrsnlfytqigeivyhvaavgviynrqqntqrfyLGHDDDILCL 733
Cdd:COG2319 316 ----------------DLATGKLLRTL-------------------------------------------TGHTGAVRSV 336
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1867163826 734 TIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWD 802
Cdd:COG2319 337 AFSPDGKTLASG--SDDGTVRLWDLATGELLRTLTGHTG-AVTSVAFSPDGRTLASGSAD--GTVRLWD 400
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 1499-1967 | 1.81e-27 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 117.32 E-value: 1.81e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1499 TITIWRWQEGAKIASRAGHNQRIFVAEFRPDSDTQFVSVGVKHVKFWTLAGRALLSkkgllsTLEDARMQTMLAIAFGAN 1578
Cdd:COG2319 17 ALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLA------TLLGHTAAVLSVAFSPDG 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1579 NLTFTGTISGDVCVWK-DHILCRIVARAHNGPVFAMyTTLRDG-LIVTGGkerpskEGGAVKLWDqelrrcrafrLETGQ 1656
Cdd:COG2319 91 RLLASASADGTVRLWDlATGLLLRTLTGHTGAVRSV-AFSPDGkTLASGS------ADGTVRLWD----------LATGK 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1657 atdcvrsvcrgkgkiLVGTrnaeiievgeknaacnilVNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLNK 1736
Cdd:COG2319 154 ---------------LLRT------------------LTGH-SGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRT 199
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1737 VNlGHAA--RTVCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCAIHDIRFSPDSRYLAVGSSENSVDFYDLTLG 1814
Cdd:COG2319 200 LT-GHTGavRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATG 278
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1815 PTLNRIsycKDIPSFVIQMDFSADSSYLqVSSGCYKR-HVYEVPSGKhlmdhaaidritwatwtsilgdeVLGIWSRHAe 1893
Cdd:COG2319 279 ELLRTL---TGHSGGVNSVAFSPDGKLL-ASGSDDGTvRLWDLATGK-----------------------LLRTLTGHT- 330
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1867163826 1894 kADVNCACVSHSGISLVTGDDFGMVKLFDfpcPEKFAKHKRFLGHSPHVTNIRFTSGDRHVVSaGGDDCSLFVW 1967
Cdd:COG2319 331 -GAVRSVAFSPDGKTLASGSDDGTVRLWD---LATGELLRTLTGHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
|
|
| WD40 | COG2319 | WD40 repeat [General function prediction only]; | 251-591 | 1.94e-26 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 114.24 E-value: 1.94e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 251 ATGGRDGCIRLWDLTFKPITVIDLRETDqgykglSVRSVCWR--GDHILVGTQDSEI--FEIvvqERNKPFLIMQGHcEG 326
Cdd:COG2319 94 ASASADGTVRLWDLATGLLLRTLTGHTG------AVRSVAFSpdGKTLASGSADGTVrlWDL---ATGKLLRTLTGH-SG 163
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 327 ELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCN-MEEPIRCAAVNADGIHLALGMKDGSFTVLRVRDMTEVVHIK 405
Cdd:COG2319 164 AVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLT 243
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 406 DRKEAIHELKYSPDGTYLAVGCNDSSVDIYGVAQrykkvGECLGSL----SFITHLDWSSDSRYLQTNDGNGKrlfyrmp 481
Cdd:COG2319 244 GHSGSVRSVAFSPDGRLLASGSADGTVRLWDLAT-----GELLRTLtghsGGVNSVAFSPDGKLLASGSDDGT------- 311
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 482 ggkevtsteeIKGVHWASWTCVSGLEVNGIWpkysdindINSVDGNYIGQVLVTADDYGIIKLFRypcLRKGAKFRKYIG 561
Cdd:COG2319 312 ----------VRLWDLATGKLLRTLTGHTGA--------VRSVAFSPDGKTLASGSDDGTVRLWD---LATGELLRTLTG 370
|
330 340 350
....*....|....*....|....*....|
gi 1867163826 562 HSAHVTNVRWSHDYQWVISiGGADHSVFQW 591
Cdd:COG2319 371 HTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
148-435 |
3.80e-26 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 110.50 E-value: 3.80e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 148 GHTDRIFDISWDlYQPNKLVSCGV-KHIKFWSLCGNalTPKRGVFGKTGDLQTilCLACARDELTYSGALNGDIYVW--K 224
Cdd:cd00200 7 GHTGGVTCVAFS-PDGKLLATGSGdGTIKVWDLETG--ELLRTLKGHTGPVRD--VAASADGTYLASGSSDKTIRLWdlE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 225 GINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIdlretdQGYKGlSVRSVCWRGDHILV--G 299
Cdd:cd00200 82 TGECVRTLTG-HTSYVSSVAFSPDGriLSSSSRDKTIKVWDVeTGKCLTTL------RGHTD-WVNSVAFSPDGTFVasS 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 300 TQDSEIFEIVVQErNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCN-MEEPIRCAAVNADG 378
Cdd:cd00200 154 SQDGTIKLWDLRT-GKCVATLTGH-TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRgHENGVNSVAFSPDG 231
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*..
gi 1867163826 379 IHLALGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSPDGTYLAVGCNDSSVDIY 435
Cdd:cd00200 232 YLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIW 288
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1454-1730 |
1.03e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 108.85 E-value: 1.03e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1454 SIHIWDAMNKQTLSILRCyHSKGVCSVSFSATGKLLLSVGLDpeHTITIWRWQEGAKIASRAGHNQRIFVAEFRPDSDTq 1533
Cdd:COG2319 143 TVRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKL- 218
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1534 FVSVGV-KHVKFWTLAGRALLSkkgllsTLEDARmQTMLAIAFGANNLTF-TGTISGDVCVWK-DHILCRIVARAHNGPV 1610
Cdd:COG2319 219 LASGSAdGTVRLWDLATGKLLR------TLTGHS-GSVRSVAFSPDGRLLaSGSADGTVRLWDlATGELLRTLTGHSGGV 291
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1611 FAMYTTLRDGLIVTGGkerpskEGGAVKLWDQELRRC-RAFRLETGQatdcVRSVC-RGKGKILVGTRNAEIIEVGE-KN 1687
Cdd:COG2319 292 NSVAFSPDGKLLASGS------DDGTVRLWDLATGKLlRTLTGHTGA----VRSVAfSPDGKTLASGSDDGTVRLWDlAT 361
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 1867163826 1688 AACNILVNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIAD 1730
Cdd:COG2319 362 GELLRTLTGH-TGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1425-1922 |
8.61e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 106.15 E-value: 8.61e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1425 LTVNQHPKFINIVATGQVGDSADMSATAPSIHIWDAMNKQTLSILRcYHSKGVCSVSFSATGKLLLSVGLDpeHTITIWR 1504
Cdd:COG2319 30 LLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLL-GHTAAVLSVAFSPDGRLLASASAD--GTVRLWD 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1505 WQEGAKIASRAGHNQRIFVAEFRPDSDTqFVSVGV-KHVKFWTLAGRALLSkkgllstledarmqtmlaiafgannlTFT 1583
Cdd:COG2319 107 LATGLLLRTLTGHTGAVRSVAFSPDGKT-LASGSAdGTVRLWDLATGKLLR--------------------------TLT 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1584 GtisgdvcvwkdhilcrivaraHNGPVFAMyTTLRDG-LIVTGGkerpskEGGAVKLWDqelrrcrafrLETGQatdCVR 1662
Cdd:COG2319 160 G---------------------HSGAVTSV-AFSPDGkLLASGS------DDGTVRLWD----------LATGK---LLR 198
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1663 SVcrgkgkilvgtrnaeiievgeknaacnilvNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlGHA 1742
Cdd:COG2319 199 TL------------------------------TGH-TGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLT-GHS 246
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1743 A--RTVCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCAIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRI 1820
Cdd:COG2319 247 GsvRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTL 326
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1821 sycKDIPSFVIQMDFSADSSYLQVSSGCYKRHVYEVPSGKhlmdhaaidritwatwtsilgdeVLGIWSRHAekADVNCA 1900
Cdd:COG2319 327 ---TGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGE-----------------------LLRTLTGHT--GAVTSV 378
|
490 500
....*....|....*....|..
gi 1867163826 1901 CVSHSGISLVTGDDFGMVKLFD 1922
Cdd:COG2319 379 AFSPDGRTLASGSADGTVRLWD 400
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
895-1266 |
1.93e-23 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 104.99 E-value: 1.93e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 895 LVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTY-----AIKRAALAPGSKglllednpsiraislg 967
Cdd:COG2319 70 LLATLLGHTAAVLSVAFSPDGrlLASASADGTVRLWDLATGLLLRTLtghtgAVRSVAFSPDGK---------------- 133
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 968 hgHILVGTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHP---YLpicATVSDDKTLRIWDLSPSHCMLAVRKLKKGG 1043
Cdd:COG2319 134 --TLASGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGKLLRTLTGHTGAV 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1044 RCCCFSPDGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFIDIYNVMSSKRVGICKG 1123
Cdd:COG2319 208 RSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSP-DGRLLASGSADGTVRLWDLATGELLRTLTG 286
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1124 ATSYITHIDWDIRGKLLqvntgakeqlffeAPRGKKQTIpsveveKIawasWTSVLGLCcegIWPVIGEVTDVTASCLTS 1203
Cdd:COG2319 287 HSGGVNSVAFSPDGKLL-------------ASGSDDGTV------RL----WDLATGKL---LRTLTGHTGAVRSVAFSP 340
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1867163826 1204 DKMVLATGDDLGFVKLFRYPTKGKFGKFKryvAHSTHVTNVRWTYDDSMLVTlGGTDMSLMVW 1266
Cdd:COG2319 341 DGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
712-929 |
2.29e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 96.64 E-value: 2.29e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 712 IYNRQQNTQRFYL-GHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQYgVSAVDFSADGKRLASV 790
Cdd:cd00200 77 LWDLETGECVRTLtGHTSYVSSVAFSPDGRILSSS--SRDKTIKVWDVETGKCLTTLRGHTDW-VNSVAFSPDGTFVASS 153
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 791 GIDdsHTVVLWDWKKGEKLSIARGSKDKIFVVKmnpYVPD--KLITAGI-KHMKFWRKAGGGLigrkgyIGTL-GKNDTM 866
Cdd:cd00200 154 SQD--GTIKLWDLRTGKCVATLTGHTGEVNSVA---FSPDgeKLLSSSSdGTIKLWDLSTGKC------LGTLrGHENGV 222
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1867163826 867 MCAVYGWTEEMAFSGTSTGDVCIW--RDIFLVKTVKAHDGPVFSM--HALEKGFVTGGKDGIVALWD 929
Cdd:cd00200 223 NSVAFSPDGYLLASGSEDGTIRVWdlRTGECVQTLSGHTNSVTSLawSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1473-1775 |
7.65e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 95.09 E-value: 7.65e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1473 HSKGVCSVSFSATGKLLLSVGLDpeHTITIWRWQEGAKIASRAGHNQRIFVAEFRPDSdTQFVSVGV-KHVKFWTLagra 1551
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADG-TYLASGSSdKTIRLWDL---- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1552 llSKKGLLSTLEDARmQTMLAIAFGANNLTFTGTIS-GDVCVWK-DHILCRIVARAHNGPVFAMyTTLRDGLIVTGgker 1629
Cdd:cd00200 81 --ETGECVRTLTGHT-SYVSSVAFSPDGRILSSSSRdKTIKVWDvETGKCLTTLRGHTDWVNSV-AFSPDGTFVAS---- 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1630 pSKEGGAVKLWD-QELRRCRAFRLETGQatdcVRSVC--RGKGKILVGTRNAEIIEVGEKNAACNILVNGHvDGPIWGLA 1706
Cdd:cd00200 153 -SSQDGTIKLWDlRTGKCVATLTGHTGE----VNSVAfsPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGH-ENGVNSVA 226
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1867163826 1707 THPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlGHAAR--TVCYSPEGDMVAIGMKNGefiillvsSLKIW 1775
Cdd:cd00200 227 FSPDGYLLASGSEDGTIRVWDLRTGECVQTLS-GHTNSvtSLAWSPDGKRLASGSADG--------TIRIW 288
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
228-592 |
3.17e-20 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 93.17 E-value: 3.17e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 228 LIRTIQGaHAAGIFSM--NACEEGFATGGRDGCIRLWDLTFKpitviDLRETDQGYKGlSVRSVCWRGDH--ILVGTQDS 303
Cdd:cd00200 1 LRRTLKG-HTGGVTCVafSPDGKLLATGSGDGTIKVWDLETG-----ELLRTLKGHTG-PVRDVAASADGtyLASGSSDK 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 304 EIFeIVVQERNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCNM-EEPIRCAAVNADGIHLA 382
Cdd:cd00200 74 TIR-LWDLETGECVRTLTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGhTDWVNSVAFSPDGTFVA 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 383 LGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSPDGTYLAVGCNDSSVDIYGVAQRyKKVGECLGSLSFITHLDWSSD 462
Cdd:cd00200 152 SSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTG-KCLGTLRGHENGVNSVAFSPD 230
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 463 SRYLqtndgngkrlfyrmpggkevtsteeikgvhwaswtcvsglevngiwpkysdindinsvdgnyigqvlVTADDYGII 542
Cdd:cd00200 231 GYLL-------------------------------------------------------------------ASGSEDGTI 243
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|
gi 1867163826 543 KLFRypcLRKGAKFRKYIGHSAHVTNVRWSHDYQWVISiGGADHSVFQWK 592
Cdd:cd00200 244 RVWD---LRTGECVQTLSGHTNSVTSLAWSPDGKRLAS-GSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
895-1140 |
4.70e-20 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 92.78 E-value: 4.70e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 895 LVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYAIKRAALApgskglllednpsiRAISLGHGH-I 971
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGklLATGSGDGTIKVWDLETGELLRTLKGHTGPVR--------------DVAASADGTyL 66
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 972 LVGTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHPYLPICATVSDDKTLRIWDLSPSHCMLAVRKLKKGGRCCCFSP 1050
Cdd:cd00200 67 ASGSSDKTIRLWDlETGECVRTLTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSP 145
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1051 DGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFIDIYNVMSSKRVGICKGATSYITH 1130
Cdd:cd00200 146 DGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSP-DGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNS 224
|
250
....*....|
gi 1867163826 1131 IDWDIRGKLL 1140
Cdd:cd00200 225 VAFSPDGYLL 234
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
2-49 |
7.07e-19 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 82.60 E-value: 7.07e-19
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 1867163826 2 AARSAPSCHLRLEWVYGYRGHQCRNNLYYTAAKEIVYFVAGVGVVYSP 49
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1694-1967 |
2.87e-18 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 87.39 E-value: 2.87e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1694 VNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLnKVNLGHAA--RTVCYSPEGDMVAIGMKNGefiillvsS 1771
Cdd:cd00200 5 LKGH-TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELL-RTLKGHTGpvRDVAASADGTYLASGSSDK--------T 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1772 LKIWgKKRDRRC---------AIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRISYCKDipsFVIQMDFSADSSYl 1842
Cdd:cd00200 75 IRLW-DLETGECvrtltghtsYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD---WVNSVAFSPDGTF- 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1843 qVSSGCYKR--HVYEVPSGKHLMDHaaidritwatwtsilgdevlgiwsrHAEKADVNCACVSHSGISLVTGDDFGMVKL 1920
Cdd:cd00200 150 -VASSSQDGtiKLWDLRTGKCVATL-------------------------TGHTGEVNSVAFSPDGEKLLSSSSDGTIKL 203
|
250 260 270 280
....*....|....*....|....*....|....*....|....*..
gi 1867163826 1921 FDfpcPEKFAKHKRFLGHSPHVTNIRFtSGDRHVVSAGGDDCSLFVW 1967
Cdd:cd00200 204 WD---LSTGKCLGTLRGHENGVNSVAF-SPDGYLLASGSEDGTIRVW 246
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
672-715 |
9.08e-18 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 79.13 E-value: 9.08e-18
10 20 30 40
....*....|....*....|....*....|....*....|....
gi 1867163826 672 APGNSIRLHFVHGYRGYDCRSNLFYTQIGEIVYHVAAVGVIYNR 715
Cdd:pfam03451 29 PPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1415-1727 |
1.95e-17 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 85.08 E-value: 1.95e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1415 YQEHNDDILCLTVNQHPKFInivATGqvgdSADmsataPSIHIWDaMNKQTLSILRCYHSKGVCSVSFSATGKLLLSVGL 1494
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLL---ATG----SGD-----GTIKVWD-LETGELLRTLKGHTGPVRDVAASADGTYLASGSS 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1495 DpeHTITIWRWQEGAKIASRAGHNQRIFVAEFRPDSDTQFVSVGVKHVKFWTLAgrallsKKGLLSTLEDARMQTMlAIA 1574
Cdd:cd00200 72 D--KTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVE------TGKCLTTLRGHTDWVN-SVA 142
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1575 F-GANNLTFTGTISGDVCVWKDHIL-CRIVARAHNGPVFAMYTTLRDGLIVTGGkerpskEGGAVKLWDQELRRCRA-FR 1651
Cdd:cd00200 143 FsPDGTFVASSSQDGTIKLWDLRTGkCVATLTGHTGEVNSVAFSPDGEKLLSSS------SDGTIKLWDLSTGKCLGtLR 216
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1867163826 1652 LETGQATDCVRSvcrGKGKILVGTRNAEIIEVGE-KNAACNILVNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWD 1727
Cdd:cd00200 217 GHENGVNSVAFS---PDGYLLASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
1335-1407 |
2.67e-15 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 72.20 E-value: 2.67e-15
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1867163826 1335 RQGVVRPPVSRAPPqpeklqTNNVGKKKRPIEDLVLELIFGYRGRDCRNNVHYLNDGdDIIYHTASVGILHNV 1407
Cdd:pfam03451 7 RPGAVYPPSNYYPK------DDLDQKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYDV 72
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1661-1968 |
2.45e-14 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 75.83 E-value: 2.45e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1661 VRSVC--RGKGKILVGTRNAEIIEVGEKNAACNILVNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVN 1738
Cdd:cd00200 12 VTCVAfsPDGKLLATGSGDGTIKVWDLETGELLRTLKGH-TGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTLT 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1739 lGHAA--RTVCYSPEGDMVAIGMKNGefiillvsSLKIW----GKK----RDRRCAIHDIRFSPDSRYLAVGSSENSVDF 1808
Cdd:cd00200 91 -GHTSyvSSVAFSPDGRILSSSSRDK--------TIKVWdvetGKClttlRGHTDWVNSVAFSPDGTFVASSSQDGTIKL 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1809 YDLTLGPTLNRIsycKDIPSFVIQMDFSADSSYLQVSSGCYKRHVYEVPSGKHLMDHaaidritwatwtsilgdevlgiw 1888
Cdd:cd00200 162 WDLRTGKCVATL---TGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTL----------------------- 215
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1889 srHAEKADVNCACVSHSGISLVTGDDFGMVKLFDFpcpEKFAKHKRFLGHSPHVTNIRFtSGDRHVVSAGGDDCSLFVWK 1968
Cdd:cd00200 216 --RGHENGVNSVAFSPDGYLLASGSEDGTIRVWDL---RTGECVQTLSGHTNSVTSLAW-SPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
46-178 |
4.66e-12 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 68.90 E-value: 4.66e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 46 VYSPREHR-QKFYRGHSDDIISLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKdVHTHGIACLAFDLDGQRLVSV 124
Cdd:cd00200 161 LWDLRTGKcVATLTGHTGEVNSVAFSPDGEKLLSSSSDGT--IKLWDLSTGKCLGTLR-GHENGVNSVAFSPDGYLLASG 237
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 1867163826 125 GLDSKnaVCVWDWKRGKMLSMAPGHTDRIFDISWDlYQPNKLVSCGV-KHIKFWS 178
Cdd:cd00200 238 SEDGT--IRVWDLRTGECVQTLSGHTNSVTSLAWS-PDGKRLASGSAdGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1706-1972 |
2.79e-09 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 61.47 E-value: 2.79e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1706 ATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNLGHAARTVC-YSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCA 1784
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLaASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1785 IHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRIsycKDIPSFVIQMDFSADSSYLqVSSGCYKR-HVYEVPSGKhlm 1863
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTL---TGHTGAVRSVAFSPDGKTL-ASGSADGTvRLWDLATGK--- 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1864 dhaaidritwatwtsilgdeVLGIWSRHAEkaDVNCACVSHSGISLVTGDDFGMVKLFDfpcPEKFAKHKRFLGHSPHVT 1943
Cdd:COG2319 154 --------------------LLRTLTGHSG--AVTSVAFSPDGKLLASGSDDGTVRLWD---LATGKLLRTLTGHTGAVR 208
|
250 260 270
....*....|....*....|....*....|....
gi 1867163826 1944 NIRFTSGDRHVVSaGGDDCSLFVW-----KCVHT 1972
Cdd:COG2319 209 SVAFSPDGKLLAS-GSADGTVRLWdlatgKLLRT 241
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1782-1972 |
3.21e-07 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 54.26 E-value: 3.21e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1782 RCAIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRisyCKDIPSFVIQMDFSADSSYLQVSSGCYKRHVYEVPSGK- 1860
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRT---LKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGEc 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1861 --HLMDHAaiDRITWATW-------TSILGDEVLGIWSRHAEK---------ADVNCACVSHSGISLVTGDDFGMVKLFD 1922
Cdd:cd00200 86 vrTLTGHT--SYVSSVAFspdgrilSSSSRDKTIKVWDVETGKclttlrghtDWVNSVAFSPDGTFVASSSQDGTIKLWD 163
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 1867163826 1923 FPCPEKFakhKRFLGHSPHVTNIRFtSGDRHVVSAGGDDCSLFVW-----KCVHT 1972
Cdd:cd00200 164 LRTGKCV---ATLTGHTGEVNSVAF-SPDGEKLLSSSSDGTIKLWdlstgKCLGT 214
|
|
| ANAPC4_WD40 |
pfam12894 |
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ... |
1746-1828 |
1.66e-05 |
|
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain. The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC.
Pssm-ID: 403945 [Multi-domain] Cd Length: 91 Bit Score: 44.96 E-value: 1.66e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1746 VCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRR-CAIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRISYCK 1824
Cdd:pfam12894 1 MSWCPTMDLIALATEDGELLLHRLNWQRVWTLSPDKEdLEVTSLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFSAGS 80
|
....
gi 1867163826 1825 DIPS 1828
Cdd:pfam12894 81 DLIT 84
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1695-1727 |
3.62e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 39.64 E-value: 3.62e-04
10 20 30
....*....|....*....|....*....|...
gi 1867163826 1695 NGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWD 1727
Cdd:pfam00400 8 EGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
760-802 |
4.13e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.60 E-value: 4.13e-04
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 1867163826 760 TIKPLSILKGHHQYgVSAVDFSADGKRLASVGIDdsHTVVLWD 802
Cdd:smart00320 1 SGELLKTLKGHTGP-VTSVAFSPDGKYLASGSDD--GTIKLWD 40
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1695-1727 |
4.65e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.60 E-value: 4.65e-04
10 20 30
....*....|....*....|....*....|...
gi 1867163826 1695 NGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWD 1727
Cdd:smart00320 9 KGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
314-353 |
1.42e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 38.10 E-value: 1.42e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1867163826 314 NKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 353
Cdd:pfam00400 1 GKLLKTLEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WDR74 |
cd22857 |
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and ... |
250-354 |
2.31e-03 |
|
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and plants is an essential factor for ribosome assembly. In cooperation with the assembly factor NVL2, WDR74 participates in an early cleavage step of the pre-rRNA processing pathway. NVL2 is a type II double-ring AAA-ATPase that may mediate the release of WDR74 from nucleolar pre-60S particles. WDR74 has been implicated in tumorigenesis. In lung cancer, it regulates cell proliferation, cell cycle progression, chemoresistance, and cell aggressiveness by inducing nuclear beta-catenin accumulation and driving expression of downstream Wnt-responsive genes. In melanoma, it promotes apoptosis resistance and aggressive behavior by regulating the RPL5-MDM2-p53 pathway. WDR74 contains an N-terminal seven-bladed beta-propeller WD40 domain that associates with the D1-AAA domain of the AAA-ATPase NVL2, and a flexible lysine-rich C-terminus that extends outward from the WD40 domain and is required for nucleolar localization.
Pssm-ID: 439303 [Multi-domain] Cd Length: 325 Bit Score: 42.21 E-value: 2.31e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 250 FATGGRDGCIRLWDLTF--KPITVIDLRETdqGYKGLSVRSvcwRGDHILVGTQDSEIFEIVVQErNKPFLIMQGHCEGE 327
Cdd:cd22857 195 IVTGTGYHQVRLYDTRAqrRPVVSVDFGET--PIKAVAEDP---DGHTVYVGDTSGDLASIDLRT-GKLLGCFKGKCGGS 268
|
90 100
....*....|....*....|....*..
gi 1867163826 328 LWALAVHPTKPLAVTGSDDRSVRIWSL 354
Cdd:cd22857 269 IRSIARHPELPLIASCGLDRYLRIWDT 295
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
315-353 |
2.43e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 37.29 E-value: 2.43e-03
10 20 30
....*....|....*....|....*....|....*....
gi 1867163826 315 KPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 353
Cdd:smart00320 3 ELLKTLKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
994-1026 |
3.53e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.91 E-value: 3.53e-03
10 20 30
....*....|....*....|....*....|...
gi 1867163826 994 QGHmEGEVWGLATHPYLPICATVSDDKTLRIWD 1026
Cdd:smart00320 9 KGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
762-802 |
5.47e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 36.55 E-value: 5.47e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1867163826 762 KPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWD 802
Cdd:pfam00400 2 KLLKTLEGHTG-SVTSLAFSPDGKLLASGSDD--GTVKVWD 39
|
|
|