Conserved domains on [gi|1867163826|ref|NP_001372045|]

echinoderm microtubule-associated protein-like 5 isoform 2 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
725-1063 1.74e-34

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
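
The repeat-to-blade relationship stated in the description above can be sketched in a few lines of Python. This is only an illustration of the wording "first three strands of one blade and the last strand in the next blade"; the function name `blade_composition` and the 0-based repeat and blade indices are assumptions made for the example, not part of the CDD record.

```python
def blade_composition(n_repeats=7):
    """Return {blade: [(repeat, strands_contributed), ...]} for a closed propeller.

    Hypothetical helper: each repeat gives three strands to its own blade
    and one strand to the next blade, wrapping around at the end so the
    last repeat closes the ring on the first blade.
    """
    blades = {b: [] for b in range(n_repeats)}
    for r in range(n_repeats):
        blades[r].append((r, 3))                    # first three strands of this blade
        blades[(r + 1) % n_repeats].append((r, 1))  # last strand of the next blade
    return blades


if __name__ == "__main__":
    for blade, parts in blade_composition(7).items():
        print(blade, parts)
```

With seven repeats, every blade ends up with four strands drawn from two neighbouring repeats, and the first blade receives its final strand from the last repeat, which is the ring closure the description refers to.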


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 134.77  E-value: 1.74e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  725 GHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWDWK 804
Cdd:cd00200      7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKGHTG-PVRDVAASADGTYLASGSSD--KTIRLWDLE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  805 KGEKLSIARGSKDKIFVVKMNPYvpDKLITAGIKH--MKFWRKAGGGLIGRkgyigTLGKNDTMMCAVYGWTEEMAFSGT 882
Cdd:cd00200     82 TGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVETGKCLTT-----LRGHTDWVNSVAFSPDGTFVASSS 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  883 STGDVCIW--RDIFLVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYaikraalapgskgllledn 958
Cdd:cd00200    155 QDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTL------------------- 215
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  959 psiraislghghilvgtkngeilevdksgpitllvQGHmEGEVWGLATHPYLPICATVSDDKTLRIWDLSPSHCMLAVRK 1038
Cdd:cd00200    216 -----------------------------------RGH-ENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG 259
                          330       340
                   ....*....|....*....|....*
gi 1867163826 1039 LKKGGRCCCFSPDGKALAVGLNDGS 1063
Cdd:cd00200    260 HTNSVTSLAWSPDGKRLASGSADGT 284
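
The alignment rows above follow a regular layout: a label, a start coordinate, the aligned sequence, and an end coordinate. The Python sketch below parses one such row and checks that the number of non-gap characters equals `end - start + 1`. The regular expression and the names `parse_row` and `is_consistent` are assumptions introduced for this example; they are not an NCBI tool or API.

```python
import re

# Label ("gi <number>" or "Cdd:<accession>"), start, sequence, end.
ROW = re.compile(
    r"^(?P<label>gi \d+|Cdd:\S+)\s+(?P<start>\d+)\s+"
    r"(?P<seq>[A-Za-z-]+)\s+(?P<end>\d+)\s*$"
)


def parse_row(line):
    """Parse one alignment row; return None for ruler or other lines."""
    m = ROW.match(line)
    if m is None:
        return None
    seq = m["seq"]
    return {
        "label": m["label"],
        "start": int(m["start"]),
        "end": int(m["end"]),
        "seq": seq,
        # Gap characters ("-") do not consume a residue position.
        "residues": sum(ch != "-" for ch in seq),
    }


def is_consistent(row):
    """True when the coordinates span exactly the residues shown."""
    return row["end"] - row["start"] + 1 == row["residues"]
```

Both upper- and lower-case letters are counted as residues here, since the coordinates on the query rows advance over both.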
WD40 COG2319
WD40 repeat [General function prediction only];
58-438 9.65e-31

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 126.95  E-value: 9.65e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826   58 RGHSDDIISLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKDvHTHGIACLAFDLDGQRLVSVGLDskNAVCVWDW 137
Cdd:COG2319     75 LGHTAAVLSVAFSPDGRLLASASADGT--VRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGSAD--GTVRLWDL 149
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  138 KRGKMLSMAPGHTDRIFDISWDlyqPNklvscgvkhikfwslcgnaltpkrgvfGKTgdlqtilcLAcardeltySGALN 217
Cdd:COG2319    150 ATGKLLRTLTGHSGAVTSVAFS---PD---------------------------GKL--------LA--------SGSDD 183
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  218 GDIYVW--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIDLRETdqgykglSVRSVCWR 292
Cdd:COG2319    184 GTVRLWdlATGKLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLaTGKLLRTLTGHSG-------SVRSVAFS 255
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  293 --GDHILVGTQDSEIFeIVVQERNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCN-MEEPI 369
Cdd:COG2319    256 pdGRLLASGSADGTVR-LWDLATGELLRTLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTgHTGAV 333
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1867163826  370 RCAAVNADGIHLALGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSPDGTYLAVGCNDSSVDIYGVA 438
Cdd:COG2319    334 RSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
WD40 COG2319
WD40 repeat [General function prediction only];
1425-1812 1.28e-30

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 126.56  E-value: 1.28e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1425 LTVNQHPKFINIVATGQVGDSADMSATAPSIHIWDAMNKQTLSILRcYHSKGVCSVSFSATGKLLLSVGLDpeHTITIWR 1504
Cdd:COG2319     72 ATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLT-GHTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1505 WQEGAKIASRAGHNQRIFVAEFRPDSdTQFVSVGV-KHVKFWTLAGRALLSkkgllsTLEdARMQTMLAIAFGANNLTF- 1582
Cdd:COG2319    149 LATGKLLRTLTGHSGAVTSVAFSPDG-KLLASGSDdGTVRLWDLATGKLLR------TLT-GHTGAVRSVAFSPDGKLLa 220
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1583 TGTISGDVCVWK-DHILCRIVARAHNGPVFAMyTTLRDG-LIVTGGkerpskEGGAVKLWDqelrrcrafrLETGQatdC 1660
Cdd:COG2319    221 SGSADGTVRLWDlATGKLLRTLTGHSGSVRSV-AFSPDGrLLASGS------ADGTVRLWD----------LATGE---L 280
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1661 VRSVcrgkgkilvgtrnaeiievgeknaacnilvnGHVDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlG 1740
Cdd:COG2319    281 LRTL-------------------------------TGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT-G 328
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1867163826 1741 HAA--RTVCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCAIHDIRFSPDSRYLAVGSSENSVDFYDLT 1812
Cdd:COG2319    329 HTGavRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
WD40 COG2319
WD40 repeat [General function prediction only];
895-1266 1.93e-23

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 104.99  E-value: 1.93e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  895 LVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTY-----AIKRAALAPGSKglllednpsiraislg 967
Cdd:COG2319     70 LLATLLGHTAAVLSVAFSPDGrlLASASADGTVRLWDLATGLLLRTLtghtgAVRSVAFSPDGK---------------- 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  968 hgHILVGTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHP---YLpicATVSDDKTLRIWDLSPSHCMLAVRKLKKGG 1043
Cdd:COG2319    134 --TLASGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGKLLRTLTGHTGAV 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1044 RCCCFSPDGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFIDIYNVMSSKRVGICKG 1123
Cdd:COG2319    208 RSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSP-DGRLLASGSADGTVRLWDLATGELLRTLTG 286
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1124 ATSYITHIDWDIRGKLLqvntgakeqlffeAPRGKKQTIpsveveKIawasWTSVLGLCcegIWPVIGEVTDVTASCLTS 1203
Cdd:COG2319    287 HSGGVNSVAFSPDGKLL-------------ASGSDDGTV------RL----WDLATGKL---LRTLTGHTGAVRSVAFSP 340
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1867163826 1204 DKMVLATGDDLGFVKLFRYPTKGKFGKFKryvAHSTHVTNVRWTYDDSMLVTlGGTDMSLMVW 1266
Cdd:COG2319    341 DGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
HELP pfam03451
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ...
2-49 7.07e-19

HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.


Pssm-ID: 460922  Cd Length: 72  Bit Score: 82.60  E-value: 7.07e-19
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 1867163826    2 AARSAPSCHLRLEWVYGYRGHQCRNNLYYTAAKEIVYFVAGVGVVYSP 49
Cdd:pfam03451   25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1694-1967 2.87e-18

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 87.39  E-value: 2.87e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1694 VNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLnKVNLGHAA--RTVCYSPEGDMVAIGMKNGefiillvsS 1771
Cdd:cd00200      5 LKGH-TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELL-RTLKGHTGpvRDVAASADGTYLASGSSDK--------T 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1772 LKIWgKKRDRRC---------AIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRISYCKDipsFVIQMDFSADSSYl 1842
Cdd:cd00200     75 IRLW-DLETGECvrtltghtsYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD---WVNSVAFSPDGTF- 149
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1843 qVSSGCYKR--HVYEVPSGKHLMDHaaidritwatwtsilgdevlgiwsrHAEKADVNCACVSHSGISLVTGDDFGMVKL 1920
Cdd:cd00200    150 -VASSSQDGtiKLWDLRTGKCVATL-------------------------TGHTGEVNSVAFSPDGEKLLSSSSDGTIKL 203
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*..
gi 1867163826 1921 FDfpcPEKFAKHKRFLGHSPHVTNIRFtSGDRHVVSAGGDDCSLFVW 1967
Cdd:cd00200    204 WD---LSTGKCLGTLRGHENGVNSVAF-SPDGYLLASGSEDGTIRVW 246
HELP pfam03451
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ...
672-715 9.08e-18

HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.


Pssm-ID: 460922  Cd Length: 72  Bit Score: 79.13  E-value: 9.08e-18
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 1867163826  672 APGNSIRLHFVHGYRGYDCRSNLFYTQIGEIVYHVAAVGVIYNR 715
Cdd:pfam03451   29 PPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
HELP pfam03451
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ...
1335-1407 2.67e-15

HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.


Pssm-ID: 460922  Cd Length: 72  Bit Score: 72.20  E-value: 2.67e-15
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1867163826 1335 RQGVVRPPVSRAPPqpeklqTNNVGKKKRPIEDLVLELIFGYRGRDCRNNVHYLNDGdDIIYHTASVGILHNV 1407
Cdd:pfam03451    7 RPGAVYPPSNYYPK------DDLDQKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYDV 72
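
The hit list above reports each domain as an "Interval E-value" line followed later by a "Pssm-ID" summary line. As a rough sketch, the Python below extracts those fields so that interval lengths and scores can be compared across hits. The regular expressions and the names `parse_interval` and `parse_pssm` are assumptions made for this example and only cover the line shapes that appear in this listing.

```python
import re

# e.g. "725-1063 1.74e-34"
INTERVAL = re.compile(r"^(\d+)-(\d+)\s+([0-9.]+(?:e[+-]?\d+)?)$")

# e.g. "Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 134.77  E-value: 1.74e-34"
PSSM = re.compile(
    r"Pssm-ID:\s*(\d+).*?Cd Length:\s*(\d+)\s+"
    r"Bit Score:\s*([0-9.]+)\s+E-value:\s*(\S+)"
)


def parse_interval(line):
    """Return start, end, length and E-value for an interval line, else None."""
    m = INTERVAL.match(line.strip())
    if m is None:
        return None
    start, end = int(m[1]), int(m[2])
    return {
        "start": start,
        "end": end,
        "length": end - start + 1,  # inclusive residue coordinates
        "evalue": float(m[3]),
    }


def parse_pssm(line):
    """Return the numeric fields of a Pssm-ID summary line, else None."""
    m = PSSM.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m[1]),
        "cd_length": int(m[2]),
        "bit_score": float(m[3]),
        "evalue": float(m[4]),
    }
```

For instance, `parse_interval("725-1063 1.74e-34")` gives a length of 339 residues on the query, which can be set against the `cd_length` of 289 reported for the matching model.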
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
725-1063 1.74e-34

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 134.77  E-value: 1.74e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  725 GHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWDWK 804
Cdd:cd00200      7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKGHTG-PVRDVAASADGTYLASGSSD--KTIRLWDLE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  805 KGEKLSIARGSKDKIFVVKMNPYvpDKLITAGIKH--MKFWRKAGGGLIGRkgyigTLGKNDTMMCAVYGWTEEMAFSGT 882
Cdd:cd00200     82 TGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVETGKCLTT-----LRGHTDWVNSVAFSPDGTFVASSS 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  883 STGDVCIW--RDIFLVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYaikraalapgskgllledn 958
Cdd:cd00200    155 QDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTL------------------- 215
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  959 psiraislghghilvgtkngeilevdksgpitllvQGHmEGEVWGLATHPYLPICATVSDDKTLRIWDLSPSHCMLAVRK 1038
Cdd:cd00200    216 -----------------------------------RGH-ENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG 259
                          330       340
                   ....*....|....*....|....*
gi 1867163826 1039 LKKGGRCCCFSPDGKALAVGLNDGS 1063
Cdd:cd00200    260 HTNSVTSLAWSPDGKRLASGSADGT 284
WD40 COG2319
WD40 repeat [General function prediction only];
720-1112 6.02e-34

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 136.58  E-value: 6.02e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  720 QRFYLGHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQYgVSAVDFSADGKRLASVGIDdsHTVV 799
Cdd:COG2319     71 LATLLGHTAAVLSVAFSPDGRLLASA--SADGTVRLWDLATGLLLRTLTGHTGA-VRSVAFSPDGKTLASGSAD--GTVR 145
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  800 LWDWKKGEKLSIARGSKDKIFVVKMNPyvpD--KLITAGI-KHMKFWRKAGGGLIgrkgyigtlgkndtmmcavygwtee 876
Cdd:COG2319    146 LWDLATGKLLRTLTGHSGAVTSVAFSP---DgkLLASGSDdGTVRLWDLATGKLL------------------------- 197
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  877 mafsgtstgdvciwrdiflvKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYAikraalapgskgll 954
Cdd:COG2319    198 --------------------RTLTGHTGAVRSVAFSPDGklLASGSADGTVRLWDLATGKLLRTLT-------------- 243
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  955 lEDNPSIRAISLGH-GHILV-GTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHP---YLpicATVSDDKTLRIWDLS 1028
Cdd:COG2319    244 -GHSGSVRSVAFSPdGRLLAsGSADGTVRLWDlATGELLRTLTGH-SGGVNSVAFSPdgkLL---ASGSDDGTVRLWDLA 318
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1029 PSHCMLAVRKLKKGGRCCCFSPDGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFID 1108
Cdd:COG2319    319 TGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSP-DGRTLASGSADGTVR 397

                   ....
gi 1867163826 1109 IYNV 1112
Cdd:COG2319    398 LWDL 401
WD40 COG2319
WD40 repeat [General function prediction only];
58-438 9.65e-31

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 126.95  E-value: 9.65e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826   58 RGHSDDIISLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKDvHTHGIACLAFDLDGQRLVSVGLDskNAVCVWDW 137
Cdd:COG2319     75 LGHTAAVLSVAFSPDGRLLASASADGT--VRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGSAD--GTVRLWDL 149
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  138 KRGKMLSMAPGHTDRIFDISWDlyqPNklvscgvkhikfwslcgnaltpkrgvfGKTgdlqtilcLAcardeltySGALN 217
Cdd:COG2319    150 ATGKLLRTLTGHSGAVTSVAFS---PD---------------------------GKL--------LA--------SGSDD 183
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  218 GDIYVW--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIDLRETdqgykglSVRSVCWR 292
Cdd:COG2319    184 GTVRLWdlATGKLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLaTGKLLRTLTGHSG-------SVRSVAFS 255
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  293 --GDHILVGTQDSEIFeIVVQERNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCN-MEEPI 369
Cdd:COG2319    256 pdGRLLASGSADGTVR-LWDLATGELLRTLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTgHTGAV 333
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1867163826  370 RCAAVNADGIHLALGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSPDGTYLAVGCNDSSVDIYGVA 438
Cdd:COG2319    334 RSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
WD40 COG2319
WD40 repeat [General function prediction only];
1425-1812 1.28e-30

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 126.56  E-value: 1.28e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1425 LTVNQHPKFINIVATGQVGDSADMSATAPSIHIWDAMNKQTLSILRcYHSKGVCSVSFSATGKLLLSVGLDpeHTITIWR 1504
Cdd:COG2319     72 ATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLT-GHTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1505 WQEGAKIASRAGHNQRIFVAEFRPDSdTQFVSVGV-KHVKFWTLAGRALLSkkgllsTLEdARMQTMLAIAFGANNLTF- 1582
Cdd:COG2319    149 LATGKLLRTLTGHSGAVTSVAFSPDG-KLLASGSDdGTVRLWDLATGKLLR------TLT-GHTGAVRSVAFSPDGKLLa 220
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1583 TGTISGDVCVWK-DHILCRIVARAHNGPVFAMyTTLRDG-LIVTGGkerpskEGGAVKLWDqelrrcrafrLETGQatdC 1660
Cdd:COG2319    221 SGSADGTVRLWDlATGKLLRTLTGHSGSVRSV-AFSPDGrLLASGS------ADGTVRLWD----------LATGE---L 280
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1661 VRSVcrgkgkilvgtrnaeiievgeknaacnilvnGHVDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlG 1740
Cdd:COG2319    281 LRTL-------------------------------TGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT-G 328
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1867163826 1741 HAA--RTVCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCAIHDIRFSPDSRYLAVGSSENSVDFYDLT 1812
Cdd:COG2319    329 HTGavRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
57-353 4.87e-30

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 122.06  E-value: 4.87e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826   57 YRGHSDDIISLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKdVHTHGIACLAFDLDGQRLVSVGLDskNAVCVWD 136
Cdd:cd00200      5 LKGHTGGVTCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSSD--KTIRLWD 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  137 WKRGKMLSMAPGHTDRIFDISWDlyqPNK--LVSCGV-KHIKFWSLcgNALTPKRGVFGKTGDlqtILCLACARDE-LTY 212
Cdd:cd00200     80 LETGECVRTLTGHTSYVSSVAFS---PDGriLSSSSRdKTIKVWDV--ETGKCLTTLRGHTDW---VNSVAFSPDGtFVA 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  213 SGALNGDIYVW--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIDlretdqgYKGLSVR 287
Cdd:cd00200    152 SSSQDGTIKLWdlRTGKCVATLTG-HTGEVNSVAFSPDGekLLSSSSDGTIKLWDLsTGKCLGTLR-------GHENGVN 223
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  288 SVCW--RGDHILVGTQDS--EIFEIVVQERNKPFlimQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 353
Cdd:cd00200    224 SVAFspDGYLLASGSEDGtiRVWDLRTGECVQTL---SGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
895-1266 1.93e-23

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 104.99  E-value: 1.93e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  895 LVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTY-----AIKRAALAPGSKglllednpsiraislg 967
Cdd:COG2319     70 LLATLLGHTAAVLSVAFSPDGrlLASASADGTVRLWDLATGLLLRTLtghtgAVRSVAFSPDGK---------------- 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  968 hgHILVGTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHP---YLpicATVSDDKTLRIWDLSPSHCMLAVRKLKKGG 1043
Cdd:COG2319    134 --TLASGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGKLLRTLTGHTGAV 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1044 RCCCFSPDGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFIDIYNVMSSKRVGICKG 1123
Cdd:COG2319    208 RSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSP-DGRLLASGSADGTVRLWDLATGELLRTLTG 286
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1124 ATSYITHIDWDIRGKLLqvntgakeqlffeAPRGKKQTIpsveveKIawasWTSVLGLCcegIWPVIGEVTDVTASCLTS 1203
Cdd:COG2319    287 HSGGVNSVAFSPDGKLL-------------ASGSDDGTV------RL----WDLATGKL---LRTLTGHTGAVRSVAFSP 340
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1867163826 1204 DKMVLATGDDLGFVKLFRYPTKGKFGKFKryvAHSTHVTNVRWTYDDSMLVTlGGTDMSLMVW 1266
Cdd:COG2319    341 DGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1473-1775 7.65e-21

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 95.09  E-value: 7.65e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1473 HSKGVCSVSFSATGKLLLSVGLDpeHTITIWRWQEGAKIASRAGHNQRIFVAEFRPDSdTQFVSVGV-KHVKFWTLagra 1551
Cdd:cd00200      8 HTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADG-TYLASGSSdKTIRLWDL---- 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1552 llSKKGLLSTLEDARmQTMLAIAFGANNLTFTGTIS-GDVCVWK-DHILCRIVARAHNGPVFAMyTTLRDGLIVTGgker 1629
Cdd:cd00200     81 --ETGECVRTLTGHT-SYVSSVAFSPDGRILSSSSRdKTIKVWDvETGKCLTTLRGHTDWVNSV-AFSPDGTFVAS---- 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1630 pSKEGGAVKLWD-QELRRCRAFRLETGQatdcVRSVC--RGKGKILVGTRNAEIIEVGEKNAACNILVNGHvDGPIWGLA 1706
Cdd:cd00200    153 -SSQDGTIKLWDlRTGKCVATLTGHTGE----VNSVAfsPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGH-ENGVNSVA 226
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1867163826 1707 THPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlGHAAR--TVCYSPEGDMVAIGMKNGefiillvsSLKIW 1775
Cdd:cd00200    227 FSPDGYLLASGSEDGTIRVWDLRTGECVQTLS-GHTNSvtSLAWSPDGKRLASGSADG--------TIRIW 288
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
895-1140 4.70e-20

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 92.78  E-value: 4.70e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  895 LVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYAIKRAALApgskglllednpsiRAISLGHGH-I 971
Cdd:cd00200      1 LRRTLKGHTGGVTCVAFSPDGklLATGSGDGTIKVWDLETGELLRTLKGHTGPVR--------------DVAASADGTyL 66
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  972 LVGTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHPYLPICATVSDDKTLRIWDLSPSHCMLAVRKLKKGGRCCCFSP 1050
Cdd:cd00200     67 ASGSSDKTIRLWDlETGECVRTLTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSP 145
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1051 DGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFIDIYNVMSSKRVGICKGATSYITH 1130
Cdd:cd00200    146 DGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSP-DGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNS 224
                          250
                   ....*....|
gi 1867163826 1131 IDWDIRGKLL 1140
Cdd:cd00200    225 VAFSPDGYLL 234
HELP pfam03451
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ...
2-49 7.07e-19

HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.


Pssm-ID: 460922  Cd Length: 72  Bit Score: 82.60  E-value: 7.07e-19
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 1867163826    2 AARSAPSCHLRLEWVYGYRGHQCRNNLYYTAAKEIVYFVAGVGVVYSP 49
Cdd:pfam03451   25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1694-1967 2.87e-18

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 87.39  E-value: 2.87e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1694 VNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLnKVNLGHAA--RTVCYSPEGDMVAIGMKNGefiillvsS 1771
Cdd:cd00200      5 LKGH-TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELL-RTLKGHTGpvRDVAASADGTYLASGSSDK--------T 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1772 LKIWgKKRDRRC---------AIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRISYCKDipsFVIQMDFSADSSYl 1842
Cdd:cd00200     75 IRLW-DLETGECvrtltghtsYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD---WVNSVAFSPDGTF- 149
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1843 qVSSGCYKR--HVYEVPSGKHLMDHaaidritwatwtsilgdevlgiwsrHAEKADVNCACVSHSGISLVTGDDFGMVKL 1920
Cdd:cd00200    150 -VASSSQDGtiKLWDLRTGKCVATL-------------------------TGHTGEVNSVAFSPDGEKLLSSSSDGTIKL 203
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*..
gi 1867163826 1921 FDfpcPEKFAKHKRFLGHSPHVTNIRFtSGDRHVVSAGGDDCSLFVW 1967
Cdd:cd00200    204 WD---LSTGKCLGTLRGHENGVNSVAF-SPDGYLLASGSEDGTIRVW 246
HELP pfam03451
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ...
672-715 9.08e-18

HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.


Pssm-ID: 460922  Cd Length: 72  Bit Score: 79.13  E-value: 9.08e-18
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 1867163826  672 APGNSIRLHFVHGYRGYDCRSNLFYTQIGEIVYHVAAVGVIYNR 715
Cdd:pfam03451   29 PPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
HELP pfam03451
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ...
1335-1407 2.67e-15

HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.


Pssm-ID: 460922  Cd Length: 72  Bit Score: 72.20  E-value: 2.67e-15
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1867163826 1335 RQGVVRPPVSRAPPqpeklqTNNVGKKKRPIEDLVLELIFGYRGRDCRNNVHYLNDGdDIIYHTASVGILHNV 1407
Cdd:pfam03451    7 RPGAVYPPSNYYPK------DDLDQKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYDV 72
WD40 COG2319
WD40 repeat [General function prediction only];
1706-1972 2.79e-09

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 61.47  E-value: 2.79e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1706 ATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNLGHAARTVC-YSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCA 1784
Cdd:COG2319      1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLaASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1785 IHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRIsycKDIPSFVIQMDFSADSSYLqVSSGCYKR-HVYEVPSGKhlm 1863
Cdd:COG2319     81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTL---TGHTGAVRSVAFSPDGKTL-ASGSADGTvRLWDLATGK--- 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1864 dhaaidritwatwtsilgdeVLGIWSRHAEkaDVNCACVSHSGISLVTGDDFGMVKLFDfpcPEKFAKHKRFLGHSPHVT 1943
Cdd:COG2319    154 --------------------LLRTLTGHSG--AVTSVAFSPDGKLLASGSDDGTVRLWD---LATGKLLRTLTGHTGAVR 208
                          250       260       270
                   ....*....|....*....|....*....|....
gi 1867163826 1944 NIRFTSGDRHVVSaGGDDCSLFVW-----KCVHT 1972
Cdd:COG2319    209 SVAFSPDGKLLAS-GSADGTVRLWdlatgKLLRT 241
ANAPC4_WD40 pfam12894
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ...
1746-1828 1.66e-05

Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain. The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,


Pssm-ID: 403945 [Multi-domain]  Cd Length: 91  Bit Score: 44.96  E-value: 1.66e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1746 VCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRR-CAIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRISYCK 1824
Cdd:pfam12894    1 MSWCPTMDLIALATEDGELLLHRLNWQRVWTLSPDKEdLEVTSLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFSAGS 80

                   ....
gi 1867163826 1825 DIPS 1828
Cdd:pfam12894   81 DLIT 84
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
760-802 4.13e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 39.60  E-value: 4.13e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|...
gi 1867163826   760 TIKPLSILKGHHQYgVSAVDFSADGKRLASVGIDdsHTVVLWD 802
Cdd:smart00320    1 SGELLKTLKGHTGP-VTSVAFSPDGKYLASGSDD--GTIKLWD 40
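
Reading note: in each alignment row above, the two numbers flanking the sequence appear to be the first and last residue positions covered by that row, with "-" marking gap columns and lowercase letters marking residues outside the aligned model columns. Under that reading, the count of letters in a row should equal end - start + 1. The following Python sketch checks this for the two rows of the smart00320 hit just above; the function names are hypothetical and are not part of any NCBI tool.

```python
def parse_row(line):
    """Split an alignment row into (label, start, sequence, end).

    Assumes the last three whitespace-separated tokens are
    start position, sequence string, end position.
    """
    *label, start, seq, end = line.split()
    return " ".join(label), int(start), seq, int(end)


def residue_count(seq):
    """Count residues: every letter (upper or lower case), gaps excluded."""
    return sum(ch.isalpha() for ch in seq)


def row_is_consistent(line):
    """True if the residue count matches the coordinate span of the row."""
    _, start, seq, end = parse_row(line)
    return residue_count(seq) == end - start + 1


# Rows copied from the smart00320 hit (interval 760-802) above.
QUERY_ROW = "gi 1867163826   760 TIKPLSILKGHHQYgVSAVDFSADGKRLASVGIDdsHTVVLWD 802"
MODEL_ROW = "Cdd:smart00320    1 SGELLKTLKGHTGP-VTSVAFSPDGKYLASGSDD--GTIKLWD 40"

if __name__ == "__main__":
    for row in (QUERY_ROW, MODEL_ROW):
        label, start, seq, end = parse_row(row)
        print(label, start, end, residue_count(seq), row_is_consistent(row))
```

A row that fails this check most likely lost characters during copy and paste, which is worth ruling out before positions are mapped back onto the protein.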
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
1695-1727 4.65e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 39.60  E-value: 4.65e-04
                            10        20        30
                    ....*....|....*....|....*....|...
gi 1867163826  1695 NGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWD 1727
Cdd:smart00320    9 KGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
314-353 1.42e-03

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 38.10  E-value: 1.42e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1867163826  314 NKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 353
Cdd:pfam00400    1 GKLLKTLEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
315-353 2.43e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 37.29  E-value: 2.43e-03
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 1867163826   315 KPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 353
Cdd:smart00320    3 ELLKTLKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
762-802 5.47e-03

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 36.55  E-value: 5.47e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1867163826  762 KPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWD 802
Cdd:pfam00400    2 KLLKTLEGHTG-SVTSLAFSPDGKLLASGSDD--GTVKVWD 39
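
Reading note: every hit above carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", optionally with a bracketed flag such as [Multi-domain]. As a minimal sketch, assuming only that line layout as it appears in this listing, the fields can be pulled out with a regular expression; the pattern and the function name parse_summary are illustrative and are not an NCBI API.

```python
import re

# Pattern for the per-hit summary line as it appears in this listing.
# The bracketed flag (e.g. [Multi-domain]) is optional.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "flag": m.group("flag"),  # None when no bracketed flag is present
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


if __name__ == "__main__":
    # Two summary lines copied from the hits above.
    lines = [
        "Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 36.55  E-value: 5.47e-03",
        "Pssm-ID: 460922  Cd Length: 72  Bit Score: 82.60  E-value: 7.07e-19",
    ]
    for line in lines:
        print(parse_summary(line))
```

Once parsed, hits can be sorted or filtered by e_value, for example to separate the short single-repeat matches from the multi-repeat WD40 matches.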
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
725-1063 1.74e-34

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 134.77  E-value: 1.74e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  725 GHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWDWK 804
Cdd:cd00200      7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKGHTG-PVRDVAASADGTYLASGSSD--KTIRLWDLE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  805 KGEKLSIARGSKDKIFVVKMNPYvpDKLITAGIKH--MKFWRKAGGGLIGRkgyigTLGKNDTMMCAVYGWTEEMAFSGT 882
Cdd:cd00200     82 TGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVETGKCLTT-----LRGHTDWVNSVAFSPDGTFVASSS 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  883 STGDVCIW--RDIFLVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYaikraalapgskgllledn 958
Cdd:cd00200    155 QDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTL------------------- 215
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  959 psiraislghghilvgtkngeilevdksgpitllvQGHmEGEVWGLATHPYLPICATVSDDKTLRIWDLSPSHCMLAVRK 1038
Cdd:cd00200    216 -----------------------------------RGH-ENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG 259
                          330       340
                   ....*....|....*....|....*
gi 1867163826 1039 LKKGGRCCCFSPDGKALAVGLNDGS 1063
Cdd:cd00200    260 HTNSVTSLAWSPDGKRLASGSADGT 284
WD40 COG2319
WD40 repeat [General function prediction only];
720-1112 6.02e-34

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 136.58  E-value: 6.02e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  720 QRFYLGHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQYgVSAVDFSADGKRLASVGIDdsHTVV 799
Cdd:COG2319     71 LATLLGHTAAVLSVAFSPDGRLLASA--SADGTVRLWDLATGLLLRTLTGHTGA-VRSVAFSPDGKTLASGSAD--GTVR 145
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  800 LWDWKKGEKLSIARGSKDKIFVVKMNPyvpD--KLITAGI-KHMKFWRKAGGGLIgrkgyigtlgkndtmmcavygwtee 876
Cdd:COG2319    146 LWDLATGKLLRTLTGHSGAVTSVAFSP---DgkLLASGSDdGTVRLWDLATGKLL------------------------- 197
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  877 mafsgtstgdvciwrdiflvKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYAikraalapgskgll 954
Cdd:COG2319    198 --------------------RTLTGHTGAVRSVAFSPDGklLASGSADGTVRLWDLATGKLLRTLT-------------- 243
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  955 lEDNPSIRAISLGH-GHILV-GTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHP---YLpicATVSDDKTLRIWDLS 1028
Cdd:COG2319    244 -GHSGSVRSVAFSPdGRLLAsGSADGTVRLWDlATGELLRTLTGH-SGGVNSVAFSPdgkLL---ASGSDDGTVRLWDLA 318
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1029 PSHCMLAVRKLKKGGRCCCFSPDGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFID 1108
Cdd:COG2319    319 TGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSP-DGRTLASGSADGTVR 397

                   ....
gi 1867163826 1109 IYNV 1112
Cdd:COG2319    398 LWDL 401
WD40 COG2319
WD40 repeat [General function prediction only];
58-438 9.65e-31

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 126.95  E-value: 9.65e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826   58 RGHSDDIISLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKDvHTHGIACLAFDLDGQRLVSVGLDskNAVCVWDW 137
Cdd:COG2319     75 LGHTAAVLSVAFSPDGRLLASASADGT--VRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGSAD--GTVRLWDL 149
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  138 KRGKMLSMAPGHTDRIFDISWDlyqPNklvscgvkhikfwslcgnaltpkrgvfGKTgdlqtilcLAcardeltySGALN 217
Cdd:COG2319    150 ATGKLLRTLTGHSGAVTSVAFS---PD---------------------------GKL--------LA--------SGSDD 183
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  218 GDIYVW--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIDLRETdqgykglSVRSVCWR 292
Cdd:COG2319    184 GTVRLWdlATGKLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLaTGKLLRTLTGHSG-------SVRSVAFS 255
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  293 --GDHILVGTQDSEIFeIVVQERNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCN-MEEPI 369
Cdd:COG2319    256 pdGRLLASGSADGTVR-LWDLATGELLRTLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTgHTGAV 333
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1867163826  370 RCAAVNADGIHLALGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSPDGTYLAVGCNDSSVDIYGVA 438
Cdd:COG2319    334 RSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
WD40 COG2319
WD40 repeat [General function prediction only];
1425-1812 1.28e-30

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 126.56  E-value: 1.28e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1425 LTVNQHPKFINIVATGQVGDSADMSATAPSIHIWDAMNKQTLSILRcYHSKGVCSVSFSATGKLLLSVGLDpeHTITIWR 1504
Cdd:COG2319     72 ATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLT-GHTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1505 WQEGAKIASRAGHNQRIFVAEFRPDSdTQFVSVGV-KHVKFWTLAGRALLSkkgllsTLEdARMQTMLAIAFGANNLTF- 1582
Cdd:COG2319    149 LATGKLLRTLTGHSGAVTSVAFSPDG-KLLASGSDdGTVRLWDLATGKLLR------TLT-GHTGAVRSVAFSPDGKLLa 220
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1583 TGTISGDVCVWK-DHILCRIVARAHNGPVFAMyTTLRDG-LIVTGGkerpskEGGAVKLWDqelrrcrafrLETGQatdC 1660
Cdd:COG2319    221 SGSADGTVRLWDlATGKLLRTLTGHSGSVRSV-AFSPDGrLLASGS------ADGTVRLWD----------LATGE---L 280
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1661 VRSVcrgkgkilvgtrnaeiievgeknaacnilvnGHVDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlG 1740
Cdd:COG2319    281 LRTL-------------------------------TGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT-G 328
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1867163826 1741 HAA--RTVCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCAIHDIRFSPDSRYLAVGSSENSVDFYDLT 1812
Cdd:COG2319    329 HTGavRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
57-353 4.87e-30

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 122.06  E-value: 4.87e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826   57 YRGHSDDIISLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKdVHTHGIACLAFDLDGQRLVSVGLDskNAVCVWD 136
Cdd:cd00200      5 LKGHTGGVTCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSSD--KTIRLWD 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  137 WKRGKMLSMAPGHTDRIFDISWDlyqPNK--LVSCGV-KHIKFWSLcgNALTPKRGVFGKTGDlqtILCLACARDE-LTY 212
Cdd:cd00200     80 LETGECVRTLTGHTSYVSSVAFS---PDGriLSSSSRdKTIKVWDV--ETGKCLTTLRGHTDW---VNSVAFSPDGtFVA 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  213 SGALNGDIYVW--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIDlretdqgYKGLSVR 287
Cdd:cd00200    152 SSSQDGTIKLWdlRTGKCVATLTG-HTGEVNSVAFSPDGekLLSSSSDGTIKLWDLsTGKCLGTLR-------GHENGVN 223
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  288 SVCW--RGDHILVGTQDS--EIFEIVVQERNKPFlimQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 353
Cdd:cd00200    224 SVAFspDGYLLASGSEDGtiRVWDLRTGECVQTL---SGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
52-354 5.08e-30


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 125.02  E-value: 5.08e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826   52 HRQKFYRGHSDDIISLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKDvHTHGIACLAFDLDGQRLVSVGLDskNA 131
Cdd:COG2319    111 LLLRTLTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GT 185
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  132 VCVWDWKRGKMLSMAPGHTDRIFDISWDlyqPN--KLVSCGV-KHIKFWSLcgnaltpKRGVFGKT--GDLQTILCLACA 206
Cdd:COG2319    186 VRLWDLATGKLLRTLTGHTGAVRSVAFS---PDgkLLASGSAdGTVRLWDL-------ATGKLLRTltGHSGSVRSVAFS 255
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  207 RDELT-YSGALNGDIYVW--KGINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIdlretdQG 280
Cdd:COG2319    256 PDGRLlASGSADGTVRLWdlATGELLRTLTG-HSGGVNSVAFSPDGklLASGSDDGTVRLWDLaTGKLLRTL------TG 328
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1867163826  281 YKGlSVRSVCWR--GDHILVGTQDSEI--FEIvvqERNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSL 354
Cdd:COG2319    329 HTG-AVRSVAFSpdGKTLASGSDDGTVrlWDL---ATGELLRTLTGH-TGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
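
Each alignment block pairs a query row (gi ...) with a model row (Cdd:...), both carrying start and end coordinates. The sketch below parses the first row pair of this alignment and counts aligned and identical columns. It assumes that "-" marks a gap and that lowercase letters mark unaligned residues; the names parse_row and compare_rows are illustrative, not an NCBI API.

```python
import re

# label, start coordinate, aligned sequence, end coordinate
ROW = re.compile(
    r"^(?P<label>.+?)\s+(?P<start>\d+)\s+(?P<seq>[A-Za-z-]+)\s+(?P<end>\d+)$"
)

QUERY = "gi 1867163826   52 HRQKFYRGHSDDIISLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKDvHTHGIACLAFDLDGQRLVSVGLDskNA 131"
MODEL = "Cdd:COG2319    111 LLLRTLTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GT 185"


def parse_row(line):
    """Split one alignment row into (label, start, sequence, end)."""
    m = ROW.match(line.strip())
    if m is None:
        raise ValueError(f"not an alignment row: {line!r}")
    return m["label"], int(m["start"]), m["seq"], int(m["end"])


def compare_rows(query_line, model_line):
    """Count aligned columns and identical residues for one row pair."""
    _, _, q_seq, _ = parse_row(query_line)
    _, _, m_seq, _ = parse_row(model_line)
    if len(q_seq) != len(m_seq):
        raise ValueError("rows have different column counts")
    aligned = identical = 0
    for q, s in zip(q_seq, m_seq):
        # Skip gap columns and lowercase (unaligned) residues.
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        identical += q == s
    return aligned, identical


aligned, identical = compare_rows(QUERY, MODEL)
print(f"{identical}/{aligned} identical columns ({identical / aligned:.1%})")
```

Summing these counts over every row pair of a hit gives a per-hit identity figure that can be read alongside the reported bit score and E-value.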
WD40 COG2319
WD40 repeat [General function prediction only];
719-1063 6.18e-30


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 124.64  E-value: 6.18e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  719 TQRFYLGHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTV 798
Cdd:COG2319    112 LLRTLTGHTGAVRSVAFSPDGKTLASG--SADGTVRLWDLATGKLLRTLTGHSG-AVTSVAFSPDGKLLASGSDD--GTV 186
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  799 VLWDWKKGEKLSIARGSKDKIFVVKMNPyvpD--KLITAGI-KHMKFWRKAGGGLIGrkgyigTLGKNDtmmcavyGWTE 875
Cdd:COG2319    187 RLWDLATGKLLRTLTGHTGAVRSVAFSP---DgkLLASGSAdGTVRLWDLATGKLLR------TLTGHS-------GSVR 250
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  876 EMAFS--------GTSTGDVCIWR--DIFLVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYaikr 943
Cdd:COG2319    251 SVAFSpdgrllasGSADGTVRLWDlaTGELLRTLTGHSGGVNSVAFSPDGklLASGSDDGTVRLWDLATGKLLRTL---- 326
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  944 aalapgskglllednpsiraislghghilvgtkngeilevdksgpitllvQGHmEGEVWGLATHPYLPICATVSDDKTLR 1023
Cdd:COG2319    327 --------------------------------------------------TGH-TGAVRSVAFSPDGKTLASGSDDGTVR 355
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|
gi 1867163826 1024 IWDLSPSHCMLAVRKLKKGGRCCCFSPDGKALAVGLNDGS 1063
Cdd:COG2319    356 LWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGT 395
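
Every hit is introduced by a summary line giving the Pssm-ID, the model length (Cd Length), the bit score and the E-value. A minimal parsing sketch follows, assuming the field layout seen on this page; parse_summary and model_coverage are hypothetical helper names. The coverage calculation uses the model coordinates of the alignment above (112 to 395 of a 403-residue model).

```python
import re

SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>\S+)"
)


def parse_summary(line):
    """Turn one hit summary line into a dict of typed fields."""
    m = SUMMARY.search(line)
    if m is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["multi"] is not None,
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "evalue": float(m["evalue"]),
    }


def model_coverage(model_start, model_end, cd_length):
    """Fraction of the model spanned by the alignment (inclusive coordinates)."""
    return (model_end - model_start + 1) / cd_length


hit = parse_summary(
    "Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 124.64  E-value: 6.18e-30"
)
coverage = model_coverage(112, 395, hit["cd_length"])
print(hit, f"coverage={coverage:.2f}")
```

The [Multi-domain] tag is treated as optional because some summary lines on this page, such as the one for the HELP motif, omit it.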
WD40 COG2319
WD40 repeat [General function prediction only];
260-802 1.62e-27


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 117.32  E-value: 1.62e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  260 RLWDLTFKPITVIDLRETDQGYKGLSVRSVCWRGDHILVGTQDSEIFEIVVQERNKPFLIMQGHcEGELWALAVHPTKPL 339
Cdd:COG2319     14 ADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGH-TAAVLSVAFSPDGRL 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  340 AVTGSDDRSVRIWSLVDHALIARCNM-EEPIRCAAVNADGIHLALGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSP 418
Cdd:COG2319     93 LASASADGTVRLWDLATGLLLRTLTGhTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSP 172
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  419 DGTYLAVGCNDSSVDIYGVAQrykkvGECLGSL----SFITHLDWSSDSRYLQTNDGNGK-RLfYRMPGGKEVTSTEEik 493
Cdd:COG2319    173 DGKLLASGSDDGTVRLWDLAT-----GKLLRTLtghtGAVRSVAFSPDGKLLASGSADGTvRL-WDLATGKLLRTLTG-- 244
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  494 gvHWASWTCVSglevngiWpkysdindinSVDGnyigQVLVTADDYGIIKLFRypcLRKGAKFRKYIGHSAHVTNVRWSH 573
Cdd:COG2319    245 --HSGSVRSVA-------F----------SPDG----RLLASGSADGTVRLWD---LATGELLRTLTGHSGGVNSVAFSP 298
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  574 DYQWVISiGGADHSVFQWkfiperklkdavhiapqesladshsdesdsdlsdvpeldseieqetqltyrrqvykedlpql 653
Cdd:COG2319    299 DGKLLAS-GSDDGTVRLW-------------------------------------------------------------- 315
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  654 keqckekqksatskrrERAPGNSIRLHfvhgyrgydcrsnlfytqigeivyhvaavgviynrqqntqrfyLGHDDDILCL 733
Cdd:COG2319    316 ----------------DLATGKLLRTL-------------------------------------------TGHTGAVRSV 336
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1867163826  734 TIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWD 802
Cdd:COG2319    337 AFSPDGKTLASG--SDDGTVRLWDLATGELLRTLTGHTG-AVTSVAFSPDGRTLASGSAD--GTVRLWD 400
WD40 COG2319
WD40 repeat [General function prediction only];
1499-1967 1.81e-27


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 117.32  E-value: 1.81e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1499 TITIWRWQEGAKIASRAGHNQRIFVAEFRPDSDTQFVSVGVKHVKFWTLAGRALLSkkgllsTLEDARMQTMLAIAFGAN 1578
Cdd:COG2319     17 ALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLA------TLLGHTAAVLSVAFSPDG 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1579 NLTFTGTISGDVCVWK-DHILCRIVARAHNGPVFAMyTTLRDG-LIVTGGkerpskEGGAVKLWDqelrrcrafrLETGQ 1656
Cdd:COG2319     91 RLLASASADGTVRLWDlATGLLLRTLTGHTGAVRSV-AFSPDGkTLASGS------ADGTVRLWD----------LATGK 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1657 atdcvrsvcrgkgkiLVGTrnaeiievgeknaacnilVNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLNK 1736
Cdd:COG2319    154 ---------------LLRT------------------LTGH-SGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRT 199
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1737 VNlGHAA--RTVCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCAIHDIRFSPDSRYLAVGSSENSVDFYDLTLG 1814
Cdd:COG2319    200 LT-GHTGavRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATG 278
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1815 PTLNRIsycKDIPSFVIQMDFSADSSYLqVSSGCYKR-HVYEVPSGKhlmdhaaidritwatwtsilgdeVLGIWSRHAe 1893
Cdd:COG2319    279 ELLRTL---TGHSGGVNSVAFSPDGKLL-ASGSDDGTvRLWDLATGK-----------------------LLRTLTGHT- 330
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1867163826 1894 kADVNCACVSHSGISLVTGDDFGMVKLFDfpcPEKFAKHKRFLGHSPHVTNIRFTSGDRHVVSaGGDDCSLFVW 1967
Cdd:COG2319    331 -GAVRSVAFSPDGKTLASGSDDGTVRLWD---LATGELLRTLTGHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
WD40 COG2319
WD40 repeat [General function prediction only];
251-591 1.94e-26


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 114.24  E-value: 1.94e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  251 ATGGRDGCIRLWDLTFKPITVIDLRETDqgykglSVRSVCWR--GDHILVGTQDSEI--FEIvvqERNKPFLIMQGHcEG 326
Cdd:COG2319     94 ASASADGTVRLWDLATGLLLRTLTGHTG------AVRSVAFSpdGKTLASGSADGTVrlWDL---ATGKLLRTLTGH-SG 163
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  327 ELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCN-MEEPIRCAAVNADGIHLALGMKDGSFTVLRVRDMTEVVHIK 405
Cdd:COG2319    164 AVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLT 243
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  406 DRKEAIHELKYSPDGTYLAVGCNDSSVDIYGVAQrykkvGECLGSL----SFITHLDWSSDSRYLQTNDGNGKrlfyrmp 481
Cdd:COG2319    244 GHSGSVRSVAFSPDGRLLASGSADGTVRLWDLAT-----GELLRTLtghsGGVNSVAFSPDGKLLASGSDDGT------- 311
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  482 ggkevtsteeIKGVHWASWTCVSGLEVNGIWpkysdindINSVDGNYIGQVLVTADDYGIIKLFRypcLRKGAKFRKYIG 561
Cdd:COG2319    312 ----------VRLWDLATGKLLRTLTGHTGA--------VRSVAFSPDGKTLASGSDDGTVRLWD---LATGELLRTLTG 370
                          330       340       350
                   ....*....|....*....|....*....|
gi 1867163826  562 HSAHVTNVRWSHDYQWVISiGGADHSVFQW 591
Cdd:COG2319    371 HTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
148-435 3.80e-26


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 110.50  E-value: 3.80e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  148 GHTDRIFDISWDlYQPNKLVSCGV-KHIKFWSLCGNalTPKRGVFGKTGDLQTilCLACARDELTYSGALNGDIYVW--K 224
Cdd:cd00200      7 GHTGGVTCVAFS-PDGKLLATGSGdGTIKVWDLETG--ELLRTLKGHTGPVRD--VAASADGTYLASGSSDKTIRLWdlE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  225 GINLIRTIQGaHAAGIFSMNACEEG--FATGGRDGCIRLWDL-TFKPITVIdlretdQGYKGlSVRSVCWRGDHILV--G 299
Cdd:cd00200     82 TGECVRTLTG-HTSYVSSVAFSPDGriLSSSSRDKTIKVWDVeTGKCLTTL------RGHTD-WVNSVAFSPDGTFVasS 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  300 TQDSEIFEIVVQErNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCN-MEEPIRCAAVNADG 378
Cdd:cd00200    154 SQDGTIKLWDLRT-GKCVATLTGH-TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRgHENGVNSVAFSPDG 231
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1867163826  379 IHLALGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSPDGTYLAVGCNDSSVDIY 435
Cdd:cd00200    232 YLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIW 288
WD40 COG2319
WD40 repeat [General function prediction only];
1454-1730 1.03e-24


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 108.85  E-value: 1.03e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1454 SIHIWDAMNKQTLSILRCyHSKGVCSVSFSATGKLLLSVGLDpeHTITIWRWQEGAKIASRAGHNQRIFVAEFRPDSDTq 1533
Cdd:COG2319    143 TVRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKL- 218
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1534 FVSVGV-KHVKFWTLAGRALLSkkgllsTLEDARmQTMLAIAFGANNLTF-TGTISGDVCVWK-DHILCRIVARAHNGPV 1610
Cdd:COG2319    219 LASGSAdGTVRLWDLATGKLLR------TLTGHS-GSVRSVAFSPDGRLLaSGSADGTVRLWDlATGELLRTLTGHSGGV 291
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1611 FAMYTTLRDGLIVTGGkerpskEGGAVKLWDQELRRC-RAFRLETGQatdcVRSVC-RGKGKILVGTRNAEIIEVGE-KN 1687
Cdd:COG2319    292 NSVAFSPDGKLLASGS------DDGTVRLWDLATGKLlRTLTGHTGA----VRSVAfSPDGKTLASGSDDGTVRLWDlAT 361
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|...
gi 1867163826 1688 AACNILVNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIAD 1730
Cdd:COG2319    362 GELLRTLTGH-TGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
WD40 COG2319
WD40 repeat [General function prediction only];
1425-1922 8.61e-24


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 106.15  E-value: 8.61e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1425 LTVNQHPKFINIVATGQVGDSADMSATAPSIHIWDAMNKQTLSILRcYHSKGVCSVSFSATGKLLLSVGLDpeHTITIWR 1504
Cdd:COG2319     30 LLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLL-GHTAAVLSVAFSPDGRLLASASAD--GTVRLWD 106
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1505 WQEGAKIASRAGHNQRIFVAEFRPDSDTqFVSVGV-KHVKFWTLAGRALLSkkgllstledarmqtmlaiafgannlTFT 1583
Cdd:COG2319    107 LATGLLLRTLTGHTGAVRSVAFSPDGKT-LASGSAdGTVRLWDLATGKLLR--------------------------TLT 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1584 GtisgdvcvwkdhilcrivaraHNGPVFAMyTTLRDG-LIVTGGkerpskEGGAVKLWDqelrrcrafrLETGQatdCVR 1662
Cdd:COG2319    160 G---------------------HSGAVTSV-AFSPDGkLLASGS------DDGTVRLWD----------LATGK---LLR 198
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1663 SVcrgkgkilvgtrnaeiievgeknaacnilvNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlGHA 1742
Cdd:COG2319    199 TL------------------------------TGH-TGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLT-GHS 246
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1743 A--RTVCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCAIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRI 1820
Cdd:COG2319    247 GsvRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTL 326
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1821 sycKDIPSFVIQMDFSADSSYLQVSSGCYKRHVYEVPSGKhlmdhaaidritwatwtsilgdeVLGIWSRHAekADVNCA 1900
Cdd:COG2319    327 ---TGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGE-----------------------LLRTLTGHT--GAVTSV 378
                          490       500
                   ....*....|....*....|..
gi 1867163826 1901 CVSHSGISLVTGDDFGMVKLFD 1922
Cdd:COG2319    379 AFSPDGRTLASGSADGTVRLWD 400
WD40 COG2319
WD40 repeat [General function prediction only];
895-1266 1.93e-23


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 104.99  E-value: 1.93e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  895 LVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTY-----AIKRAALAPGSKglllednpsiraislg 967
Cdd:COG2319     70 LLATLLGHTAAVLSVAFSPDGrlLASASADGTVRLWDLATGLLLRTLtghtgAVRSVAFSPDGK---------------- 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  968 hgHILVGTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHP---YLpicATVSDDKTLRIWDLSPSHCMLAVRKLKKGG 1043
Cdd:COG2319    134 --TLASGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGKLLRTLTGHTGAV 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1044 RCCCFSPDGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFIDIYNVMSSKRVGICKG 1123
Cdd:COG2319    208 RSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSP-DGRLLASGSADGTVRLWDLATGELLRTLTG 286
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1124 ATSYITHIDWDIRGKLLqvntgakeqlffeAPRGKKQTIpsveveKIawasWTSVLGLCcegIWPVIGEVTDVTASCLTS 1203
Cdd:COG2319    287 HSGGVNSVAFSPDGKLL-------------ASGSDDGTV------RL----WDLATGKL---LRTLTGHTGAVRSVAFSP 340
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1867163826 1204 DKMVLATGDDLGFVKLFRYPTKGKFGKFKryvAHSTHVTNVRWTYDDSMLVTlGGTDMSLMVW 1266
Cdd:COG2319    341 DGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
712-929 2.29e-21


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 96.64  E-value: 2.29e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  712 IYNRQQNTQRFYL-GHDDDILCLTIHPLKDYVATGqvGRDPSIHIWDTETIKPLSILKGHHQYgVSAVDFSADGKRLASV 790
Cdd:cd00200     77 LWDLETGECVRTLtGHTSYVSSVAFSPDGRILSSS--SRDKTIKVWDVETGKCLTTLRGHTDW-VNSVAFSPDGTFVASS 153
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  791 GIDdsHTVVLWDWKKGEKLSIARGSKDKIFVVKmnpYVPD--KLITAGI-KHMKFWRKAGGGLigrkgyIGTL-GKNDTM 866
Cdd:cd00200    154 SQD--GTIKLWDLRTGKCVATLTGHTGEVNSVA---FSPDgeKLLSSSSdGTIKLWDLSTGKC------LGTLrGHENGV 222
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1867163826  867 MCAVYGWTEEMAFSGTSTGDVCIW--RDIFLVKTVKAHDGPVFSM--HALEKGFVTGGKDGIVALWD 929
Cdd:cd00200    223 NSVAFSPDGYLLASGSEDGTIRVWdlRTGECVQTLSGHTNSVTSLawSPDGKRLASGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1473-1775 7.65e-21


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 95.09  E-value: 7.65e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1473 HSKGVCSVSFSATGKLLLSVGLDpeHTITIWRWQEGAKIASRAGHNQRIFVAEFRPDSdTQFVSVGV-KHVKFWTLagra 1551
Cdd:cd00200      8 HTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADG-TYLASGSSdKTIRLWDL---- 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1552 llSKKGLLSTLEDARmQTMLAIAFGANNLTFTGTIS-GDVCVWK-DHILCRIVARAHNGPVFAMyTTLRDGLIVTGgker 1629
Cdd:cd00200     81 --ETGECVRTLTGHT-SYVSSVAFSPDGRILSSSSRdKTIKVWDvETGKCLTTLRGHTDWVNSV-AFSPDGTFVAS---- 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1630 pSKEGGAVKLWD-QELRRCRAFRLETGQatdcVRSVC--RGKGKILVGTRNAEIIEVGEKNAACNILVNGHvDGPIWGLA 1706
Cdd:cd00200    153 -SSQDGTIKLWDlRTGKCVATLTGHTGE----VNSVAfsPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGH-ENGVNSVA 226
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1867163826 1707 THPSRDFFLSAAEDGTVRLWDIADKKMLNKVNlGHAAR--TVCYSPEGDMVAIGMKNGefiillvsSLKIW 1775
Cdd:cd00200    227 FSPDGYLLASGSEDGTIRVWDLRTGECVQTLS-GHTNSvtSLAWSPDGKRLASGSADG--------TIRIW 288
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
228-592 3.17e-20


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 93.17  E-value: 3.17e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  228 LIRTIQGaHAAGIFSM--NACEEGFATGGRDGCIRLWDLTFKpitviDLRETDQGYKGlSVRSVCWRGDH--ILVGTQDS 303
Cdd:cd00200      1 LRRTLKG-HTGGVTCVafSPDGKLLATGSGDGTIKVWDLETG-----ELLRTLKGHTG-PVRDVAASADGtyLASGSSDK 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  304 EIFeIVVQERNKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWSLVDHALIARCNM-EEPIRCAAVNADGIHLA 382
Cdd:cd00200     74 TIR-LWDLETGECVRTLTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGhTDWVNSVAFSPDGTFVA 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  383 LGMKDGSFTVLRVRDMTEVVHIKDRKEAIHELKYSPDGTYLAVGCNDSSVDIYGVAQRyKKVGECLGSLSFITHLDWSSD 462
Cdd:cd00200    152 SSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTG-KCLGTLRGHENGVNSVAFSPD 230
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  463 SRYLqtndgngkrlfyrmpggkevtsteeikgvhwaswtcvsglevngiwpkysdindinsvdgnyigqvlVTADDYGII 542
Cdd:cd00200    231 GYLL-------------------------------------------------------------------ASGSEDGTI 243
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|
gi 1867163826  543 KLFRypcLRKGAKFRKYIGHSAHVTNVRWSHDYQWVISiGGADHSVFQWK 592
Cdd:cd00200    244 RVWD---LRTGECVQTLSGHTNSVTSLAWSPDGKRLAS-GSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
895-1140 4.70e-20


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 92.78  E-value: 4.70e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  895 LVKTVKAHDGPVFSMHALEKG--FVTGGKDGIVALWDDSFERCLKTYAIKRAALApgskglllednpsiRAISLGHGH-I 971
Cdd:cd00200      1 LRRTLKGHTGGVTCVAFSPDGklLATGSGDGTIKVWDLETGELLRTLKGHTGPVR--------------DVAASADGTyL 66
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  972 LVGTKNGEILEVD-KSGPITLLVQGHmEGEVWGLATHPYLPICATVSDDKTLRIWDLSPSHCMLAVRKLKKGGRCCCFSP 1050
Cdd:cd00200     67 ASGSSDKTIRLWDlETGECVRTLTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSP 145
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1051 DGKALAVGLNDGSFLMANADTLEDLVSFHHRKDMISDIRFSPgSGKYLAVASHDSFIDIYNVMSSKRVGICKGATSYITH 1130
Cdd:cd00200    146 DGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSP-DGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNS 224
                          250
                   ....*....|
gi 1867163826 1131 IDWDIRGKLL 1140
Cdd:cd00200    225 VAFSPDGYLL 234
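
The WD40 description above characterizes each repeat as a GH dipeptide followed by a conserved core and a closing WD dipeptide. As a rough illustration only, the following Python sketch scans an aligned row for that GH...WD pattern. The function name and the gap bounds (13 to 35 residues between GH and WD) are assumptions chosen for this example, not values used by CDD; real WD40 repeats are degenerate, and this naive pattern will miss many of them (CD-Search itself relies on position-specific scoring matrices).

```python
import re


def find_gh_wd_candidates(seq, min_gap=13, max_gap=35):
    """Return (start, end, matched) tuples for naive GH...WD candidates.

    seq      -- one alignment row; '-' gap characters are removed and
                lowercase (unaligned) residues are uppercased first.
    min_gap, max_gap -- assumed bounds on the number of residues between
                the GH and WD dipeptides (illustrative, not from CDD).
    Coordinates are 0-based, end-exclusive, relative to the cleaned row.
    """
    cleaned = seq.replace("-", "").upper()
    # Lazy quantifier: stop at the first WD that satisfies the gap bounds.
    pattern = re.compile(r"GH.{%d,%d}?WD" % (min_gap, max_gap))
    return [(m.start(), m.end(), m.group()) for m in pattern.finditer(cleaned)]


if __name__ == "__main__":
    # Query residues 1695-1727, copied from one of the short repeat hits.
    row = "NGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWD"
    for start, end, matched in find_gh_wd_candidates(row):
        print(start, end, matched)
```

Coordinates are relative to the cleaned row, so they would need to be offset by the row's start position (1695 in the usage example) to map back onto the protein.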
HELP pfam03451
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ...
2-49 7.07e-19

HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved among metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.


Pssm-ID: 460922  Cd Length: 72  Bit Score: 82.60  E-value: 7.07e-19
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 1867163826    2 AARSAPSCHLRLEWVYGYRGHQCRNNLYYTAAKEIVYFVAGVGVVYSP 49
Cdd:pfam03451   25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1694-1967 2.87e-18

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 87.39  E-value: 2.87e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1694 VNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLnKVNLGHAA--RTVCYSPEGDMVAIGMKNGefiillvsS 1771
Cdd:cd00200      5 LKGH-TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELL-RTLKGHTGpvRDVAASADGTYLASGSSDK--------T 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1772 LKIWgKKRDRRC---------AIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRISYCKDipsFVIQMDFSADSSYl 1842
Cdd:cd00200     75 IRLW-DLETGECvrtltghtsYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD---WVNSVAFSPDGTF- 149
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1843 qVSSGCYKR--HVYEVPSGKHLMDHaaidritwatwtsilgdevlgiwsrHAEKADVNCACVSHSGISLVTGDDFGMVKL 1920
Cdd:cd00200    150 -VASSSQDGtiKLWDLRTGKCVATL-------------------------TGHTGEVNSVAFSPDGEKLLSSSSDGTIKL 203
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*..
gi 1867163826 1921 FDfpcPEKFAKHKRFLGHSPHVTNIRFtSGDRHVVSAGGDDCSLFVW 1967
Cdd:cd00200    204 WD---LSTGKCLGTLRGHENGVNSVAF-SPDGYLLASGSEDGTIRVW 246
HELP pfam03451
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ...
672-715 9.08e-18

HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved among metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.


Pssm-ID: 460922  Cd Length: 72  Bit Score: 79.13  E-value: 9.08e-18
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 1867163826  672 APGNSIRLHFVHGYRGYDCRSNLFYTQIGEIVYHVAAVGVIYNR 715
Cdd:pfam03451   29 PPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1415-1727 1.95e-17

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 85.08  E-value: 1.95e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1415 YQEHNDDILCLTVNQHPKFInivATGqvgdSADmsataPSIHIWDaMNKQTLSILRCYHSKGVCSVSFSATGKLLLSVGL 1494
Cdd:cd00200      5 LKGHTGGVTCVAFSPDGKLL---ATG----SGD-----GTIKVWD-LETGELLRTLKGHTGPVRDVAASADGTYLASGSS 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1495 DpeHTITIWRWQEGAKIASRAGHNQRIFVAEFRPDSDTQFVSVGVKHVKFWTLAgrallsKKGLLSTLEDARMQTMlAIA 1574
Cdd:cd00200     72 D--KTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVE------TGKCLTTLRGHTDWVN-SVA 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1575 F-GANNLTFTGTISGDVCVWKDHIL-CRIVARAHNGPVFAMYTTLRDGLIVTGGkerpskEGGAVKLWDQELRRCRA-FR 1651
Cdd:cd00200    143 FsPDGTFVASSSQDGTIKLWDLRTGkCVATLTGHTGEVNSVAFSPDGEKLLSSS------SDGTIKLWDLSTGKCLGtLR 216
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1867163826 1652 LETGQATDCVRSvcrGKGKILVGTRNAEIIEVGE-KNAACNILVNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWD 1727
Cdd:cd00200    217 GHENGVNSVAFS---PDGYLLASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
HELP pfam03451
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ...
1335-1407 2.67e-15

HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved among metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.


Pssm-ID: 460922  Cd Length: 72  Bit Score: 72.20  E-value: 2.67e-15
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1867163826 1335 RQGVVRPPVSRAPPqpeklqTNNVGKKKRPIEDLVLELIFGYRGRDCRNNVHYLNDGdDIIYHTASVGILHNV 1407
Cdd:pfam03451    7 RPGAVYPPSNYYPK------DDLDQKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYDV 72
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1661-1968 2.45e-14

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 75.83  E-value: 2.45e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1661 VRSVC--RGKGKILVGTRNAEIIEVGEKNAACNILVNGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVN 1738
Cdd:cd00200     12 VTCVAfsPDGKLLATGSGDGTIKVWDLETGELLRTLKGH-TGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTLT 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1739 lGHAA--RTVCYSPEGDMVAIGMKNGefiillvsSLKIW----GKK----RDRRCAIHDIRFSPDSRYLAVGSSENSVDF 1808
Cdd:cd00200     91 -GHTSyvSSVAFSPDGRILSSSSRDK--------TIKVWdvetGKClttlRGHTDWVNSVAFSPDGTFVASSSQDGTIKL 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1809 YDLTLGPTLNRIsycKDIPSFVIQMDFSADSSYLQVSSGCYKRHVYEVPSGKHLMDHaaidritwatwtsilgdevlgiw 1888
Cdd:cd00200    162 WDLRTGKCVATL---TGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTL----------------------- 215
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1889 srHAEKADVNCACVSHSGISLVTGDDFGMVKLFDFpcpEKFAKHKRFLGHSPHVTNIRFtSGDRHVVSAGGDDCSLFVWK 1968
Cdd:cd00200    216 --RGHENGVNSVAFSPDGYLLASGSEDGTIRVWDL---RTGECVQTLSGHTNSVTSLAW-SPDGKRLASGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
46-178 4.66e-12

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 68.90  E-value: 4.66e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826   46 VYSPREHR-QKFYRGHSDDIISLALHPERVLVATGQVGKEpyICIWDSYTVQTISVLKdVHTHGIACLAFDLDGQRLVSV 124
Cdd:cd00200    161 LWDLRTGKcVATLTGHTGEVNSVAFSPDGEKLLSSSSDGT--IKLWDLSTGKCLGTLR-GHENGVNSVAFSPDGYLLASG 237
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1867163826  125 GLDSKnaVCVWDWKRGKMLSMAPGHTDRIFDISWDlYQPNKLVSCGV-KHIKFWS 178
Cdd:cd00200    238 SEDGT--IRVWDLRTGECVQTLSGHTNSVTSLAWS-PDGKRLASGSAdGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
1706-1972 2.79e-09

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 61.47  E-value: 2.79e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1706 ATHPSRDFFLSAAEDGTVRLWDIADKKMLNKVNLGHAARTVC-YSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRRCA 1784
Cdd:COG2319      1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLaASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1785 IHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRIsycKDIPSFVIQMDFSADSSYLqVSSGCYKR-HVYEVPSGKhlm 1863
Cdd:COG2319     81 VLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTL---TGHTGAVRSVAFSPDGKTL-ASGSADGTvRLWDLATGK--- 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1864 dhaaidritwatwtsilgdeVLGIWSRHAEkaDVNCACVSHSGISLVTGDDFGMVKLFDfpcPEKFAKHKRFLGHSPHVT 1943
Cdd:COG2319    154 --------------------LLRTLTGHSG--AVTSVAFSPDGKLLASGSDDGTVRLWD---LATGKLLRTLTGHTGAVR 208
                          250       260       270
                   ....*....|....*....|....*....|....
gi 1867163826 1944 NIRFTSGDRHVVSaGGDDCSLFVW-----KCVHT 1972
Cdd:COG2319    209 SVAFSPDGKLLAS-GSADGTVRLWdlatgKLLRT 241
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1782-1972 3.21e-07

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 54.26  E-value: 3.21e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1782 RCAIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRisyCKDIPSFVIQMDFSADSSYLQVSSGCYKRHVYEVPSGK- 1860
Cdd:cd00200      9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRT---LKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGEc 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1861 --HLMDHAaiDRITWATW-------TSILGDEVLGIWSRHAEK---------ADVNCACVSHSGISLVTGDDFGMVKLFD 1922
Cdd:cd00200     86 vrTLTGHT--SYVSSVAFspdgrilSSSSRDKTIKVWDVETGKclttlrghtDWVNSVAFSPDGTFVASSSQDGTIKLWD 163
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1867163826 1923 FPCPEKFakhKRFLGHSPHVTNIRFtSGDRHVVSAGGDDCSLFVW-----KCVHT 1972
Cdd:cd00200    164 LRTGKCV---ATLTGHTGEVNSVAF-SPDGEKLLSSSSDGTIKLWdlstgKCLGT 214
ANAPC4_WD40 pfam12894
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ...
1746-1828 1.66e-05

Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain. The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,


Pssm-ID: 403945 [Multi-domain]  Cd Length: 91  Bit Score: 44.96  E-value: 1.66e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826 1746 VCYSPEGDMVAIGMKNGEFIILLVSSLKIWGKKRDRR-CAIHDIRFSPDSRYLAVGSSENSVDFYDLTLGPTLNRISYCK 1824
Cdd:pfam12894    1 MSWCPTMDLIALATEDGELLLHRLNWQRVWTLSPDKEdLEVTSLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFSAGS 80

                   ....
gi 1867163826 1825 DIPS 1828
Cdd:pfam12894   81 DLIT 84
WD40 pfam00400
WD domain, G-beta repeat;
1695-1727 3.62e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 39.64  E-value: 3.62e-04
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1867163826 1695 NGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWD 1727
Cdd:pfam00400    8 EGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
760-802 4.13e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 39.60  E-value: 4.13e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|...
gi 1867163826   760 TIKPLSILKGHHQYgVSAVDFSADGKRLASVGIDdsHTVVLWD 802
Cdd:smart00320    1 SGELLKTLKGHTGP-VTSVAFSPDGKYLASGSDD--GTIKLWD 40
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
1695-1727 4.65e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 39.60  E-value: 4.65e-04
                            10        20        30
                    ....*....|....*....|....*....|...
gi 1867163826  1695 NGHvDGPIWGLATHPSRDFFLSAAEDGTVRLWD 1727
Cdd:smart00320    9 KGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
314-353 1.42e-03

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 38.10  E-value: 1.42e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1867163826  314 NKPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 353
Cdd:pfam00400    1 GKLLKTLEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
WDR74 cd22857
WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and ...
250-354 2.31e-03

WD repeat-containing protein 74; WDR74 (WD repeat-containing protein 74) from mammals and plants is an essential factor for ribosome assembly. In cooperation with the assembly factor NVL2, WDR74 participates in an early cleavage of the pre-rRNA processing pathway. NVL2 is a type II double-ring AAA-ATPase that may mediate the release of WDR74 from nucleolar pre-60S particles. WDR74 has been implicated in tumorigenesis. In lung cancer, it regulates cell proliferation, cell cycle progression, chemoresistance and cell aggressiveness by inducing nuclear beta-catenin accumulation and driving expression of downstream Wnt-responsive genes. In melanoma, it promotes apoptosis resistance and aggressive behavior by regulating the RPL5-MDM2-p53 pathway. WDR74 contains an N-terminal seven-bladed beta-propeller WD40 domain that associates with the D1-AAA domain of the AAA-ATPase NVL2, and a flexible lysine-rich C-terminus that extends outward from the WD40 domain and is required for nucleolar localization.


Pssm-ID: 439303 [Multi-domain]  Cd Length: 325  Bit Score: 42.21  E-value: 2.31e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1867163826  250 FATGGRDGCIRLWDLTF--KPITVIDLRETdqGYKGLSVRSvcwRGDHILVGTQDSEIFEIVVQErNKPFLIMQGHCEGE 327
Cdd:cd22857    195 IVTGTGYHQVRLYDTRAqrRPVVSVDFGET--PIKAVAEDP---DGHTVYVGDTSGDLASIDLRT-GKLLGCFKGKCGGS 268
                           90       100
                   ....*....|....*....|....*..
gi 1867163826  328 LWALAVHPTKPLAVTGSDDRSVRIWSL 354
Cdd:cd22857    269 IRSIARHPELPLIASCGLDRYLRIWDT 295
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
315-353 2.43e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 37.29  E-value: 2.43e-03
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 1867163826   315 KPFLIMQGHcEGELWALAVHPTKPLAVTGSDDRSVRIWS 353
Cdd:smart00320    3 ELLKTLKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
994-1026 3.53e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 36.91  E-value: 3.53e-03
                            10        20        30
                    ....*....|....*....|....*....|...
gi 1867163826   994 QGHmEGEVWGLATHPYLPICATVSDDKTLRIWD 1026
Cdd:smart00320    9 KGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
762-802 5.47e-03

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 36.55  E-value: 5.47e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1867163826  762 KPLSILKGHHQyGVSAVDFSADGKRLASVGIDdsHTVVLWD 802
Cdd:pfam00400    2 KLLKTLEGHTG-SVTSLAFSPDGKLLASGSDD--GTVKVWD 39
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
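
To illustrate how the E-value threshold relates to the hit list, here is a minimal Python sketch that parses the per-hit summary lines ("Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...") from a plain-text copy of this page and keeps hits at or below a cutoff. The function names, the dictionary keys, and the inclusive comparison are assumptions made for this example; this is not an NCBI tool or API.

```python
import re

# Matches the summary line printed above each alignment on this page.
HIT_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hits(text):
    """Extract one dict per summary line found in the page text."""
    hits = []
    for line in text.splitlines():
        m = HIT_LINE.search(line)
        if m:
            hits.append({
                "pssm_id": int(m.group("pssm_id")),
                "multi_domain": "[Multi-domain]" in line,
                "cd_length": int(m.group("cd_length")),
                "bit_score": float(m.group("bit_score")),
                "e_value": float(m.group("e_value")),
            })
    return hits


def filter_hits(hits, threshold=0.01):
    """Keep hits whose E-value is at or below the threshold (assumed inclusive)."""
    return [h for h in hits if h["e_value"] <= threshold]


if __name__ == "__main__":
    sample = (
        "Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 92.78  E-value: 4.70e-20\n"
        "Pssm-ID: 460922  Cd Length: 72  Bit Score: 82.60  E-value: 7.07e-19\n"
        "Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 36.55  E-value: 5.47e-03\n"
    )
    for hit in filter_hits(parse_hits(sample), threshold=1e-10):
        print(hit["pssm_id"], hit["e_value"])
```

A stricter cutoff such as `1e-10` keeps only the longer domain-level matches and drops the single-repeat hits near the page's 0.01 threshold.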

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.