NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|217416452|ref|NP_075462|]
View 

dynein axonemal intermediate chain 2 isoform 1 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
166-472 8.60e-17

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 81.23  E-value: 8.60e-17
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 166 THLSWHPDGNRkLAVAyscldfqrapvgmSSDS--YIWDLENpNKPELALK-PSSPLVTLEFNPkDSHVLLGGCYNGQIA 242
Cdd:cd00200   13 TCVAFSPDGKL-LATG-------------SGDGtiKVWDLET-GELLRTLKgHTGPVRDVAASA-DGTYLASGSSDKTIR 76
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 243 CWDTRKGSLVAELStiesSHRDPVYGTIWLQSktGTECFSASTDGQVMWWDIRkmSEPTEVVILDITKkeqlenalGAIS 322
Cdd:cd00200   77 LWDLETGECVRTLT----GHTSYVSSVAFSPD--GRILSSSSRDKTIKVWDVE--TGKCLTTLRGHTD--------WVNS 140
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 323 LEFestLPTKFMVGTeqgivISCNRKAK---TSAEKIVCTFPGHHGPIYALQrnpFYPKN---FLTVGDWTARIWSEDSr 396
Cdd:cd00200  141 VAF---SPDGTFVAS-----SSQDGTIKlwdLRTGKCVATLTGHTGEVNSVA---FSPDGeklLSSSSDGTIKLWDLST- 208
                        250       260       270       280       290       300       310
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 217416452 397 ESSIMWTKYHMAYLTDAAWSPVRpTVFFTTRMDGTLDIWDFMFEQCDPTLSLKvcDEALFCLRVQDNGCLIACGSQ 472
Cdd:cd00200  209 GKCLGTLRGHENGVNSVAFSPDG-YLLASGSEDGTIRVWDLRTGECVQTLSGH--TNSVTSLAWSPDGKRLASGSA 281
DUF4795 super family cl23731
Domain of unknown function (DUF4795); This family of proteins is functionally uncharacterized. ...
493-562 3.82e-04

Domain of unknown function (DUF4795); This family of proteins is functionally uncharacterized. This family of proteins is found in bacteria and eukaryotes. Proteins in this family are typically between 285 and 978 amino acids in length.


The actual alignment was detected with superfamily member pfam16043:

Pssm-ID: 464990 [Multi-domain]  Cd Length: 181  Bit Score: 41.52  E-value: 3.82e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 217416452  493 KNVASSMFERETRREKILEARHREMRLKEKGKAegrDEEQTDEELAV--DLEALVSKAEEEFFDIIFAELKK 562
Cdd:pfam16043  27 SETTSELSERLQQRQKHLEALYQQIEKLEKVKA---DKEVVEEELDEkaDKEALASKVSRDQFDETLEELNQ 95
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
166-472 8.60e-17

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 81.23  E-value: 8.60e-17
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 166 THLSWHPDGNRkLAVAyscldfqrapvgmSSDS--YIWDLENpNKPELALK-PSSPLVTLEFNPkDSHVLLGGCYNGQIA 242
Cdd:cd00200   13 TCVAFSPDGKL-LATG-------------SGDGtiKVWDLET-GELLRTLKgHTGPVRDVAASA-DGTYLASGSSDKTIR 76
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 243 CWDTRKGSLVAELStiesSHRDPVYGTIWLQSktGTECFSASTDGQVMWWDIRkmSEPTEVVILDITKkeqlenalGAIS 322
Cdd:cd00200   77 LWDLETGECVRTLT----GHTSYVSSVAFSPD--GRILSSSSRDKTIKVWDVE--TGKCLTTLRGHTD--------WVNS 140
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 323 LEFestLPTKFMVGTeqgivISCNRKAK---TSAEKIVCTFPGHHGPIYALQrnpFYPKN---FLTVGDWTARIWSEDSr 396
Cdd:cd00200  141 VAF---SPDGTFVAS-----SSQDGTIKlwdLRTGKCVATLTGHTGEVNSVA---FSPDGeklLSSSSDGTIKLWDLST- 208
                        250       260       270       280       290       300       310
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 217416452 397 ESSIMWTKYHMAYLTDAAWSPVRpTVFFTTRMDGTLDIWDFMFEQCDPTLSLKvcDEALFCLRVQDNGCLIACGSQ 472
Cdd:cd00200  209 GKCLGTLRGHENGVNSVAFSPDG-YLLASGSEDGTIRVWDLRTGECVQTLSGH--TNSVTSLAWSPDGKRLASGSA 281
WD40 COG2319
WD40 repeat [General function prediction only];
150-436 2.40e-11

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 66.09  E-value: 2.40e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 150 KTINVF--RDPQEIKRAATH------LSWHPDGNRkLAVAyscldfqrapvgmSSDS--YIWDLENpNKPELALKPSSPL 219
Cdd:COG2319  142 GTVRLWdlATGKLLRTLTGHsgavtsVAFSPDGKL-LASG-------------SDDGtvRLWDLAT-GKLLRTLTGHTGA 206
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 220 VT-LEFNPkDSHVLLGGCYNGQIACWDTRKGSLVAELSTiessHRDPVYGTIWlqSKTGTECFSASTDGQVMWWDIRkms 298
Cdd:COG2319  207 VRsVAFSP-DGKLLASGSADGTVRLWDLATGKLLRTLTG----HSGSVRSVAF--SPDGRLLASGSADGTVRLWDLA--- 276
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 299 eptevvildiTKKEQ--LENALGAI-SLEFeSTLPTKFMVGTEQGIVISCNrkakTSAEKIVCTFPGHHGPIYALQrnpF 375
Cdd:COG2319  277 ----------TGELLrtLTGHSGGVnSVAF-SPDGKLLASGSDDGTVRLWD----LATGKLLRTLTGHTGAVRSVA---F 338
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 217416452 376 YPK-NFLTVG--DWTARIWSEDSRESSIMWTKyHMAYLTDAAWSPVRPTVfFTTRMDGTLDIWD 436
Cdd:COG2319  339 SPDgKTLASGsdDGTVRLWDLATGELLRTLTG-HTGAVTSVAFSPDGRTL-ASGSADGTVRLWD 400
DUF4795 pfam16043
Domain of unknown function (DUF4795); This family of proteins is functionally uncharacterized. ...
493-562 3.82e-04

Domain of unknown function (DUF4795); This family of proteins is functionally uncharacterized. This family of proteins is found in bacteria and eukaryotes. Proteins in this family are typically between 285 and 978 amino acids in length.


Pssm-ID: 464990 [Multi-domain]  Cd Length: 181  Bit Score: 41.52  E-value: 3.82e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 217416452  493 KNVASSMFERETRREKILEARHREMRLKEKGKAegrDEEQTDEELAV--DLEALVSKAEEEFFDIIFAELKK 562
Cdd:pfam16043  27 SETTSELSERLQQRQKHLEALYQQIEKLEKVKA---DKEVVEEELDEkaDKEALASKVSRDQFDETLEELNQ 95
PTZ00421 PTZ00421
coronin; Provisional
168-307 2.24e-03

coronin; Provisional


Pssm-ID: 173611 [Multi-domain]  Cd Length: 493  Bit Score: 41.03  E-value: 2.24e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 168 LSWHPDGNRKLAVAyscldfqrapvGMSSDSYIWDLENPNKPELALKPSSPLVTLEFNPKDShVLLGGCYNGQIACWDTR 247
Cdd:PTZ00421 131 VSFHPSAMNVLASA-----------GADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGS-LLCTTSKDKKLNIIDPR 198
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 217416452 248 KGSLVAELSTIES--SHRdpvygTIWLQSKTG--TECFSASTDGQVMWWDIRKMSEPTEVVILD 307
Cdd:PTZ00421 199 DGTIVSSVEAHASakSQR-----CLWAKRKDLiiTLGCSKSQQRQIMLWDTRKMASPYSTVDLD 257
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
166-472 8.60e-17

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 81.23  E-value: 8.60e-17
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 166 THLSWHPDGNRkLAVAyscldfqrapvgmSSDS--YIWDLENpNKPELALK-PSSPLVTLEFNPkDSHVLLGGCYNGQIA 242
Cdd:cd00200   13 TCVAFSPDGKL-LATG-------------SGDGtiKVWDLET-GELLRTLKgHTGPVRDVAASA-DGTYLASGSSDKTIR 76
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 243 CWDTRKGSLVAELStiesSHRDPVYGTIWLQSktGTECFSASTDGQVMWWDIRkmSEPTEVVILDITKkeqlenalGAIS 322
Cdd:cd00200   77 LWDLETGECVRTLT----GHTSYVSSVAFSPD--GRILSSSSRDKTIKVWDVE--TGKCLTTLRGHTD--------WVNS 140
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 323 LEFestLPTKFMVGTeqgivISCNRKAK---TSAEKIVCTFPGHHGPIYALQrnpFYPKN---FLTVGDWTARIWSEDSr 396
Cdd:cd00200  141 VAF---SPDGTFVAS-----SSQDGTIKlwdLRTGKCVATLTGHTGEVNSVA---FSPDGeklLSSSSDGTIKLWDLST- 208
                        250       260       270       280       290       300       310
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 217416452 397 ESSIMWTKYHMAYLTDAAWSPVRpTVFFTTRMDGTLDIWDFMFEQCDPTLSLKvcDEALFCLRVQDNGCLIACGSQ 472
Cdd:cd00200  209 GKCLGTLRGHENGVNSVAFSPDG-YLLASGSEDGTIRVWDLRTGECVQTLSGH--TNSVTSLAWSPDGKRLASGSA 281
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
150-436 1.08e-12

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 68.90  E-value: 1.08e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 150 KTINVF--RDPQEIKRAATH------LSWHPDGNrklaVAYSCldfqrapvgmSSDSYI--WDLENpNKPELALK-PSSP 218
Cdd:cd00200   73 KTIRLWdlETGECVRTLTGHtsyvssVAFSPDGR----ILSSS----------SRDKTIkvWDVET-GKCLTTLRgHTDW 137
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 219 LVTLEFNPkDSHVLLGGCYNGQIACWDTRKGSLVAELstieSSHRDPVYGTIWlqSKTGTECFSASTDGQVMWWDIRkms 298
Cdd:cd00200  138 VNSVAFSP-DGTFVASSSQDGTIKLWDLRTGKCVATL----TGHTGEVNSVAF--SPDGEKLLSSSSDGTIKLWDLS--- 207
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 299 eptevvilditkkeqlenalgaislefestlptkfmvgteqgiviscnrkaktsAEKIVCTFPGHHGPIYALQRNPfyPK 378
Cdd:cd00200  208 ------------------------------------------------------TGKCLGTLRGHENGVNSVAFSP--DG 231
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 379 NFLTVGDW--TARIWSEDSRESSIMWTKyHMAYLTDAAWSPVRPTVfFTTRMDGTLDIWD 436
Cdd:cd00200  232 YLLASGSEdgTIRVWDLRTGECVQTLSG-HTNSVTSLAWSPDGKRL-ASGSADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
150-436 2.40e-11

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 66.09  E-value: 2.40e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 150 KTINVF--RDPQEIKRAATH------LSWHPDGNRkLAVAyscldfqrapvgmSSDS--YIWDLENpNKPELALKPSSPL 219
Cdd:COG2319  142 GTVRLWdlATGKLLRTLTGHsgavtsVAFSPDGKL-LASG-------------SDDGtvRLWDLAT-GKLLRTLTGHTGA 206
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 220 VT-LEFNPkDSHVLLGGCYNGQIACWDTRKGSLVAELSTiessHRDPVYGTIWlqSKTGTECFSASTDGQVMWWDIRkms 298
Cdd:COG2319  207 VRsVAFSP-DGKLLASGSADGTVRLWDLATGKLLRTLTG----HSGSVRSVAF--SPDGRLLASGSADGTVRLWDLA--- 276
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 299 eptevvildiTKKEQ--LENALGAI-SLEFeSTLPTKFMVGTEQGIVISCNrkakTSAEKIVCTFPGHHGPIYALQrnpF 375
Cdd:COG2319  277 ----------TGELLrtLTGHSGGVnSVAF-SPDGKLLASGSDDGTVRLWD----LATGKLLRTLTGHTGAVRSVA---F 338
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 217416452 376 YPK-NFLTVG--DWTARIWSEDSRESSIMWTKyHMAYLTDAAWSPVRPTVfFTTRMDGTLDIWD 436
Cdd:COG2319  339 SPDgKTLASGsdDGTVRLWDLATGELLRTLTG-HTGAVTSVAFSPDGRTL-ASGSADGTVRLWD 400
WD40 COG2319
WD40 repeat [General function prediction only];
168-497 1.57e-10

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 63.39  E-value: 1.57e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 168 LSWHPDGNRKLAVAYSCLDFQRAPVGMSSDSYIWDLENPNKPELALKPSSPLVTLEFNPkDSHVLLGGCYNGQIACWDTR 247
Cdd:COG2319   72 ATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSP-DGKTLASGSADGTVRLWDLA 150
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 248 KGSLVAELstieSSHRDPVYGTIWlqSKTGTECFSASTDGQVMWWDIRkmsepTEVVILDITKKEQLENALgAISlefes 327
Cdd:COG2319  151 TGKLLRTL----TGHSGAVTSVAF--SPDGKLLASGSDDGTVRLWDLA-----TGKLLRTLTGHTGAVRSV-AFS----- 213
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 328 tlPTkfmvGTeqgIVISC--NRKAK---TSAEKIVCTFPGHHGPIYALQrnpFYPKN-FLTVGDW--TARIWseDSRESS 399
Cdd:COG2319  214 --PD----GK---LLASGsaDGTVRlwdLATGKLLRTLTGHSGSVRSVA---FSPDGrLLASGSAdgTVRLW--DLATGE 279
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 400 IMWT-KYHMAYLTDAAWSPVRPTVfFTTRMDGTLDIWDfmFEQCDPTLSLKVCDEALFCLRVQDNGCLIACGSQLGTTTL 478
Cdd:COG2319  280 LLRTlTGHSGGVNSVAFSPDGKLL-ASGSDDGTVRLWD--LATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRL 356
                        330       340
                 ....*....|....*....|.
gi 217416452 479 LEVSPG--LSTLQRNEKNVAS 497
Cdd:COG2319  357 WDLATGelLRTLTGHTGAVTS 377
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
147-296 2.67e-08

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 55.42  E-value: 2.67e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 147 PSAKTINVFRDpqeIKRAATHLSWHPDGNRklaVAYSCLDfqrapvgmsSDSYIWDLENPNKPELALKPSSPLVTLEFNP 226
Cdd:cd00200  123 ETGKCLTTLRG---HTDWVNSVAFSPDGTF---VASSSQD---------GTIKLWDLRTGKCVATLTGHTGEVNSVAFSP 187
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 227 KDSHVLLGGCyNGQIACWDTRKGSLVAELstieSSHRDPVYGTIWlqSKTGTECFSASTDGQVMWWDIRK 296
Cdd:cd00200  188 DGEKLLSSSS-DGTIKLWDLSTGKCLGTL----RGHENGVNSVAF--SPDGYLLASGSEDGTIRVWDLRT 250
WD40 COG2319
WD40 repeat [General function prediction only];
164-295 5.30e-06

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 49.14  E-value: 5.30e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 164 AATHLSWHPDGNRkLAVAyscldfqrapvgmSSDS--YIWDLENPnKPELALKPSSPLVT-LEFNPkDSHVLLGGCYNGQ 240
Cdd:COG2319  290 GVNSVAFSPDGKL-LASG-------------SDDGtvRLWDLATG-KLLRTLTGHTGAVRsVAFSP-DGKTLASGSDDGT 353
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*
gi 217416452 241 IACWDTRKGSLVAELStiesSHRDPVYGTIWlqSKTGTECFSASTDGQVMWWDIR 295
Cdd:COG2319  354 VRLWDLATGELLRTLT----GHTGAVTSVAF--SPDGRTLASGSADGTVRLWDLA 402
WD40 COG2319
WD40 repeat [General function prediction only];
163-497 1.07e-05

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 47.98  E-value: 1.07e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 163 RAATHLSWHPDGNRKLAVAYSCLDFQRAPVGMSSDSYIWDLENPNKPELALKPSSPLVTLEFNPkDSHVLLGGCYNGQIA 242
Cdd:COG2319   25 LGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSP-DGRLLASASADGTVR 103
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 243 CWDTRKGSLVAELStiesSHRDPVYGTIWlqSKTGTECFSASTDGQVMWWDIrkmseptevvilditkkeqlenalgais 322
Cdd:COG2319  104 LWDLATGLLLRTLT----GHTGAVRSVAF--SPDGKTLASGSADGTVRLWDL---------------------------- 149
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 323 lefestlptkfmvgtEQGiviscnrkaktsaeKIVCTFPGHHGPIYALQRNPfyPKNFLTVGDW--TARIWSEDSRESSI 400
Cdd:COG2319  150 ---------------ATG--------------KLLRTLTGHSGAVTSVAFSP--DGKLLASGSDdgTVRLWDLATGKLLR 198
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 401 MWTKyHMAYLTDAAWSPvRPTVFFTTRMDGTLDIWDfmFEQCDPTLSLKVCDEALFCLRVQDNGCLIACGSQLGTTTLLE 480
Cdd:COG2319  199 TLTG-HTGAVRSVAFSP-DGKLLASGSADGTVRLWD--LATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWD 274
                        330
                 ....*....|....*....
gi 217416452 481 VSPG--LSTLQRNEKNVAS 497
Cdd:COG2319  275 LATGelLRTLTGHSGGVNS 293
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
322-478 1.30e-05

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 47.33  E-value: 1.30e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 322 SLEFeSTLPTKFMVGTEQGIVISCNrkakTSAEKIVCTFPGHHGPIYALQRNPFYPKnFLTVG-DWTARIWSEDSRESSI 400
Cdd:cd00200   14 CVAF-SPDGKLLATGSGDGTIKVWD----LETGELLRTLKGHTGPVRDVAASADGTY-LASGSsDKTIRLWDLETGECVR 87
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 217416452 401 MWTKyHMAYLTDAAWSPVRPtVFFTTRMDGTLDIWDfmFEQCDPTLSLKVCDEALFCLRVQDNGCLIACGSQLGTTTL 478
Cdd:cd00200   88 TLTG-HTSYVSSVAFSPDGR-ILSSSSRDKTIKVWD--VETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKL 161
DUF4795 pfam16043
Domain of unknown function (DUF4795); This family of proteins is functionally uncharacterized. ...
493-562 3.82e-04

Domain of unknown function (DUF4795); This family of proteins is functionally uncharacterized. This family of proteins is found in bacteria and eukaryotes. Proteins in this family are typically between 285 and 978 amino acids in length.


Pssm-ID: 464990 [Multi-domain]  Cd Length: 181  Bit Score: 41.52  E-value: 3.82e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 217416452  493 KNVASSMFERETRREKILEARHREMRLKEKGKAegrDEEQTDEELAV--DLEALVSKAEEEFFDIIFAELKK 562
Cdd:pfam16043  27 SETTSELSERLQQRQKHLEALYQQIEKLEKVKA---DKEVVEEELDEkaDKEALASKVSRDQFDETLEELNQ 95
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
357-484 1.12e-03

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 41.17  E-value: 1.12e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 357 VCTFPGHHGPIYALQRNPfyPKNFLTVG--DWTARIWseDSRESSIMWT-KYHMAYLTDAAWSPVRPTVfFTTRMDGTLD 433
Cdd:cd00200    2 RRTLKGHTGGVTCVAFSP--DGKLLATGsgDGTIKVW--DLETGELLRTlKGHTGPVRDVAASADGTYL-ASGSSDKTIR 76
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|.
gi 217416452 434 IWDFMFEQCdpTLSLKVCDEALFCLRVQDNGCLIACGSQLGTTTLLEVSPG 484
Cdd:cd00200   77 LWDLETGEC--VRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETG 125
PTZ00421 PTZ00421
coronin; Provisional
168-307 2.24e-03

coronin; Provisional


Pssm-ID: 173611 [Multi-domain]  Cd Length: 493  Bit Score: 41.03  E-value: 2.24e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 168 LSWHPDGNRKLAVAyscldfqrapvGMSSDSYIWDLENPNKPELALKPSSPLVTLEFNPKDShVLLGGCYNGQIACWDTR 247
Cdd:PTZ00421 131 VSFHPSAMNVLASA-----------GADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGS-LLCTTSKDKKLNIIDPR 198
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 217416452 248 KGSLVAELSTIES--SHRdpvygTIWLQSKTG--TECFSASTDGQVMWWDIRKMSEPTEVVILD 307
Cdd:PTZ00421 199 DGTIVSSVEAHASakSQR-----CLWAKRKDLiiTLGCSKSQQRQIMLWDTRKMASPYSTVDLD 257
PTZ00420 PTZ00420
coronin; Provisional
77-307 3.50e-03

coronin; Provisional


Pssm-ID: 240412 [Multi-domain]  Cd Length: 568  Bit Score: 40.32  E-value: 3.50e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452  77 VEGGwpKDVNPLELEQTIRfRKKVEKDENYVNAIMQL------GSIMEHCiKQNNAIDIYEEYFNDEEAMEVmeedpsak 150
Cdd:PTZ00420  47 VEGG--GLIGAIRLENQMR-KPPVIKLKGHTSSILDLqfnpcfSEILASG-SEDLTIRVWEIPHNDESVKEI-------- 114
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 151 tinvfRDPQEI----KRAATHLSWHPdgnrklaVAYscldFQRAPVGMSSDSYIWDLENPNKPELALKPSSpLVTLEFNP 226
Cdd:PTZ00420 115 -----KDPQCIlkghKKKISIIDWNP-------MNY----YIMCSSGFDSFVNIWDIENEKRAFQINMPKK-LSSLKWNI 177
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 217416452 227 KDShVLLGGCYNGQIACWDTRKGSLVAELStIESSHRDPvyGTIWLQSKTG------TECFSASTDGQVMWWDIRKMSEP 300
Cdd:PTZ00420 178 KGN-LLSGTCVGKHMHIIDPRKQEIASSFH-IHDGGKNT--KNIWIDGLGGddnyilSTGFSKNNMREMKLWDLKNTTSA 253

                 ....*..
gi 217416452 301 TEVVILD 307
Cdd:PTZ00420 254 LVTMSID 260
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH