Conserved domains on  [gi|50302539|ref|XP_451204|]

uncharacterized protein KLLA0_A04653g [Kluyveromyces lactis]

Protein Classification

WD repeat NOL10/ENP2 family protein (domain architecture ID 13236865)

A WD repeat NOL10/ENP2 family protein, such as nucleolar protein 10, contains WD40 repeats that fold into a beta-propeller structure and functions as a scaffold.

CATH:  2.130.10.10
Gene Ontology:  GO:0005515
SCOP:  4002744

List of domain hits

Name Accession Description Interval E-value
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
62-306 1.36e-09

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions, including adaptor/regulatory modules in signal transduction, pre-mRNA processing, and cytoskeleton assembly. It typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core. It serves as a stable propeller-like platform to which proteins can bind either stably or reversibly, forming a propeller-like structure with several blades, where each blade is composed of a four-stranded anti-parallel beta-sheet. Instances with few detectable copies are hypothesized to form larger structures by dimerization. Each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller structure. Residues on the top and bottom surfaces of the propeller are proposed to coordinate interactions with other proteins and/or small ligands. Seven copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 59.66  E-value: 1.36e-09
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50302539  62 IKVTPDGNYvMATGTYKPQIHVYDFSNLSLKFER--HTDAenIDFCILSEDWTK--SVHlqNDRTIQFQNKGGIHYTTRI 137
Cdd:cd00200  15 VAFSPDGKL-LATGSGDGTIKVWDLETGELLRTLkgHTGP--VRDVAASADGTYlaSGS--SDKTIRLWDLETGECVRTL 89
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50302539 138 ---PKFGRSLAYNKvNCDLYVGASSNELYRL-NLEQGRFLNPFKLDTEGVNHVSINEVNGLLAASMETNVVEFWDPRSRS 213
Cdd:cd00200  90 tghTSYVSSVAFSP-DGRILSSSSRDKTIKVwDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGK 168
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50302539 214 RVAKLHLENNldsadfQVTCSSFKNDGLNFACGTSNGYSYIYDLRTSEPSivkdqGYGFAVNKIIWLDSVEDSNKILTC- 292
Cdd:cd00200 169 CVATLTGHTG------EVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCL-----GTLRGHENGVNSVAFSPDGYLLASg 237
                       250
                ....*....|....*.
gi 50302539 293 --DKRIaKIWNKNDGK 306
Cdd:cd00200 238 seDGTI-RVWDLRTGE 252
NUC153 pfam08159
NUC153 domain; this small domain is found in a novel nucleolar family.
479-507 3.40e-09


Pssm-ID: 462385 [Multi-domain]  Cd Length: 29  Bit Score: 52.34  E-value: 3.40e-09
                          10        20
                  ....*....|....*....|....*....
gi 50302539   479 DDRFKEMFEDDAFQVDEDDYDYKQLNPVK 507
Cdd:pfam08159   1 DPRFKALFEDHDFAIDPTSPEFKKTNPMK 29
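
The NUC153 alignment above is short enough to check by hand. As a minimal sketch, the percent identity between the query row and the pfam08159 row can be computed as below. The function name `percent_identity` is hypothetical, and the handling of gaps (`-`) and lowercase letters (treated here as unaligned residues and skipped) is an assumption about how the alignment display should be read, not something stated on this page.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent identity over the aligned columns of two alignment rows."""
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    aligned = 0
    identical = 0
    for q, s in zip(query, subject):
        # Assumption: gap columns and lowercase (unaligned) residues are skipped.
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    if aligned == 0:
        raise ValueError("no aligned columns")
    return 100.0 * identical / aligned


# Rows copied from the NUC153 alignment (query residues 479-507, model positions 1-29).
query = "DDRFKEMFEDDAFQVDEDDYDYKQLNPVK"
subject = "DPRFKALFEDHDFAIDPTSPEFKKTNPMK"
print(f"{percent_identity(query, subject):.1f}% identity over {len(query)} columns")
```

The same function can be applied to the longer WD40 alignment blocks once their rows are concatenated; the lowercase and gap columns in those rows are then excluded from the count.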
 
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
62-306 1.36e-09


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 59.66  E-value: 1.36e-09
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50302539  62 IKVTPDGNYvMATGTYKPQIHVYDFSNLSLKFER--HTDAenIDFCILSEDWTK--SVHlqNDRTIQFQNKGGIHYTTRI 137
Cdd:cd00200  15 VAFSPDGKL-LATGSGDGTIKVWDLETGELLRTLkgHTGP--VRDVAASADGTYlaSGS--SDKTIRLWDLETGECVRTL 89
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50302539 138 ---PKFGRSLAYNKvNCDLYVGASSNELYRL-NLEQGRFLNPFKLDTEGVNHVSINEVNGLLAASMETNVVEFWDPRSRS 213
Cdd:cd00200  90 tghTSYVSSVAFSP-DGRILSSSSRDKTIKVwDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGK 168
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50302539 214 RVAKLHLENNldsadfQVTCSSFKNDGLNFACGTSNGYSYIYDLRTSEPSivkdqGYGFAVNKIIWLDSVEDSNKILTC- 292
Cdd:cd00200 169 CVATLTGHTG------EVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCL-----GTLRGHENGVNSVAFSPDGYLLASg 237
                       250
                ....*....|....*.
gi 50302539 293 --DKRIaKIWNKNDGK 306
Cdd:cd00200 238 seDGTI-RVWDLRTGE 252
WD40 COG2319
WD40 repeat [General function prediction only];
153-316 3.99e-05


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 46.44  E-value: 3.99e-05
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50302539 153 LYVGASSNELYRLNLEQGRFLNPFKLDTEGVNHVSINEVNGLLAASMETNVVEFWDPRSRSRVAKLHLENNldsadfQVT 232
Cdd:COG2319 177 LASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSG------SVR 250
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50302539 233 CSSFKNDGLNFACGTSNGYSYIYDLRTSEPsIVKDQGYGFAVNKIIWLDsveDSNKILTC--DKRIaKIWNKNDGKAYAS 310
Cdd:COG2319 251 SVAFSPDGRLLASGSADGTVRLWDLATGEL-LRTLTGHSGGVNSVAFSP---DGKLLASGsdDGTV-RLWDLATGKLLRT 325

                ....*.
gi 50302539 311 MEPSVD 316
Cdd:COG2319 326 LTGHTG 331
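
Each hit carries a summary line (Pssm-ID, Cd Length, Bit Score, E-value) followed by the aligned model positions. The sketch below shows one way to parse that line and estimate how much of the domain model the alignment spans. The names `parse_summary` and `model_coverage` are hypothetical, the regular expression assumes the exact field labels shown on this page, and coverage is taken as the span of model positions divided by Cd Length, which ignores internal gaps.

```python
import re

# Assumption: the field labels and their order match the summary lines on this page.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r".*?Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str) -> dict:
    """Extract the numeric fields from a hit summary line."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def model_coverage(start: int, end: int, cd_length: int) -> float:
    """Fraction of the domain model spanned by positions start..end (inclusive)."""
    return (end - start + 1) / cd_length


line = "Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 46.44  E-value: 3.99e-05"
hit = parse_summary(line)
# The COG2319 rows above run from model position 177 to 331.
coverage = model_coverage(177, 331, hit["cd_length"])
print(hit, f"coverage={coverage:.2f}")
```

A low coverage value, as for this COG2319 hit, indicates that only part of the model aligned to the query.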
 
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
60-301 6.09e-08


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 54.65  E-value: 6.09e-08
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50302539  60 NKIKVTPDGNYvMATGTYKPQIHVYDFSNLSL--KFERHTDAenidfcILSEDWTKSVHL----QNDRTIQFQN--KGGI 131
Cdd:cd00200  55 RDVAASADGTY-LASGSSDKTIRLWDLETGECvrTLTGHTSY------VSSVAFSPDGRIlsssSRDKTIKVWDveTGKC 127
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50302539 132 HYTTR-IPKFGRSLAYNKVNcDLYVGASSNELYRL-NLEQGRFLNPFKLDTEGVNHVSINEVNGLLAASMETNVVEFWDP 209
Cdd:cd00200 128 LTTLRgHTDWVNSVAFSPDG-TFVASSSQDGTIKLwDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDL 206
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50302539 210 RSRSRVAKLHLENNldsadfQVTCSSFKNDGLNFACGTSNGYSYIYDLRTSEPSIVkdqgyGFAVNKIIW-LDSVEDSNK 288
Cdd:cd00200 207 STGKCLGTLRGHEN------GVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQT-----LSGHTNSVTsLAWSPDGKR 275
                       250
                ....*....|....*
gi 50302539 289 ILTC--DKRIaKIWN 301
Cdd:cd00200 276 LASGsaDGTI-RIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
64-306 6.28e-04


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 42.59  E-value: 6.28e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50302539  64 VTPDGNYVmATGTYKPQIHVYDFSNLSL--KFERHTDA-ENIDFC-----ILSEDWTKSVHLQNDRTIQFQNKGGIHyTT 135
Cdd:COG2319 128 FSPDGKTL-ASGSADGTVRLWDLATGKLlrTLTGHSGAvTSVAFSpdgklLASGSDDGTVRLWDLATGKLLRTLTGH-TG 205
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50302539 136 RIpkfgRSLAYNKVNCDLYVGASSNELYRLNLEQGRFLNPFKLDTEGVNHVSINEVNGLLAASMETNVVEFWDPRSRSRV 215
Cdd:COG2319 206 AV----RSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELL 281
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50302539 216 AKLHLENNldsadfQVTCSSFKNDGLNFACGTSNGYSYIYDLRTSEPsIVKDQGYGFAVNKIIWLDsveDSNKILTC--D 293
Cdd:COG2319 282 RTLTGHSG------GVNSVAFSPDGKLLASGSDDGTVRLWDLATGKL-LRTLTGHTGAVRSVAFSP---DGKTLASGsdD 351
                       250
                ....*....|...
gi 50302539 294 KRIaKIWNKNDGK 306
Cdd:COG2319 352 GTV-RLWDLATGE 363
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
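
To illustrate the E-value threshold listed in the search parameters, the sketch below filters the hits reported on this page against a cutoff. The `Hit` type and `filter_hits` function are hypothetical, the hit values are copied from the domain hit list, and treating the threshold as inclusive is an assumption.

```python
from typing import List, NamedTuple, Tuple


class Hit(NamedTuple):
    name: str
    accession: str
    interval: Tuple[int, int]
    e_value: float


# Values copied from the list of domain hits on this page.
HITS = [
    Hit("WD40", "cd00200", (62, 306), 1.36e-09),
    Hit("NUC153", "pfam08159", (479, 507), 3.40e-09),
    Hit("WD40", "cd00200", (60, 301), 6.09e-08),
    Hit("WD40", "COG2319", (153, 316), 3.99e-05),
    Hit("WD40", "COG2319", (64, 306), 6.28e-04),
]


def filter_hits(hits: List[Hit], threshold: float = 0.01) -> List[Hit]:
    """Keep hits at or below the E-value threshold, most significant first."""
    kept = [h for h in hits if h.e_value <= threshold]
    return sorted(kept, key=lambda h: h.e_value)


for hit in filter_hits(HITS):
    print(f"{hit.accession:10s} {hit.interval[0]}-{hit.interval[1]:<4d} {hit.e_value:.2e}")
```

Tightening the threshold, for example `filter_hits(HITS, 1e-6)`, would drop the two COG2319 hits while keeping the cd00200 and pfam08159 hits.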
