U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination
    • Showing Current items.

    Wdr33 WD repeat domain 33 [ Mus musculus (house mouse) ]

    Gene ID: 74320, updated on 9-Dec-2024

    Summary

    Official Symbol
    Wdr33provided by MGI
    Official Full Name
    WD repeat domain 33provided by MGI
    Primary source
    MGI:MGI:1921570
    See related
    Ensembl:ENSMUSG00000024400 AllianceGenome:MGI:1921570
    Gene type
    protein coding
    RefSeq status
    VALIDATED
    Organism
    Mus musculus
    Lineage
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
    Also known as
    WDC146; 1110001N06Rik; 2310011G05Rik; 2810021O11Rik; 8430413N20Rik
    Summary
    Predicted to be involved in mRNA 3'-end processing. Located in nucleus. Orthologous to human WDR33 (WD repeat domain 33). [provided by Alliance of Genome Resources, Dec 2024]
    Expression
    Ubiquitous expression in CNS E11.5 (RPKM 5.8), limb E14.5 (RPKM 5.4) and 28 other tissues See more
    Orthologs
    NEW
    Try the new Gene table
    Try the new Transcript table

    Genomic context

    See Wdr33 in Genome Data Viewer
    Location:
    18 B1; 18 17.85 cM
    Exon count:
    26
    Annotation release Status Assembly Chr Location
    RS_2024_02 current GRCm39 (GCF_000001635.27) 18 NC_000084.7 (31937079..32042040)
    108.20200622 previous assembly GRCm38.p6 (GCF_000001635.26) 18 NC_000084.6 (31804057..31908987)

    Chromosome 18 - NC_000084.7Genomic Context describing neighboring genes Neighboring gene AMME chromosomal region gene 1-like Neighboring gene STARR-positive B cell enhancer ABC_E3226 Neighboring gene predicted gene, 26533 Neighboring gene STARR-positive B cell enhancer mm9_chr18:31963552-31963852 Neighboring gene polymerase (RNA) II (DNA directed) polypeptide D Neighboring gene STARR-positive B cell enhancer ABC_E10983 Neighboring gene glutathione S-transferase, mu 2 pseudogene Neighboring gene STARR-seq mESC enhancer starr_44149 Neighboring gene CapStarr-seq enhancer MGSCv37_chr18:32069964-32070147 Neighboring gene SFT2 domain containing 3 Neighboring gene STARR-seq mESC enhancer starr_44150 Neighboring gene LIM and senescent cell antigen like domains 2 Neighboring gene predicted gene, 46619

    Genomic regions, transcripts, and products

    Expression

    • Project title: Mouse ENCODE transcriptome data
    • Description: RNA profiling data sets generated by the Mouse ENCODE project.
    • BioProject: PRJNA66167
    • Publication: PMID 25409824
    • Analysis date: n/a

    Variation

    Alleles

    Alleles of this type are documented at Mouse Genome Informatics  (MGI)
    • Endonuclease-mediated (2) 

    Pathways from PubChem

    Interactions

    Products Interactant Other Gene Complex Source Pubs Description

    General gene information

    Markers

    Gene Ontology Provided by MGI

    Process Evidence Code Pubs
    involved_in mRNA 3'-end processing IEA
    Inferred from Electronic Annotation
    more info
     
    Component Evidence Code Pubs
    part_of collagen trimer IEA
    Inferred from Electronic Annotation
    more info
     
    located_in fibrillar center IEA
    Inferred from Electronic Annotation
    more info
     
    located_in fibrillar center ISO
    Inferred from Sequence Orthology
    more info
     
    part_of mRNA cleavage and polyadenylation specificity factor complex IBA
    Inferred from Biological aspect of Ancestor
    more info
     
    located_in nucleoplasm IEA
    Inferred from Electronic Annotation
    more info
     
    located_in nucleoplasm ISO
    Inferred from Sequence Orthology
    more info
     
    located_in nucleus IDA
    Inferred from Direct Assay
    more info
    PubMed 
    located_in nucleus ISO
    Inferred from Sequence Orthology
    more info
     

    General protein information

    Preferred Names
    pre-mRNA 3' end processing protein WDR33
    Names
    WD repeat-containing protein 33
    WD repeat-containing protein WDC146
    WD repeat-containing protein of 146 kDa

    NCBI Reference Sequences (RefSeq)

    NEW Try the new Transcript table

    RefSeqs maintained independently of Annotated Genomes

    These reference sequences exist independently of genome builds. Explain

    These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

    mRNA and Protein(s)

    1. NM_001170966.1NP_001164437.1  pre-mRNA 3' end processing protein WDR33 isoform 4

      See identical proteins and their annotated locations for NP_001164437.1

      Status: VALIDATED

      Description
      Transcript Variant: This variant (4) has multiple differences, compared to variant 1. The encoded isoform (4) is shorter and has a distinct C-terminus, compared to isoform 1.
      Source sequence(s)
      AC124393, AC161511, AK045923
      Consensus CDS
      CCDS89211.1
      UniProtKB/TrEMBL
      Q8BRC5, Q9D1P6
      Related
      ENSMUSP00000157238.2, ENSMUST00000234344.2
      Conserved Domains (3) summary
      COG2319
      Location:127219
      WD40; WD40 repeat [General function prediction only]
      sd00039
      Location:122158
      7WD40; WD40 repeat [structural motif]
      cl02567
      Location:119205
      WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
    2. NM_001170967.1NP_001164438.1  pre-mRNA 3' end processing protein WDR33 isoform 3

      See identical proteins and their annotated locations for NP_001164438.1

      Status: VALIDATED

      Description
      Transcript Variant: This variant (3) has multiple differences, compared to variant 1. The encoded isoform (3) is shorter and has a distinct C-terminus, compared to isoform 1.
      Source sequence(s)
      AC124393, AC161511, AK078286
      Consensus CDS
      CCDS89210.1
      UniProtKB/TrEMBL
      D3YX80, Q8K1G7
      Related
      ENSMUSP00000080936.9, ENSMUST00000082319.15
      Conserved Domains (3) summary
      COG2319
      Location:104231
      WD40; WD40 repeat [General function prediction only]
      sd00039
      Location:122159
      7WD40; WD40 repeat [structural motif]
      cl02567
      Location:119230
      WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
    3. NM_001170970.1NP_001164441.1  pre-mRNA 3' end processing protein WDR33 isoform 2

      Status: VALIDATED

      Description
      Transcript Variant: This variant (2) has multiple differences, compared to variant 1. The encoded isoform (2) is shorter and has a distinct C-terminus, compared to isoform 1.
      Source sequence(s)
      AC124393, AC161511, AK009297
      Consensus CDS
      CCDS50242.1
      UniProtKB/TrEMBL
      A0A3Q4EGD8
      Related
      ENSMUSP00000157157.2, ENSMUST00000234957.2
      Conserved Domains (2) summary
      sd00039
      Location:122159
      7WD40; WD40 repeat [structural motif]
      cl29593
      Location:119230
      WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
    4. NM_028866.3NP_083142.2  pre-mRNA 3' end processing protein WDR33 isoform 1

      See identical proteins and their annotated locations for NP_083142.2

      Status: VALIDATED

      Description
      Transcript Variant: This variant (1) represents the longest transcript and encodes the longest isoform (1).
      Source sequence(s)
      AC124393, AC131761, AC161511
      Consensus CDS
      CCDS29112.1
      UniProtKB/Swiss-Prot
      Q8C7C6, Q8CD02, Q8K4P0
      Related
      ENSMUSP00000025264.7, ENSMUST00000025264.8
      Conserved Domains (5) summary
      pfam01391
      Location:717769
      Collagen; Collagen triple helix repeat (20 copies)
      COG2319
      Location:121405
      WD40; WD40 repeat [General function prediction only]
      cd00200
      Location:121402
      WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
      pfam09606
      Location:602963
      Med15; ARC105 or Med15 subunit of Mediator complex non-fungal
      sd00039
      Location:122159
      7WD40; WD40 repeat [structural motif]

    RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

    The following sections contain reference sequences that belong to a specific genome build. Explain

    Reference GRCm39 C57BL/6J

    Genomic

    1. NC_000084.7 Reference GRCm39 C57BL/6J

      Range
      31937079..32042040
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)

    mRNA and Protein(s)

    1. XM_006526300.2XP_006526363.1  pre-mRNA 3' end processing protein WDR33 isoform X2

      Conserved Domains (4) summary
      cd00200
      Location:121402
      WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
      sd00039
      Location:122159
      7WD40; WD40 repeat [structural motif]
      pfam09606
      Location:590926
      Med15; ARC105 or Med15 subunit of Mediator complex non-fungal
      cl26593
      Location:537640
      DUF2076; Uncharacterized protein conserved in bacteria (DUF2076)
    2. XM_006526301.2XP_006526364.1  pre-mRNA 3' end processing protein WDR33 isoform X3

      Conserved Domains (3) summary
      cd00200
      Location:121402
      WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
      sd00039
      Location:122159
      7WD40; WD40 repeat [structural motif]
      pfam09606
      Location:590926
      Med15; ARC105 or Med15 subunit of Mediator complex non-fungal
    3. XM_006526298.2XP_006526361.1  pre-mRNA 3' end processing protein WDR33 isoform X1

      Conserved Domains (4) summary
      cd00200
      Location:121402
      WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
      sd00039
      Location:122159
      7WD40; WD40 repeat [structural motif]
      pfam09606
      Location:590926
      Med15; ARC105 or Med15 subunit of Mediator complex non-fungal
      cl26593
      Location:537640
      DUF2076; Uncharacterized protein conserved in bacteria (DUF2076)
    4. XM_036161297.1XP_036017190.1  pre-mRNA 3' end processing protein WDR33 isoform X5

      Conserved Domains (2) summary
      sd00039
      Location:122159
      7WD40; WD40 repeat [structural motif]
      cl29593
      Location:119230
      WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
    5. XM_017317996.3XP_017173485.1  pre-mRNA 3' end processing protein WDR33 isoform X1

      Conserved Domains (4) summary
      cd00200
      Location:121402
      WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...
      sd00039
      Location:122159
      7WD40; WD40 repeat [structural motif]
      pfam09606
      Location:590926
      Med15; ARC105 or Med15 subunit of Mediator complex non-fungal
      cl26593
      Location:537640
      DUF2076; Uncharacterized protein conserved in bacteria (DUF2076)
    6. XM_036161296.1XP_036017189.1  pre-mRNA 3' end processing protein WDR33 isoform X4

      UniProtKB/TrEMBL
      A0A3Q4EGD8
      Conserved Domains (2) summary
      sd00039
      Location:122159
      7WD40; WD40 repeat [structural motif]
      cl29593
      Location:119230
      WD40; WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from ...