Cd33 CD33 molecule [ Mus musculus (house mouse) ]

Gene ID: 12489, updated on 2-Nov-2024

Summary

Official Symbol
Cd33 (provided by MGI)
Official Full Name
CD33 molecule (provided by MGI)
Primary source
MGI:MGI:99440
See related
Ensembl:ENSMUSG00000004609 AllianceGenome:MGI:99440
Gene type
protein coding
RefSeq status
VALIDATED
Organism
Mus musculus
Lineage
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
Also known as
gp67; Siglec-3
Summary
Predicted to enable protein phosphatase binding activity, protein tyrosine phosphatase activator activity, and sialic acid binding activity. Predicted to be involved in cell adhesion. Predicted to be located in several cellular components, including the Golgi apparatus, the external side of the plasma membrane, and the peroxisome. Predicted to be active in the plasma membrane. Expressed in the central nervous system, genitourinary system, spleen, and submandibular gland. Orthologous to several human genes, including CD33 (CD33 molecule). [provided by Alliance of Genome Resources, Nov 2024]
Expression
Broad expression in liver E18 (RPKM 4.3), mammary gland adult (RPKM 4.2) and 27 other tissues

Genomic context

Location:
7 B3; 7 28.25 cM
Exon count:
9
Annotation release  Status             Assembly                      Chr  Location
RS_2024_02          current            GRCm39 (GCF_000001635.27)     7    NC_000073.7 (43176823..43186679, complement)
108.20200622        previous assembly  GRCm38.p6 (GCF_000001635.26)  7    NC_000073.6 (43527399..43537424, complement)
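
Each Location value packs a sequence accession, a coordinate range, and a strand flag into one string. The following is a minimal sketch of how such a string could be parsed, assuming the `ACCESSION (start..end[, complement])` layout seen in the two rows above and 1-based inclusive coordinates. `parse_location` and `span_length` are hypothetical helper names, not part of any NCBI library.

```python
import re
from typing import NamedTuple


class GenomicLocation(NamedTuple):
    accession: str
    start: int
    end: int
    strand: str  # "+" for forward, "-" for complement


# Layout assumed from the two rows of the table above only.
_LOCATION_RE = re.compile(
    r"^(?P<acc>\S+)\s+\((?P<start>\d+)\.\.(?P<end>\d+)(?P<comp>,\s*complement)?\)$"
)


def parse_location(text: str) -> GenomicLocation:
    """Split 'ACC (start..end, complement)' into its parts."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    strand = "-" if match.group("comp") else "+"
    return GenomicLocation(
        match.group("acc"),
        int(match.group("start")),
        int(match.group("end")),
        strand,
    )


def span_length(loc: GenomicLocation) -> int:
    """Length in bases, treating both coordinates as inclusive."""
    return loc.end - loc.start + 1


current = parse_location("NC_000073.7 (43176823..43186679, complement)")
previous = parse_location("NC_000073.6 (43527399..43537424, complement)")
print(current, span_length(current))
print(previous, span_length(previous))
```

Under these assumptions the gene spans 9857 bases on GRCm39 and 10026 bases on GRCm38.p6, on the complement strand in both assemblies.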

Chromosome 7 - NC_000073.7
Neighboring genes: predicted gene 9278; Siglec family like 1; zinc finger protein 658; STARR-positive B cell enhancer ABC_E9290; zinc finger protein 719

Genomic regions, transcripts, and products

Expression

  • Project title: Mouse ENCODE transcriptome data
  • Description: RNA profiling data sets generated by the Mouse ENCODE project.
  • BioProject: PRJNA66167
  • Publication: PMID 25409824
  • Analysis date: n/a
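
The expression values quoted for this gene are in RPKM (reads per kilobase of transcript per million mapped reads). As a rough illustration of how that unit is computed, here is a minimal sketch using the standard definition. The read count, transcript length, and library size are hypothetical numbers chosen for the example, not values from the Mouse ENCODE data.

```python
def rpkm(read_count: int, transcript_length_bp: int, total_mapped_reads: int) -> float:
    """Reads Per Kilobase of transcript per Million mapped reads.

    RPKM = reads * 10^9 / (transcript length in bp * total mapped reads)
    """
    if transcript_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("length and library size must be positive")
    return read_count * 1e9 / (transcript_length_bp * total_mapped_reads)


# Hypothetical inputs: 430 reads on a 2,000 bp transcript in a 50-million-read library.
example_value = rpkm(430, 2_000, 50_000_000)
print(round(example_value, 2))
```

The normalization divides out both transcript length and sequencing depth, which is what allows values from different tissues and libraries to be listed side by side.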

Bibliography

GeneRIFs: Gene References Into Functions

Variation

Alleles

Alleles of this type are documented at Mouse Genome Informatics (MGI)

General gene information

Gene Ontology Provided by MGI

Function                                                 Evidence Code                                      Pubs
enables carbohydrate binding                             IEA (Inferred from Electronic Annotation)
enables protein phosphatase binding                      ISO (Inferred from Sequence Orthology)
enables protein tyrosine phosphatase activator activity  ISO (Inferred from Sequence Orthology)
enables sialic acid binding                              IBA (Inferred from Biological aspect of Ancestor)
enables sialic acid binding                              ISO (Inferred from Sequence Orthology)

Process                                                  Evidence Code                                      Pubs
involved_in cell adhesion                                IBA (Inferred from Biological aspect of Ancestor)
involved_in cell adhesion                                IEA (Inferred from Electronic Annotation)

Component                                                Evidence Code                                      Pubs
located_in Golgi apparatus                               ISO (Inferred from Sequence Orthology)
located_in cell surface                                  ISO (Inferred from Sequence Orthology)
located_in cytosol                                       ISO (Inferred from Sequence Orthology)
located_in external side of plasma membrane              ISO (Inferred from Sequence Orthology)             PubMed
located_in nucleoplasm                                   ISO (Inferred from Sequence Orthology)
located_in peroxisome                                    ISO (Inferred from Sequence Orthology)
is_active_in plasma membrane                             IBA (Inferred from Biological aspect of Ancestor)
located_in plasma membrane                               ISO (Inferred from Sequence Orthology)
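
Every annotation in the table above carries an inference-based evidence code (IEA, ISO, or IBA), which is consistent with the "Predicted to" wording of the gene summary. The sketch below groups the annotations by evidence code to make that visible. The tuples are transcribed by hand from the table, and `group_by_evidence` is a hypothetical helper, not an MGI or GO tool.

```python
from collections import defaultdict

# Evidence code expansions as given in the table above.
EVIDENCE_NAMES = {
    "IEA": "Inferred from Electronic Annotation",
    "ISO": "Inferred from Sequence Orthology",
    "IBA": "Inferred from Biological aspect of Ancestor",
}

# (aspect, relation and term, evidence code), transcribed from the table above.
ANNOTATIONS = [
    ("Function", "enables carbohydrate binding", "IEA"),
    ("Function", "enables protein phosphatase binding", "ISO"),
    ("Function", "enables protein tyrosine phosphatase activator activity", "ISO"),
    ("Function", "enables sialic acid binding", "IBA"),
    ("Function", "enables sialic acid binding", "ISO"),
    ("Process", "involved_in cell adhesion", "IBA"),
    ("Process", "involved_in cell adhesion", "IEA"),
    ("Component", "located_in Golgi apparatus", "ISO"),
    ("Component", "located_in cell surface", "ISO"),
    ("Component", "located_in cytosol", "ISO"),
    ("Component", "located_in external side of plasma membrane", "ISO"),
    ("Component", "located_in nucleoplasm", "ISO"),
    ("Component", "located_in peroxisome", "ISO"),
    ("Component", "is_active_in plasma membrane", "IBA"),
    ("Component", "located_in plasma membrane", "ISO"),
]


def group_by_evidence(annotations):
    """Map each evidence code to the list of (aspect, term) pairs that use it."""
    grouped = defaultdict(list)
    for aspect, term, code in annotations:
        grouped[code].append((aspect, term))
    return dict(grouped)


grouped = group_by_evidence(ANNOTATIONS)
for code in sorted(grouped):
    print(f"{code} ({EVIDENCE_NAMES[code]}): {len(grouped[code])} annotations")
```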
 

General protein information

Preferred Names
myeloid cell surface antigen CD33
Names
CD33 antigen
sialic acid-binding Ig-like lectin 3

NCBI Reference Sequences (RefSeq)

RefSeqs maintained independently of Annotated Genomes

These reference sequences exist independently of genome builds.

These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.
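
The comparison described above amounts to splitting each RefSeq identifier into its accession and version and checking whether the versions agree. A minimal sketch follows, assuming the usual `ACCESSION.VERSION` form. The function names are hypothetical, and the "annotated" list is made up for illustration because the annotated-build transcript versions are not reproduced in this record.

```python
from typing import Dict, Iterable, Tuple


def split_accession(versioned: str) -> Tuple[str, int]:
    """Split 'NM_021293.3' into ('NM_021293', 3)."""
    accession, dot, version = versioned.rpartition(".")
    if not dot or not accession or not version.isdigit():
        raise ValueError(f"not an ACCESSION.VERSION identifier: {versioned!r}")
    return accession, int(version)


def find_version_mismatches(
    curated: Iterable[str], annotated: Iterable[str]
) -> Dict[str, Tuple[int, int]]:
    """Return {accession: (curated_version, annotated_version)} where the two differ."""
    annotated_versions = dict(split_accession(item) for item in annotated)
    mismatches = {}
    for item in curated:
        accession, version = split_accession(item)
        other = annotated_versions.get(accession)
        if other is not None and other != version:
            mismatches[accession] = (version, other)
    return mismatches


# Curated mRNA RefSeqs listed in this section.
curated = ["NM_001111058.1", "NM_021293.3"]
# Hypothetical versions for the annotated build, for illustration only.
annotated_hypothetical = ["NM_001111058.1", "NM_021293.2"]

print(find_version_mismatches(curated, annotated_hypothetical))
```

Accessions that appear in only one of the two lists are ignored here; a real check might want to report those as well.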

mRNA and Protein(s)

  1. NM_001111058.1 → NP_001104528.1  myeloid cell surface antigen CD33 isoform 1 precursor

    Status: VALIDATED

    Description
    Transcript Variant: This variant (1) represents the longer transcript and encodes the longer protein (isoform 1).
    Source sequence(s)
    AC154108
    Consensus CDS
    CCDS52224.1
    UniProtKB/Swiss-Prot
    A2RT59, Q63994, Q63997
    Related
    ENSMUSP00000004728.6, ENSMUST00000004728.12
    Conserved Domains (2) summary
    cd00096
    Location: 21 → 25
    Ig; Ig strand A [structural motif]
    cl11960
    Location: 21 → 139
    Ig; Immunoglobulin domain
  2. NM_021293.3 → NP_067268.1  myeloid cell surface antigen CD33 isoform 2 precursor

    Status: VALIDATED

    Description
    Transcript Variant: This variant (2) lacks an exon in the 3' coding region, which results in a frameshift compared to variant 1. The resulting protein (isoform 2) is shorter and has a distinct C-terminus compared to isoform 1.
    Source sequence(s)
    AK155745, BC132379
    Consensus CDS
    CCDS21173.1
    UniProtKB/Swiss-Prot
    Q63994
    Related
    ENSMUSP00000146225.2, ENSMUST00000205503.2
    Conserved Domains (2) summary
    smart00410
    Location: 26 → 119
    IG_like; Immunoglobulin like
    cl11960
    Location: 21 → 139
    Ig; Immunoglobulin domain
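
The description of variant 2 rests on a general rule: skipping an exon whose length is not a multiple of three shifts the reading frame of everything downstream, so the C-terminal residues differ. The sketch below illustrates the rule with toy exons. The sequences are invented for the example and are not Cd33 sequence, and the helper names are hypothetical.

```python
from typing import List, Optional


def splice(exons: List[str], skip: Optional[int] = None) -> str:
    """Join exons in order, optionally leaving out the exon at index `skip`."""
    return "".join(exon for i, exon in enumerate(exons) if i != skip)


def codons(cds: str) -> List[str]:
    """Split a coding sequence into complete codons, dropping any trailing remainder."""
    usable = len(cds) - len(cds) % 3
    return [cds[i:i + 3] for i in range(0, usable, 3)]


def causes_frameshift(exon_length: int) -> bool:
    """Removing an exon shifts the frame unless its length is a multiple of 3."""
    return exon_length % 3 != 0


# Toy exons (hypothetical): the middle exon is 4 nt long, so skipping it shifts the frame.
toy_exons = ["ATGGCC", "GATT", "ACGTTGCATAA"]

full_length = splice(toy_exons)
exon_skipped = splice(toy_exons, skip=1)

print(codons(full_length))   # codons of the full-length toy transcript
print(codons(exon_skipped))  # codons after skipping the 4-nt exon
print(causes_frameshift(len(toy_exons[1])))
```

In this toy case the first two codons are shared and every codon after the skipped exon differs. In a real transcript the shifted frame typically runs into a different stop codon, which is how an exon-skipping variant can encode a protein with a distinct C-terminus.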

RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

The following sections contain reference sequences that belong to a specific genome build.

Reference GRCm39 C57BL/6J

Genomic

  1. NC_000073.7 Reference GRCm39 C57BL/6J

    Range
    43176823..43186679 complement
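
The accession, range, and strand given above are enough to request the gene's genomic sequence programmatically. One way to do this is the NCBI E-utilities EFetch endpoint, which accepts `seq_start`, `seq_stop`, and `strand` parameters (`strand=2` selects the minus strand). The sketch below only builds the request URL and does not contact the server. `build_efetch_url` is a hypothetical helper name.

```python
from urllib.parse import urlencode

EFETCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def build_efetch_url(accession: str, start: int, end: int, complement: bool,
                     rettype: str = "fasta") -> str:
    """Build an EFetch URL for a subsequence of a nucleotide record."""
    params = {
        "db": "nuccore",
        "id": accession,
        "seq_start": start,
        "seq_stop": end,
        "strand": 2 if complement else 1,  # 1 = plus strand, 2 = minus strand
        "rettype": rettype,
        "retmode": "text",
    }
    return f"{EFETCH_URL}?{urlencode(params)}"


# Range of Cd33 on GRCm39 as listed above.
url = build_efetch_url("NC_000073.7", 43176823, 43186679, complement=True)
print(url)
```

Fetching the URL (for example with `urllib.request.urlopen`) is left out so the example runs offline; NCBI's usage guidelines on request rates and API keys apply to real requests.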

mRNA and Protein(s)

  1. XM_006540590.4 → XP_006540653.1  myeloid cell surface antigen CD33 isoform X1

    Conserved Domains (1) summary
    cl11960
    Location: 46 → 164
    Ig; Immunoglobulin domain