U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

CENPW centromere protein W [ Homo sapiens (human) ]

Gene ID: 387103, updated on 5-Mar-2024

Summary

Official Symbol
CENPWprovided by HGNC
Official Full Name
centromere protein Wprovided by HGNC
Primary source
HGNC:HGNC:21488
See related
Ensembl:ENSG00000203760 MIM:611264; AllianceGenome:HGNC:21488
Gene type
protein coding
RefSeq status
VALIDATED
Organism
Homo sapiens
Lineage
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo
Also known as
CUG2; CENP-W; C6orf173
Summary
Predicted to enable DNA binding activity and protein heterodimerization activity. Involved in chromosome segregation; kinetochore assembly; and mitotic cell cycle. Located in kinetochore and nucleoplasm. [provided by Alliance of Genome Resources, Apr 2022]
Expression
Broad expression in bone marrow (RPKM 2.8), testis (RPKM 2.0) and 20 other tissues See more
Orthologs
NEW
Try the new Gene table
Try the new Transcript table

Genomic context

See CENPW in Genome Data Viewer
Location:
6q22.32
Exon count:
5
Annotation release Status Assembly Chr Location
RS_2023_10 current GRCh38.p14 (GCF_000001405.40) 6 NC_000006.12 (126340115..126483320)
RS_2023_10 current T2T-CHM13v2.0 (GCF_009914755.1) 6 NC_060930.1 (127528692..127671944)
105.20220307 previous assembly GRCh37.p13 (GCF_000001405.25) 6 NC_000006.11 (126661261..126804466)

Chromosome 6 - NC_000006.12Genomic Context describing neighboring genes Neighboring gene NANOG-H3K27ac-H3K4me1 hESC enhancer GRCh37_chr6:126306909-126307739 Neighboring gene NANOG-H3K27ac-H3K4me1 hESC enhancer GRCh37_chr6:126307740-126308569 Neighboring gene RNA, 5S ribosomal pseudogene 216 Neighboring gene histidine triad nucleotide binding protein 3 Neighboring gene tRNA methyltransferase 11 homolog Neighboring gene Sharpr-MPRA regulatory region 12057 Neighboring gene protein phosphatase 1 regulatory inhibitor subunit 14B pseudogene 5 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 25035 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 25036 Neighboring gene microRNA 588 Neighboring gene RNA, U6 small nuclear 200, pseudogene

Genomic regions, transcripts, and products

Expression

  • Project title: HPA RNA-seq normal tissues
  • Description: RNA-seq was performed of tissue samples from 95 human individuals representing 27 different tissues in order to determine tissue-specificity of all protein-coding genes
  • BioProject: PRJEB4337
  • Publication: PMID 24309898
  • Analysis date: Wed Apr 4 07:08:55 2018

Bibliography

GeneRIFs: Gene References Into Functions

What's a GeneRIF?

Phenotypes

EBI GWAS Catalog

Description
A novel common variant in DCST2 is associated with length in early life and height in adulthood.
EBI GWAS Catalog
Common variants at 6q22 and 17q21 are associated with intracranial volume.
EBI GWAS Catalog
Genome-wide association study and meta-analysis find that over 40 loci affect risk of type 1 diabetes.
EBI GWAS Catalog
Genome-wide meta-analysis identifies 11 new loci for anthropometric traits and provides insights into genetic architecture.
EBI GWAS Catalog
Genome-wide trans-ancestry meta-analysis provides insight into the genetic architecture of type 2 diabetes susceptibility.
EBI GWAS Catalog
Hundreds of variants clustered in genomic loci and biological pathways affect human height.
EBI GWAS Catalog
Identification, replication, and fine-mapping of Loci associated with adult height in individuals of african ancestry.
EBI GWAS Catalog
Many sequence variants affecting diversity of adult human height.
EBI GWAS Catalog
Parent-of-origin-specific allelic associations among 106 genomic loci for age at menarche.
EBI GWAS Catalog
Phenotypic dissection of bone mineral density reveals skeletal site specificity and facilitates the identification of novel loci in the genetic regulation of bone mass attainment.
EBI GWAS Catalog
Thirty new loci for age at menarche identified by a meta-analysis of genome-wide association studies.
EBI GWAS Catalog

Pathways from PubChem

Interactions

Products Interactant Other Gene Complex Source Pubs Description

General gene information

Clone Names

  • DKFZp686L1553

Gene Ontology Provided by GOA

Function Evidence Code Pubs
enables DNA binding IEA
Inferred from Electronic Annotation
more info
 
enables protein binding IPI
Inferred from Physical Interaction
more info
PubMed 
enables protein heterodimerization activity IEA
Inferred from Electronic Annotation
more info
 
Process Evidence Code Pubs
involved_in CENP-A containing chromatin assembly TAS
Traceable Author Statement
more info
PubMed 
involved_in cell division IEA
Inferred from Electronic Annotation
more info
 
involved_in chromosome organization IMP
Inferred from Mutant Phenotype
more info
PubMed 
involved_in chromosome segregation IBA
Inferred from Biological aspect of Ancestor
more info
 
involved_in chromosome segregation IMP
Inferred from Mutant Phenotype
more info
PubMed 
involved_in chromosome segregation NAS
Non-traceable Author Statement
more info
PubMed 
involved_in chromosome segregation TAS
Traceable Author Statement
more info
PubMed 
involved_in kinetochore assembly IBA
Inferred from Biological aspect of Ancestor
more info
 
involved_in kinetochore assembly IMP
Inferred from Mutant Phenotype
more info
PubMed 
involved_in mitotic cell cycle IBA
Inferred from Biological aspect of Ancestor
more info
 
involved_in mitotic cell cycle IMP
Inferred from Mutant Phenotype
more info
PubMed 
Component Evidence Code Pubs
located_in chromosome, centromeric region IDA
Inferred from Direct Assay
more info
PubMed 
part_of inner kinetochore IPI
Inferred from Physical Interaction
more info
PubMed 
is_active_in kinetochore IBA
Inferred from Biological aspect of Ancestor
more info
 
located_in kinetochore IDA
Inferred from Direct Assay
more info
PubMed 
located_in nuclear matrix IEA
Inferred from Electronic Annotation
more info
 
located_in nucleolus IEA
Inferred from Electronic Annotation
more info
 
is_active_in nucleoplasm IBA
Inferred from Biological aspect of Ancestor
more info
 
located_in nucleoplasm IDA
Inferred from Direct Assay
more info
 
located_in nucleoplasm TAS
Traceable Author Statement
more info
 
located_in nucleus NAS
Non-traceable Author Statement
more info
PubMed 

General protein information

Preferred Names
centromere protein W
Names
cancer-up-regulated gene 2 protein
cancer-upregulated gene 2

NCBI Reference Sequences (RefSeq)

NEW Try the new Transcript table

RefSeqs maintained independently of Annotated Genomes

These reference sequences exist independently of genome builds. Explain

These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

mRNA and Protein(s)

  1. NM_001012507.4NP_001012525.1  centromere protein W isoform b

    See identical proteins and their annotated locations for NP_001012525.1

    Status: VALIDATED

    Description
    Transcript Variant: This variant (2) uses an alternate in-frame splice site in the 5' coding region, compared to variant 1, resulting in an isoform (b) that is shorter than isoform a.
    Source sequence(s)
    AC020559, BC039556, BM721156
    Consensus CDS
    CCDS34529.1
    UniProtKB/Swiss-Prot
    A6NIR0, A6NJC2, Q5EE01
    Related
    ENSP00000357311.4, ENST00000368328.5
    Conserved Domains (1) summary
    pfam15510
    Location:188
    CENP-W; CENP-W protein
  2. NM_001286524.2NP_001273453.1  centromere protein W isoform a

    See identical proteins and their annotated locations for NP_001273453.1

    Status: VALIDATED

    Description
    Transcript Variant: This variant (1) encodes the longest isoform (a).
    Source sequence(s)
    AC020559, AI207187, AL832235, BM721156
    Consensus CDS
    CCDS69196.1
    UniProtKB/Swiss-Prot
    Q5EE01
    Related
    ENSP00000357308.1, ENST00000368325.5
    Conserved Domains (1) summary
    pfam15510
    Location:1103
    CENP-W; CENP-W protein
  3. NM_001286525.2NP_001273454.1  centromere protein W isoform c

    See identical proteins and their annotated locations for NP_001273454.1

    Status: VALIDATED

    Description
    Transcript Variant: This variant (3) uses two alternate splice sites that result in a frameshifted 3' coding region, compared to variant 1. The encoded isoform (c) has a distinct C-terminus and is shorter than isoform a.
    Source sequence(s)
    AC020559, BM721156, BM874602
    Consensus CDS
    CCDS75516.1
    UniProtKB/TrEMBL
    A0A0A0MRK5
    Related
    ENSP00000357309.1, ENST00000368326.5
    Conserved Domains (1) summary
    pfam15510
    Location:142
    CENP-W; CENP-W protein

RNA

  1. NR_104462.2 RNA Sequence

    Status: VALIDATED

    Description
    Transcript Variant: This variant (4) uses an alternate splice site in the 5' region and contains alternate 3' exon structure, compared to variant 1. This variant is represented as non-coding because the use of the 5'-most expected translational start codon, as used in variant 1, renders the transcript a candidate for nonsense-mediated mRNA decay (NMD).
    Source sequence(s)
    AL356534, AL832235

RefSeqs of Annotated Genomes: GCF_000001405.40-RS_2023_10

The following sections contain reference sequences that belong to a specific genome build. Explain

Reference GRCh38.p14 Primary Assembly

Genomic

  1. NC_000006.12 Reference GRCh38.p14 Primary Assembly

    Range
    126340115..126483320
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)

mRNA and Protein(s)

  1. XM_017010845.2XP_016866334.1  centromere protein W isoform X1

    UniProtKB/TrEMBL
    A0A0A0MRK5
    Conserved Domains (1) summary
    pfam15510
    Location:142
    CENP-W; CENP-W protein

Alternate T2T-CHM13v2.0

Genomic

  1. NC_060930.1 Alternate T2T-CHM13v2.0

    Range
    127528692..127671944
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)

mRNA and Protein(s)

  1. XM_054355393.1XP_054211368.1  centromere protein W isoform X1

    UniProtKB/TrEMBL
    A0A0A0MRK5