    Gsg1 germ cell associated 1 [ Mus musculus (house mouse) ]

    Gene ID: 14840, updated on 2-Nov-2024

    Summary

    Official Symbol
    Gsg1 (provided by MGI)
    Official Full Name
    germ cell associated 1 (provided by MGI)
    Primary source
    MGI:MGI:1194499
    See related
    Ensembl:ENSMUSG00000030206 AllianceGenome:MGI:1194499
    Gene type
    protein coding
    RefSeq status
    VALIDATED
    Organism
    Mus musculus
    Lineage
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
    Summary
    Enables RNA polymerase binding activity. Located in endoplasmic reticulum. Orthologous to human GSG1 (germ cell associated 1). [provided by Alliance of Genome Resources, Nov 2024]
    Expression
    Restricted expression toward testis adult (RPKM 1505.6)

    Genomic context

    Location:
    6 G1; 6 66.07 cM
    Exon count:
    8
    Annotation release  Status             Assembly                      Chr  Location
    RS_2024_02          current            GRCm39 (GCF_000001635.27)     6    NC_000072.7 (135214327..135231334, complement)
    108.20200622        previous assembly  GRCm38.p6 (GCF_000001635.26)  6    NC_000072.6 (135237329..135254336, complement)
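The location strings in the annotation table can be read mechanically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the word "complement" marks the minus strand. The helper `parse_location` and its regular expression are illustrative names, not part of any NCBI tool.

```python
import re

# Location strings copied from the annotation table in this record.
LOCATIONS = {
    "GRCm39": "NC_000072.7 (135214327..135231334, complement)",
    "GRCm38.p6": "NC_000072.6 (135237329..135254336, complement)",
}

# Hypothetical pattern for "<accession> (<start>..<end>[, complement])".
LOCATION_RE = re.compile(
    r"(?P<accession>\S+)\s+"
    r"\((?P<start>\d+)\.\.(?P<end>\d+)(?:,\s*(?P<strand>complement))?\)"
)


def parse_location(text):
    """Split a location string into accession, bounds, strand, and length."""
    match = LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    return {
        "accession": match["accession"],
        "start": start,
        "end": end,
        # Assumption: "complement" means the gene lies on the minus strand.
        "strand": "-" if match["strand"] else "+",
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }


if __name__ == "__main__":
    for assembly, text in LOCATIONS.items():
        print(assembly, parse_location(text))
```

Under these assumptions the gene span has the same length on both assemblies, and only the chromosome offset differs between GRCm38.p6 and GRCm39.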

    Chromosome 6 - NC_000072.7
    Neighboring genes:
    • predicted gene, 19434
    • predicted gene, 36533
    • family with sequence similarity 234, member B
    • CapStarr-seq enhancer MGSCv37_chr6:135168385-135168675
    • predicted gene, 57750
    • STARR-seq mESC enhancer starr_17565
    • predicted gene, 36640
    • STARR-seq mESC enhancer starr_17566
    • predicted gene, 36582
    • phosphatidylethanolamine binding protein 2

    Genomic regions, transcripts, and products

    Expression

    • Project title: Mouse ENCODE transcriptome data
    • Description: RNA profiling data sets generated by the Mouse ENCODE project.
    • BioProject: PRJNA66167
    • Publication: PMID 25409824
    • Analysis date: n/a

    Variation

    Alleles

    Alleles of this type are documented at Mouse Genome Informatics (MGI)
    • Endonuclease-mediated (1), 1 citation
    • Targeted (1)

    Interactions

    Products Interactant Other Gene Complex Source Pubs Description

    General gene information

    Markers

    Gene Ontology Provided by MGI

    Function (with evidence code)
    • enables RNA polymerase binding: IPI (Inferred from Physical Interaction)
    • enables protein binding: IPI (Inferred from Physical Interaction)
    Process (with evidence code)
    • involved_in biological_process: ND (No biological Data available)
    Component (with evidence code)
    • located_in endoplasmic reticulum: IDA (Inferred from Direct Assay)
    • located_in endoplasmic reticulum membrane: IEA (Inferred from Electronic Annotation)
    • is_active_in plasma membrane: IBA (Inferred from Biological aspect of Ancestor)

    General protein information

    Preferred Names
    germ cell-specific gene 1 protein
    Names
    germ cell-associated protein 1

    NCBI Reference Sequences (RefSeq)


    RefSeqs maintained independently of Annotated Genomes

    These reference sequences exist independently of genome builds.

    These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.
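The version comparison described here can be sketched in a few lines. This is a minimal illustration, assuming RefSeq identifiers follow the `ACCESSION.VERSION` pattern seen in this record. The functions `split_accession` and `version_mismatches` are hypothetical helpers. The list of accessions from the annotated genome build is not present in this extract, so it is left as an input the reader must supply.

```python
# Curated RefSeq mRNA accessions listed in this record.
CURATED = ["NM_001080552.1", "NM_001080553.1", "NM_010352.2"]


def split_accession(accession):
    """Split 'NM_010352.2' into ('NM_010352', 2)."""
    base, _, version = accession.partition(".")
    if not version.isdigit():
        raise ValueError(f"no numeric version in {accession!r}")
    return base, int(version)


def version_mismatches(curated, annotated):
    """Map each shared accession to (curated_version, annotated_version)
    when the two versions differ."""
    annotated_versions = dict(split_accession(a) for a in annotated)
    mismatches = {}
    for accession in curated:
        base, version = split_accession(accession)
        other = annotated_versions.get(base)
        if other is not None and other != version:
            mismatches[base] = (version, other)
    return mismatches


if __name__ == "__main__":
    # Synthetic identifiers, used only to show the shape of the result.
    print(version_mismatches(["NM_EXAMPLE.1"], ["NM_EXAMPLE.2"]))
```

To use it with real data, pass `CURATED` as the first argument and the accessions reported for the genome build as the second. An empty result means no version mismatch was found among the shared accessions.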

    mRNA and Protein(s)

    1. NM_001080552.1 → NP_001074021.1  germ cell-specific gene 1 protein isoform a

      Status: VALIDATED

      Description
      Transcript Variant: This variant (1) encodes the longer isoform (a).
      Source sequence(s)
      BC023009
      Consensus CDS
      CCDS39682.1
      UniProtKB/Swiss-Prot
      Q8C2N5, Q8R1W2, Q9D9Z3, Q9Z1H7
      Related
      ENSMUSP00000085022.6, ENSMUST00000087729.12
      Conserved Domains (1)
      pfam07803
      Location: 13 → 125
      GSG-1; GSG1-like protein
    2. NM_001080553.1 → NP_001074022.1  germ cell-specific gene 1 protein isoform b

      Status: VALIDATED

      Description
      Transcript Variant: This variant (2) contains a distinct 5' UTR and lacks an in-frame portion of the 5' coding region, compared to variant 1. The resulting isoform (b) has a shorter N-terminus, compared to isoform a. Variants 2 and 3 encode the same isoform.
      Source sequence(s)
      BC023009, BY098420
      Consensus CDS
      CCDS39683.1
      UniProtKB/TrEMBL
      E9QQ16
      Related
      ENSMUSP00000107541.2, ENSMUST00000111910.4
      Conserved Domains (1)
      pfam07803
      Location: 10 → 122
      GSG-1; GSG1-like protein
    3. NM_010352.2 → NP_034482.2  germ cell-specific gene 1 protein isoform b

      Status: VALIDATED

      Description
      Transcript Variant: This variant (3) contains a distinct 5' UTR and lacks an in-frame portion of the 5' coding region, compared to variant 1. The resulting isoform (b) has a shorter N-terminus, compared to isoform a. Variants 2 and 3 encode the same isoform.
      Source sequence(s)
      BC023009, BY098420, CA465575
      Consensus CDS
      CCDS39683.1
      UniProtKB/TrEMBL
      E9QQ16
      Related
      ENSMUSP00000107542.3, ENSMUST00000111911.9
      Conserved Domains (1)
      pfam07803
      Location: 10 → 122
      GSG-1; GSG1-like protein
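The statement that variants 2 and 3 encode the same isoform can be checked from the identifiers listed for each record, since transcripts sharing a Consensus CDS share a coding sequence. The sketch below transcribes the three records from this section into a small list and groups them by CCDS. The `RECORDS` structure and the `group_by_ccds` helper are illustrative only, not an NCBI data format.

```python
from collections import defaultdict

# Transcribed from the three mRNA and protein records in this section.
RECORDS = [
    {"variant": 1, "mrna": "NM_001080552.1", "protein": "NP_001074021.1",
     "isoform": "a", "ccds": "CCDS39682.1"},
    {"variant": 2, "mrna": "NM_001080553.1", "protein": "NP_001074022.1",
     "isoform": "b", "ccds": "CCDS39683.1"},
    {"variant": 3, "mrna": "NM_010352.2", "protein": "NP_034482.2",
     "isoform": "b", "ccds": "CCDS39683.1"},
]


def group_by_ccds(records):
    """Collect mRNA accessions under the Consensus CDS they share."""
    groups = defaultdict(list)
    for record in records:
        groups[record["ccds"]].append(record["mrna"])
    return dict(groups)


if __name__ == "__main__":
    for ccds, mrnas in group_by_ccds(RECORDS).items():
        print(ccds, mrnas)
```

Grouping this way places NM_001080553.1 and NM_010352.2 under CCDS39683.1, which matches the isoform b assignment given for both variants.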

    RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

    The following sections contain reference sequences that belong to a specific genome build.

    Reference GRCm39 C57BL/6J

    Genomic

    1. NC_000072.7 Reference GRCm39 C57BL/6J

      Range
      135214327..135231334 complement