U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

PAEP progestagen associated endometrial protein [ Homo sapiens (human) ]

Gene ID: 5047, updated on 5-Mar-2024

Summary

Official Symbol
PAEPprovided by HGNC
Official Full Name
progestagen associated endometrial proteinprovided by HGNC
Primary source
HGNC:HGNC:8573
See related
Ensembl:ENSG00000122133 MIM:173310; AllianceGenome:HGNC:8573
Gene type
protein coding
RefSeq status
REVIEWED
Organism
Homo sapiens
Lineage
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo
Also known as
GD; GdA; GdF; GdS; PEP; PAEG; PP14; ZIF-1
Summary
This gene is a member of the kernel lipocalin superfamily whose members share relatively low sequence similarity but have highly conserved exon/intron structure and three-dimensional protein folding. Most lipocalins are clustered on the long arm of chromosome 9. The encoded glycoprotein has been previously referred to as pregnancy-associated endometrial alpha-2-globulin, placental protein 14, and glycodelin, but has been officially named progestagen-associated endometrial protein. Three distinct forms, with identical protein backbones but different glycosylation profiles, are found in amniotic fluid, follicular fluid and seminal plasma of the reproductive system. These glycoproteins have distinct and essential roles in regulating a uterine environment suitable for pregnancy and in the timing and occurrence of the appropriate sequence of events in the fertilization process. Alternative splicing results in multiple transcript variants. [provided by RefSeq, Oct 2015]
Expression
Restricted expression toward endometrium (RPKM 125.9) See more
Orthologs
NEW
Try the new Gene table
Try the new Transcript table

Genomic context

See PAEP in Genome Data Viewer
Location:
9q34.3
Exon count:
7
Annotation release Status Assembly Chr Location
RS_2023_10 current GRCh38.p14 (GCF_000001405.40) 9 NC_000009.12 (135561756..135566955)
RS_2023_10 current T2T-CHM13v2.0 (GCF_009914755.1) 9 NC_060933.1 (147789564..147794795)
105.20220307 previous assembly GRCh37.p13 (GCF_000001405.25) 9 NC_000009.11 (138453602..138458801)

Chromosome 9 - NC_000009.12Genomic Context describing neighboring genes Neighboring gene odorant binding protein 2A Neighboring gene uncharacterized LOC107987040 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr9:138457239-138457870 Neighboring gene long intergenic non-protein coding RNA 1502 Neighboring gene uncharacterized LOC105376316 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr9:138479168-138479918 Neighboring gene progestagen associated endometrial protein pseudogene 1

Genomic regions, transcripts, and products

Expression

  • Project title: HPA RNA-seq normal tissues
  • Description: RNA-seq was performed of tissue samples from 95 human individuals representing 27 different tissues in order to determine tissue-specificity of all protein-coding genes
  • BioProject: PRJEB4337
  • Publication: PMID 24309898
  • Analysis date: Wed Apr 4 07:08:55 2018

Bibliography

GeneRIFs: Gene References Into Functions

What's a GeneRIF?

HIV-1 interactions

Protein interactions

Protein Gene Interaction Pubs
Envelope surface glycoprotein gp120 env 3-hydroxyphthalic anhydride (3HP) modified glycodelin GdA and GdS proteins inhibit HIV-1 gp120-CD4 binding and induce anti-HIV activity PubMed

Go to the HIV-1, Human Interaction Database

Interactions

Products Interactant Other Gene Complex Source Pubs Description

General protein information

Preferred Names
glycodelin
Names
PEG
PP14 protein (placental protein 14)
alpha uterine protein
glycodelin-A
glycodelin-F
glycodelin-S
placental protein 14
pregnancy-associated endometrial alpha-2 globulin
progestagen-associated endometrial protein (placental protein 14, pregnancy-associated endometrial a
progesterone-associated endometrial protein
zona-binding inhibitory factor-1

NCBI Reference Sequences (RefSeq)

NEW Try the new Transcript table

RefSeqs maintained independently of Annotated Genomes

These reference sequences exist independently of genome builds. Explain

These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

mRNA and Protein(s)

  1. NM_001018048.2NP_001018058.1  glycodelin isoform 2 precursor

    Status: REVIEWED

    Description
    Transcript Variant: This variant (3) uses an alternate, in-frame splice site in the 5' coding region, compared to variant 1. It encodes isoform 2 which is shorter than isoform 1.
    Source sequence(s)
    AK304657, AL050169, AL354761, DC421713
    UniProtKB/TrEMBL
    B4E3C0
    Conserved Domains (1) summary
    pfam00061
    Location:32153
    Lipocalin; Lipocalin / cytosolic fatty-acid binding protein family
  2. NM_001018049.3NP_001018059.1  glycodelin isoform 1 precursor

    See identical proteins and their annotated locations for NP_001018059.1

    Status: REVIEWED

    Description
    Transcript Variant: This variant (1) represents the longest transcript. Variants 1 and 2 encode the same protein.
    Source sequence(s)
    AK309983, AL050169, AL354761, BC113728, M61886
    Consensus CDS
    CCDS35173.1
    UniProtKB/Swiss-Prot
    P09466, Q5T6T1, Q9UG92
    UniProtKB/TrEMBL
    B2R4F9
    Related
    ENSP00000277508.5, ENST00000277508.9
    Conserved Domains (1) summary
    pfam00061
    Location:34175
    Lipocalin; Lipocalin / cytosolic fatty-acid binding protein family
  3. NM_002571.4NP_002562.2  glycodelin isoform 1 precursor

    See identical proteins and their annotated locations for NP_002562.2

    Status: REVIEWED

    Description
    Transcript Variant: This variant (2) uses an alternate, in-frame splice site in the 3' UTR, compared to variant 1. It encodes the same protein as variant 1.
    Source sequence(s)
    AK311813, AL050169, AL354761, J04129
    Consensus CDS
    CCDS35173.1
    UniProtKB/Swiss-Prot
    P09466, Q5T6T1, Q9UG92
    UniProtKB/TrEMBL
    B2R4F9
    Related
    ENSP00000417898.1, ENST00000479141.6
    Conserved Domains (1) summary
    pfam00061
    Location:34175
    Lipocalin; Lipocalin / cytosolic fatty-acid binding protein family

RefSeqs of Annotated Genomes: GCF_000001405.40-RS_2023_10

The following sections contain reference sequences that belong to a specific genome build. Explain

Reference GRCh38.p14 Primary Assembly

Genomic

  1. NC_000009.12 Reference GRCh38.p14 Primary Assembly

    Range
    135561756..135566955
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)

mRNA and Protein(s)

  1. XM_011518746.3XP_011517048.1  glycodelin isoform X1

    See identical proteins and their annotated locations for XP_011517048.1

    Conserved Domains (1) summary
    pfam00061
    Location:34199
    Lipocalin; Lipocalin / cytosolic fatty-acid binding protein family
  2. XM_011518745.3XP_011517047.1  glycodelin isoform X1

    See identical proteins and their annotated locations for XP_011517047.1

    Conserved Domains (1) summary
    pfam00061
    Location:34199
    Lipocalin; Lipocalin / cytosolic fatty-acid binding protein family
  3. XM_011518752.3XP_011517054.1  glycodelin isoform X5

    See identical proteins and their annotated locations for XP_011517054.1

    UniProtKB/TrEMBL
    A6XNE0
    Related
    ENSP00000484659.1, ENST00000611414.4
    Conserved Domains (1) summary
    pfam00061
    Location:34104
    Lipocalin; Lipocalin / cytosolic fatty-acid binding protein family
  4. XM_011518748.2XP_011517050.1  glycodelin isoform X2

    UniProtKB/TrEMBL
    B4E3C0
    Conserved Domains (1) summary
    pfam00061
    Location:32177
    Lipocalin; Lipocalin / cytosolic fatty-acid binding protein family
  5. XM_011518751.2XP_011517053.1  glycodelin isoform X4

    See identical proteins and their annotated locations for XP_011517053.1

    UniProtKB/TrEMBL
    B4E3C0
    Conserved Domains (1) summary
    pfam00061
    Location:32153
    Lipocalin; Lipocalin / cytosolic fatty-acid binding protein family
  6. XM_017014782.3XP_016870271.1  glycodelin isoform X6

  7. XM_011518747.2XP_011517049.1  glycodelin isoform X1

    See identical proteins and their annotated locations for XP_011517049.1

    Conserved Domains (1) summary
    pfam00061
    Location:34199
    Lipocalin; Lipocalin / cytosolic fatty-acid binding protein family
  8. XM_011518749.2XP_011517051.1  glycodelin isoform X3

    See identical proteins and their annotated locations for XP_011517051.1

    UniProtKB/Swiss-Prot
    P09466, Q5T6T1, Q9UG92
    UniProtKB/TrEMBL
    B2R4F9
    Related
    ENSP00000360831.1, ENST00000371766.6
    Conserved Domains (1) summary
    pfam00061
    Location:34175
    Lipocalin; Lipocalin / cytosolic fatty-acid binding protein family
  9. XM_017014783.2XP_016870272.1  glycodelin isoform X7

Alternate T2T-CHM13v2.0

Genomic

  1. NC_060933.1 Alternate T2T-CHM13v2.0

    Range
    147789564..147794795
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)