U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

SRX4282109: RNA-Seq of human: fetal and adult brain
1 PACBIO_SMRT (PacBio RS II) run: 180,731 spots, 213.8M bases, 18.4Mb downloads

Design: Long-read sequencing of RNA from human brain enriched for targeted human-specific duplicate genes (probe panel 2, Dougherty et.al, "Transcriptional fates of human-specific segmental duplications " )
Submitted by: University of Washington
Study: Transcriptional fates of human-specific segmental duplications
show Abstracthide Abstract
High-quality sequence assembly and accurate gene annotation are critical to understanding gene evolution but are often complicated in regions of segmental duplications (SDs). Although such regions are particularly important for gene innovation, a basic understanding (e.g., gene structure and protein-coding potential) is still incomplete, incorrect, or lacking for many duplicate genes. We developed a method to yield full-length transcript information and confidently distinguish between nearly identical genes/paralogs. We used biotinylated probes to enrich for full-length cDNA from duplicated regions, which were then amplified, size-fractionated, and sequenced using single-molecule, long-read sequencing technology, permitting us to distinguish between highly identical genes by virtue of multiple paralogous sequence variants. We examined 19 gene families as expressed in developing and adult human brain, selected for their high sequence identity (average >99%) and overlap with human-specific SDs. We characterized the transcriptional differences between related paralogs to better understand the birth-death process of duplicate genes and particularly how the process leads to gene innovation. In 48% of the cases, we find that the expressed duplicates have changed substantially from their ancestral models due to novel sites of transcription initiation, splicing, and polyadenylation, as well as fusion transcripts that connect duplication-derived exons with neighboring genes. This transcriptional diversity occurs early during evolution likely in in the absence of selection. We detect unannotated open-reading frames in genes currently annotated as pseudogenes, while relegating other duplicates to pseudogene status. Our method significantly improves gene annotation, specifically defining full-length transcripts, isoforms, and open-reading frames for new genes in highly identical SDs. The approach will be more broadly applicable to genes in structurally complex regions of other genomes where the duplication process creates novel genes important for adaptive traits.
Sample: fetal and adult brain
SAMN09469431 • SRS3446957 • All experiments • All runs
Organism: Homo sapiens
Library:
Name: ProbePanel 2: fetal and adult brain
Instrument: PacBio RS II
Strategy: RNA-Seq
Source: TRANSCRIPTOMIC
Selection: DNase
Layout: SINGLE
Runs: 1 run, 180,731 spots, 213.8M bases, 18.4Mb
Run# of Spots# of BasesSizePublished
SRR7410571180,731213.8M18.4Mb2018-08-08

ID:
5769494

Supplemental Content

Recent activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...