NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Series GSE117594 Query DataSets for GSE117594
Status Public on Jan 08, 2019
Title High-throughput functional analysis of lncRNA core promoters elucidates rules governing tissue specificity
Organisms Homo sapiens; synthetic construct
Experiment type Other
Summary Bidirectional transcription initiates at both coding and non-coding genomic elements, including mRNA and long non-coding RNA (lncRNA) promoters and enhancer RNAs (eRNAs). However, each class has different tissue-specific expression profiles with lncRNAs and eRNAs being the most tissue-specific. How these complex differences in expression profiles and tissue-specificities are encoded in a single DNA sequence, however, remains an open question. Here, we address this question using multiple computational and experimental approaches, including massively parallel reporter assays (MPRA). As most transcription factors (TFs) are enriched near the transcription start sites (TSSs) of both promoters and enhancers, we focus our analyses on these core promoter regions. We find that divergent lncRNA and mRNA core promoters have higher capacities to drive transcription than non-divergent lncRNA and mRNA core promoters, respectively. Conversely, lincRNAs and eRNAs are more tissue-specific than divergent genes. This higher tissue-specificity is strongly associated with having less complex TF motif profiles at the core promoter. We confirm these findings using single-nucleotide deletions in MPRA and we identify specific TFs regulating a set of disease-related lncRNAs. Finally, we assess the effects of genetic variation at core promoters and find that 22% of common single nucleotide polymorphisms show significant regulatory effects. Collectively, our findings characterize the important role of core promoter sequences in determining expression levels across both coding and non-coding gene classes and highlight an unexpected role of TF motif architecture in explaining the more restricted expression patterns of lncRNAs and eRNAs.
 
Overall design We designed two MPRAs: (1) a pool of 120,000 oligos corresponding to ~2,000 unique core promoter regions (POOL1) and (2) a pool of 119,082 oligos corresponding to single-nucleotide deletions across ~30 core promoter regions (POOL2). For POOL1, we performed the MPRA both with a minimal promoter (pMPRA1) and without a minimal promoter (pNoCMVMPRA1) to determine whether this had any effect on the assay results. We analyzed POOL1 in HeLa (4 replicates with and without minimal promoter), HepG2 (12 replicates with a minimal promoter, 4 replicates without), and K562 (4 replicates with and without minimal promoter). We analyzed POOL2 in HepG2 (8 replicates) and K562 (4 replicates). For each experimental set-up, we provide the raw fastqs of the input DNA library and the output RNA barcodes. We also include processed data in the form of DNA and RNA barcode counts.
 
Contributor(s) Melé M, Mattioli K, Gerhardinger C, Maass PG, Rinn JL
Citation(s) 30683753
Submission date Jul 24, 2018
Last update date Mar 28, 2019
Contact name Martha L. Bulyk
Organization name Brigham and Women's Hospital
Department Division of Genetics
Lab Bulyk Lab
Street address 77 Avenue Louis Pasteur, Rm 468
City Boston
State/province MA
ZIP/Postal code 02115
Country USA
 
Platforms (2)
GPL16791 Illumina HiSeq 2500 (Homo sapiens)
GPL19604 Illumina HiSeq 2500 (synthetic construct)
Samples (47)
GSM3304485 POOL1__pMPRA1__input_DNA
GSM3304486 POOL1__pNoCMVMPRA1__input_DNA
GSM3304487 POOL2__pMPRA1__input_DNA
Relations
BioProject PRJNA482608
SRA SRP154997

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE117594_RAW.tar 24.1 Mb (http)(custom) TAR (of TXT)
SRA Run SelectorHelp
Raw data are available in SRA
Processed data provided as supplementary file

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap