Sample GSM6482145
Status Public on Dec 15, 2022
Title LibFR012_LibFR015_humantwistmix_STARRseq_rep1
Sample type SRA
 
Source name HCT-116 cells
Organism Homo sapiens
Characteristics cell line: HCT-116
sample type: PCR amplified cDNA (STARR-seq transcript)
Growth protocol Human colon cancer cell line HCT116 was grown in DMEM supplemented with 10% heat-inactivated FBS and 1% L-Glutamine at 37°C. Cells were passaged every 2-3 days.
Extracted molecule polyA RNA
Extraction protocol 6 h after electroporation, total RNA was extracted using the RNeasy Maxi kit (Qiagen; cat. no. 75162), followed by polyA+ RNA isolation using Invitrogen Dynabeads Oligo(dT)25 (scaling up the manufacturer’s protocol accordingly; cat. no. 61005) and DNase treatment with Ambion Turbo DNase (cat. no. AM2239) at a concentration of at most 200 ng/µl for 30 minutes (min) at 37°C. The reactions were then subjected to Qiagen RNeasy MinElute reaction clean-up (cat. no. 74204) for Turbo DNase inactivation and RNA concentration. After reverse transcription and second strand synthesis, a unique molecular identifier (UMI) was added to each transcript. This was followed by two nested PCR steps, each with primers specific to the reporter transcripts, so that STARR-seq does not detect endogenous cellular RNAs. See Neumayr et al., Curr. Protoc. Mol. Biol. 2019.
Oligo libraries were synthesized by Twist Bioscience, each oligo including a 249 bp enhancer sequence and adaptors for library cloning. Following the instructions of NEBNext® DNA Library Prep Reagent Set for Illumina® (NEB; cat. no. E6000L), Illumina Multiplexing Adapters (Illumina Inc; cat. no. PE-400-1001) were ligated and homology arms for In-Fusion® recombination were added by PCR, followed by recombination into the human STARR-seq vector. The In-Fusion® reactions were transformed (MegaX DH10B; Invitrogen) and grown in liquid culture, and plasmids were isolated. See Neumayr et al., Curr. Protoc. Mol. Biol. 2019.
STARR-seq
 
Library strategy OTHER
Library source transcriptomic
Library selection other
Instrument model NextSeq 550
 
Data processing Oligo library UMI-STARR-seq RNA and DNA input reads (150 bp) were mapped with Bowtie v.1.2.2 to a reference of 249 bp sequences comprising both wildtype and mutated fragments from the human library. We demultiplexed reads by the i5 and i7 indexes and subsequently by oligo identity, since this library contained oligos from other experiments. Mapped reads with the correct length and strand and with no mismatches (to identify all sequence variants) were kept. Both DNA and RNA reads were collapsed by UMI (10 bp, allowing one mismatch; UMI sequences are in the sequencing read name) to ensure that unique reporter transcripts were counted.
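The UMI collapsing step above can be illustrated with a minimal sketch. It assumes reads have already been reduced to (oligo ID, UMI) pairs and uses a simple greedy merge of UMIs within one mismatch of an already accepted UMI; the function names and the greedy strategy are assumptions for illustration, not the pipeline used for this record.

```python
from collections import Counter

def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))

def collapse_umis(umis, max_mismatch=1):
    """Greedy collapse (illustrative): count UMIs that are not within
    max_mismatch of an already accepted, more abundant UMI."""
    counts = Counter(umis)
    representatives = []
    # Visit the most abundant UMIs first; ties are broken alphabetically.
    for umi, _ in sorted(counts.items(), key=lambda kv: (-kv[1], kv[0])):
        is_duplicate = any(
            len(umi) == len(rep) and hamming(umi, rep) <= max_mismatch
            for rep in representatives
        )
        if not is_duplicate:
            representatives.append(umi)
    return len(representatives)

def count_unique_transcripts(reads):
    """reads: iterable of (oligo_id, umi) pairs. Returns {oligo_id: unique count}."""
    by_oligo = {}
    for oligo_id, umi in reads:
        by_oligo.setdefault(oligo_id, []).append(umi)
    return {oligo_id: collapse_umis(umis) for oligo_id, umis in by_oligo.items()}
```

For example, two reads of the same oligo whose 10 bp UMIs differ at a single position would be counted as one transcript, while a third read with an unrelated UMI would be counted separately.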
We excluded oligos with fewer than 10 reads in any of the input replicates and added a pseudocount of one read to oligos with zero RNA counts. The enhancer activity of each oligo in each screen was calculated with DESeq2 as the log2 fold-change over input, using both replicates. We used the counts of wildtype negative regions in each library as scaling factors between samples. This normalization only changes the position of the zero and consequently does not affect the calculation of log2 fold-changes or the p-values for the statistical tests used.
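The filtering and pseudocount rules above can be sketched as follows. This is a simplified illustration, not DESeq2: it pools replicates and takes a plain log2 ratio of RNA over input, scaled by the counts of negative regions. The data layout, the function names, and the choice to apply the pseudocount per replicate are assumptions.

```python
import math

def filter_and_pseudocount(counts, min_input=10):
    """counts: {oligo_id: {"input": [rep1, rep2], "rna": [rep1, rep2]}}.
    Drops oligos with fewer than min_input reads in any input replicate and
    replaces zero RNA counts with 1 (assumed here to apply per replicate)."""
    kept = {}
    for oligo_id, c in counts.items():
        if min(c["input"]) < min_input:
            continue
        rna = [r if r > 0 else 1 for r in c["rna"]]
        kept[oligo_id] = {"input": list(c["input"]), "rna": rna}
    return kept

def naive_log2_activity(kept, negative_ids):
    """Plain log2(RNA / input) on pooled replicates, scaled so that the
    pooled negative regions have activity 0. A stand-in for DESeq2 only."""
    neg_input = sum(sum(kept[o]["input"]) for o in negative_ids if o in kept)
    neg_rna = sum(sum(kept[o]["rna"]) for o in negative_ids if o in kept)
    if neg_input == 0 or neg_rna == 0:
        raise ValueError("no usable negative regions for scaling")
    scale = neg_input / neg_rna
    return {
        oligo_id: math.log2(sum(c["rna"]) * scale / sum(c["input"]))
        for oligo_id, c in kept.items()
    }
```

In this sketch, an oligo with four times as many RNA reads per input read as the negative regions gets an activity of 2.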
Assembly: hg19 (custom oligo library)
Supplementary files format and content: Table of all human enhancer sequences and their mutant sequences included in the oligo library (n=31709), with genomic coordinates, oligo sequence, experiment, wildtype and pasted motif information, read counts for each screen and final enhancer activity (log2).
 
Submission date Aug 19, 2022
Last update date Dec 15, 2022
Contact name Bernardo P de Almeida
E-mail(s) bernardo.almeida@imp.ac.at
Organization name Research Institute of Molecular Pathology (IMP)
Lab Stark Lab
Street address Campus-Vienna-Biocenter 1
City Wien
ZIP/Postal code 1030
Country Austria
 
Platform ID GPL21697
Series (2)
GSE211657 Enhancers display sequence flexibility constrained by transcription factor motif syntax [Human motif pasting STARR-seq]
GSE211659 Enhancers display sequence flexibility constrained by transcription factor motif syntax
Relations
BioSample SAMN30413382
SRA SRX17154878

Supplementary data files not provided
Raw data are available in SRA
Processed data are available on Series record
