GEO Accession viewer

NCBI > GEO > Accession Display

Not logged in | Login

GEO help: Mouse over screen elements for information.

Sample GSM6482120

Query DataSets for GSM6482120

Status

Public on Dec 15, 2022

Title

ZnT63Cenhancer libraries rep1 STARR-seq

Sample type

SRA

Source name

S2 cells

Organism

Drosophila melanogaster

Characteristics

cell line: S2
sample type: PCR amplified cDNA (STARR-seq transcript)
promoter: developmental (DSCP)

Growth protocol

Schneider 2 cells were grown in Schneider’s Medium supplemented with 10% heat inactivated FBS at 27ºC. Cells were passaged every 2-3 days.

Extracted molecule

polyA RNA

Extraction protocol

24h after electroporation, total RNA was extracted using the RNeasy Maxi kit (Qiagen; cat. no. 75162), followed by polyA+ RNA isolation using Invitrogen Dynabeads Oligo(dT)25 (scaling up the manufacturer’s protocol accordingly; cat. no. 61005) and DNase treatment with Ambion Turbo DNase (cat. no. AM2239) at a concentration of at most 200 ng/µl for 30 minutes (min) at 37°C. The reactions were then subjected to Qiagen RNeasy MinElute reaction clean-up (cat. no. 74204), for Turbo DNase inactivation and RNA concentration. After reverse transcription and second strand synthesis a unique molecular identifier (UMI) was added to each transcript. This is followed by two nested PCR steps, each with primers that are specific to the reporter transcripts such that STARR-seq does not detect endogenous cellular RNAs. See Neumayr et al., Curr. Protoc. Mol. Biol. 2019.
Random 8nt variant libraries were generated using a PCR approach with degenerate oligonucleotides. Forward primers were designed to anneal directly downstream of the enhancer position of interested followed by 8 degenerate bp (creating 65,536 variants) and another 20 bp complementary stretch. Reverse primers were complementary to the 20 bp 5’ of the degenerate stretch. The STARR-seq vector containing the wildtype enhancer of interest (either ced-6 or ZnT63C) was used as a template for the PCR. The PCR was run across the whole STARR-seq plasmid, followed by DpnI digest and a Gibson reaction that re-circularizes the plasmid. Libraries were grown in 2l LB-Amp (final ampicillin concentration 100µg/mL). Variant libraries of the same enhancer i.e. ced-6 enhancer pos110, pos182, pos230, pos241 and ZnT63C enhancer pos142, pos180, pos210 were pooled to equimolar ratio, together with another synthetic oligo library containing wt enhancer sequences and negative regions.
STARR-seq

Library strategy

OTHER

Library source

transcriptomic

Library selection

other

Instrument model

Illumina NovaSeq 6000

Data processing

Dedicated bowtie indices were created for each enhancer position’s N8 library and combined with an oligo library of thousands of wildtype enhancers and negative sequences (de Almeida et al., 2022) for normalization, all 249 bp-long sequences. UMI-STARR-seq RNA and DNA input reads (paired-end 150 bp) were mapped to these dedicated bowtie indices using Bowtie v.1.2.2 (Langmead et al., 2009). Since the N8 variants were all positioned in the last 150 nt of each enhancer, we allowed for flexible mapping in the beginning of the fragments to increase the number of mapped reads while keeping high sensitivity for the different enhancer variants. Specifically, we trimmed the forward reads to 36 bp and mapped them to the indices allowing for 3 mismatches; the full 150 bp-long reverse reads were mapped with no mismatches, to identify all sequence variants; paired-end reads with the correct position, length and strand were kept. This mapping strategy was used for both DNA and RNA reads. For paired-end DNA and RNA reads that mapped to the same variant, we collapsed those that have identical UMIs (10 bp, allowing one mismatch) to ensure the counting of unique molecules.
We excluded oligos with less than 5 reads in any of the input replicates and less than 1 read in any of the RNA replicates. The enhancer activity of each sequence in each screen was calculated as the log2 fold-change over input, using all replicates, with DESeq2 (Love et al., 2014). We used the counts of wildtype negative regions in each library as scaling factors between samples.
Assembly: dm3 (custom library)
Supplementary files format and content: Table of all enhancers with random 8nt variants at different positions (n=536279), with genomic coordinates, DNA sequence of variants, read counts and enhancer activity (log2) in each library.

Submission date

Aug 19, 2022

Last update date

Dec 15, 2022

Contact name

Bernardo P de Almeida

E-mail(s)

bernardo.almeida@imp.ac.at

Organization name

Research Institute of Molecular Pathology (IMP)

Lab

Stark Lab

Street address

Campus-Vienna-Biocenter 1

City

Wien

ZIP/Postal code

1030

Country

Austria

Platform ID

GPL25244

Series (2)

GSE211655	Enhancers display sequence flexibility constrained by transcription factor motif syntax [Drosophila random variant STARR-seq]
GSE211659	Enhancers display sequence flexibility constrained by transcription factor motif syntax

Relations

BioSample

SAMN30413394

SRA

SRX17155289

Supplementary data files not provided

SRA Run Selector

Raw data are available in SRA

Processed data are available on Series record