|
|
GEO help: Mouse over screen elements for information. |
|
Status |
Public on Dec 21, 2022 |
Title |
dTAGSET1AB_ZC3H4_TTseq_Rep1_0hr |
Sample type |
SRA |
|
|
Source name |
4sU-labelled mouse embryonic stem cells with 4sU-labelled Drosophila SG4 cells spike-in, untreated control, dTAG-SET1A/B/ZC3H4
|
Organisms |
Drosophila melanogaster; Mus musculus |
Characteristics |
cell line: dTAG-SET1A/B/ZC3H4 condition: 4sU-labelled mouse embryonic stem cells with 4sU-labelled Drosophila SG4 cells spike-in, untreated control, dTAG-SET1A/B/ZC3H4 treatment: untreated treatment time: 0hr additional treatment: 500 µM 4-thiouridine additional treatment time: 15min spike-in reference organism: Drosophila melanogaster spike-in cell line: SG4 protocol: calibrated TT-seq
|
Treatment protocol |
Cell lines expressing dTAG fusion proteins were treated with 100 nM dTAG-13 (produced by Behnam Nabet and Nathanael Gray1 or Carole Bataille and Angela Russell) to induce protein depletion.
|
Growth protocol |
Mouse embryonic stem cells were grown in Dulbecco’s Modified Eagle Medium (Thermo Fisher Scientific) supplemented with fetal bovine serum (FBS, 15% Biosera or 10% Sigma), 1x non-essential amino acids (Thermo Fisher Scientific), 2 mM L-glutamine (Thermo Fisher Scientific), 1x penicillin/streptomycin (Thermo Fisher Scientific), 0.5 mM beta-mercaptoethanol (Thermo Fisher Scientific), and 10 ng/ml leukaemia inhibitory factor (produced in-house). ESCs were grown on gelatinised plates at 37⁰C and 5% CO2.
|
Extracted molecule |
total RNA |
Extraction protocol |
cTT-seq was performed largely as described previously (Gregersen et al., 2020). In brief, 9 million ESCs and 3 million Drosophila SG4 cells were labelled with 500 µM 4-thiouridine (4sU, Glentham Life Sciences) for 15 min and harvested into TRIzol reagent. 4sU-labelled mouse and Drosophila cells were mixed and RNA was extracted using the Direct-zol DNA/RNA Miniprep kit (Zymo Research) as per the manufacturer’s protocol. gDNA was depleted using the TURBO DNA-free Kit (Thermo Fisher Scientific). An equal quantity of RNA (60-80 µg) was taken into 100 µl nuclease free water and fragmented on ice with 20 µl 1M NaOH for 20 min. Fragmentation was stopped with 80 μl 1 M Tris, pH 6.8 and the RNA was cleaned up with Micro Bio-Spin P-30 gel columns (Biorad). RNA was biotin-labelled with 50 μl 0.1 mg/ml MTSEA biotin-XX linker (Biotium) with 3 μl biotin buffer (833 mM Tris HCl, pH 7.4, 83.3 mM EDTA) for 30 min at RT. Biotin-labelled RNA was purified with a 1:1 ratio of Phenol/Chloroform/Isoamyl alcohol (Thermo Fisher Scientific). Streptavidin pull-down was performed with the μMACS Streptavidin Kit (Miltenyi Biotec), washing the columns three times with 55°C pull-down wash buffer (100 mM Tris HCl, pH 7.4, 10 mM EDTA, 1 M NaCl and 0.1% Tween 20) and 3x RT pull down wash buffer. Biotin-labelled RNA was eluted with 100 μl elution buffer (100 mM DTT in nuclease-free water) and cleaned up with the RNeasy MinElute Cleanup kit (QIAGEN), adjusting the amount of ethanol to capture RNA < 200 nucleotides in length. RNA was quantified using the Qubit RNA HS assay kit. RNA libraries were prepared from 20-50 ng RNA with the Ultra II Directional RNA library prep kit, as per the manufacturer’s guidelines for rRNA depleted and FFPE RNA (NEB).
|
|
|
Library strategy |
OTHER |
Library source |
transcriptomic |
Library selection |
other |
Instrument model |
Illumina NextSeq 500 |
|
|
Data processing |
Library strategy: TT-seq Reads that aligned to the mm10 and dm6 rDNA genomic sequences (GenBank: BK000964.3 and M21017.1) were first identified using Bowtie2 with ‘-very-fast’, ‘-no-mixed’ and ‘-no-discordant’ options) and discarded. Unmapped reads were then aligned to the concatenated mm10 and dm6 genomes using STAR. To improve mapping of intronic sequences, reads that failed to map using STAR were aligned using Bowtie2 with ‘-sensitive-local’, ‘-no-mixed’ and ‘-no-discordant’ options. Uniquely aligned reads from the last two steps were combined for further analysis and PCR duplicates were removed using Sambamba. To internally calibrate TT-seq, we spiked-in a fixed number of Drosophila SG4 to each experimental sample. For data visualisation, mm10 reads were randomly subsampled using factors that reflected the total number of dm6 reads in each sample. Read counts from biological triplicates were determined using custom scripts utilising SAMTools for a custom-built, non-redundant mm10 gene set. Briefly, mm10 refGene genes were filtered to remove very short genes with poor sequence mappability and highly similar transcripts which resulted in the final set of 20,633 genes. Raw mm10 reads prior to spike-in normalisation were used for read counts quantitation for differential expression analysis. The final set of 20,633 genes were used for differential analysis using a custom R script adapting DESeq2 (Love et al., 2014) for spike-in calibrated TT-seq data. To incorporate spike-in calibration into this analysis, read counts for the spike-in genome at a control set of intervals were supplied to calculate DESeq2 size factors which were then used for DESeq2 normalisation of raw mm10 read counts. A set of unique dm6 refGene genes was used for spike-in normalisation of TT-seq. For a change to be called significant, we applied a threshold of p-adj < 0.05 and fold change > 1.5. Assembly: mm10, dm6 Supplementary files format and content: Stranded bigWig files were generated using genomeCoverageBed from BEDTools representing genome coverage of merged spike-in normalised biological replicates. Supplementary files format and content: DESeq2 results tables for differential gene expression analysis containing normalised read counts, raw and shrunk log2-fold changes (LFC), and statistical significance levels for a non-redundant set of mm10 refGene genes.
|
|
|
Submission date |
Mar 30, 2022 |
Last update date |
Dec 21, 2022 |
Contact name |
Emilia Dimitrova |
E-mail(s) |
emilia.dimitrova@bioch.ox.ac.uk
|
Organization name |
University of Oxford
|
Department |
Biochemistry
|
Lab |
Rob Klose
|
Street address |
South Parks Road
|
City |
Oxford |
ZIP/Postal code |
OX1 3QU |
Country |
United Kingdom |
|
|
Platform ID |
GPL25537 |
Series (2) |
GSE199804 |
A CpG island-encoded mechanism protects genes from premature transcription termination [SET1_TTseq] |
GSE199805 |
A CpG island-encoded mechanism protects genes from premature transcription termination |
|
Relations |
BioSample |
SAMN27108447 |
SRA |
SRX14672094 |
Supplementary data files not provided |
SRA Run Selector |
Raw data are available in SRA |
Processed data are available on Series record |
|
|
|
|
|