|
|
GEO help: Mouse over screen elements for information. |
|
Status |
Public on Feb 19, 2024 |
Title |
C2C12_D0_RNAseq_rep2 |
Sample type |
SRA |
|
|
Source name |
mouse myoblast cells
|
Organism |
Mus musculus |
Characteristics |
treatment: 0 hours after differentiation genotype: wild type
|
Treatment protocol |
HepG2 cells were treated with 500ng/ml ADR or 500μM CoCl2 for 24h before harvesting; 2ug/ml poly (I:C) were transfected into HepG2 with lipo2000 and then the cells were cultured for 12h before harvesting. C2C12 cells grow up to 80% confluence, and then the culture medium was changed to the differentiation medium which is consisted of DMEM with 2% horse serum.
|
Growth protocol |
HepG2 and U-87 MG were cultured in MEM supplemented with 10% FBS, 1% penicillin-streptomycin at 37 °C in a humidified. HEK293T and C2C12 cells were clutured in DMEM supplemented with 10% FBS and 1% penicillin-streptomycin at 37 °C in a humidified atmosphere with 95% air and 5% CO2.
|
Extracted molecule |
total RNA |
Extraction protocol |
RNA was harvested using Trizol reagent. 2ug total RNA was used to prepare the sequencing libraries. RNA libraries were prepared for sequencing using standard Illumina or nanopore protocols
|
|
|
Library strategy |
RNA-Seq |
Library source |
transcriptomic |
Library selection |
cDNA |
Instrument model |
HiSeq X Ten |
|
|
Description |
C2C12, 0 hours after differentiation, biological replicate2 total RNA, remove rRNA Discover the full-length non-capped RNAs of the mammalian transcriptome
|
Data processing |
NGS raw data gained from Illumina HiSeqX10 with 150 bp paired end reads, TGS raw data gained from MinION nanopore with single end reads For Illumina sequencing data, Cutadapt (v2.8) (Martin, 2011) was firstly used to cut the sequencing adapters of the paired-end reads with the following parameters: cutadapt -a AGATCGGAAGAGCACACGTCTG -A AGATCGGAAGAGCGTCGT -m 15 -e 0.15. The NAP-seq specific 5’-adapter (AAGCAGTGGTATCAACGCAGAGT) and 3’-adapter (AGTCGTAGTAAGTCTGTGCTCG) which marked the boundary of the napRNAs were subsequently moved by our programme flClipAdapter with the following parameters: -l 15 -e 0.1 -c 6. Finally, the reads were mapped to the reference genome (hg38 or mm10) with STAR (Dobin et al., 2013) software with the following parameters: --genomeLoad NoSharedMemory --limitBAMsortRAM 60000000000 --alignEndsType EndToEnd --outFilterType BySJout --outFilterMultimapScoreRange 0 --outFilterMultimapNmax 20 --outFilterMismatchNmax 10 --outFilterMismatchNoverLmax 0.05 --outFilterScoreMin 0 --outFilterScoreMinOverLread 0 --outFilterMatchNmin 20 --outFilterMatchNminOverLread 0.8 --seedSearchStartLmax 15 --seedSearchStartLmaxOverLread 1 --alignIntronMin 20 --alignIntronMax 1000000 --alignMatesGapMax 1000000 --alignSJoverhangMin 20 --alignSJDBoverhangMin 10 --outSAMtype BAM Unsorted --outSAMmode Full --outSAMattributes All --outSAMunmapped None --outSAMorder Paired --outSAMprimaryFlag AllBestScore --outSAMreadID Standard --outReadsUnmapped Fastx --limitOutSJcollapsed 5000000 --alignEndsProtrude 150 ConcordantPair --readFilesCommand zcat . For nanopore sequencing data, Cutadapt (v2.8) was used to cut the NAP-seq specific adapters with the following parameters: -j 16 -g AAGCAGTGGTATCAACGCAGAGT -a AGTCGTAGTAAGTCTGTGCTCG -m 20 -e 0.3 -O 10, and then use Cutadapt again to move any possible adapters with the following parameters: -j 16 -g CGAGCACAGACTTACTACGACT -a ACTCTGCGTTGATACCACTGCTT -m 20 -e 0.3 -O 10, and we just retained the read that contained the NAP-seq specific adapters. Finally, thr reads were mapped to the reference genome (hg38) with minimap2 (Li, 2018) with the following parameters: --junc-bed hg38.gencode.v30.geneAnno.bed -t 16 -k15 -w5 --splice -g2000 -G200k -A2 -B4 -O4,96 -E2,0 -C18 -z400,200 -ub --end-bonus=18 --junc-bonus=18 --splice-flank=yes -ub --sam-hit-only --secondary=no -a For SHAPE-MaP data, preprocessing of SHAPE-MaP sequencing data was the same as NAP-seq Illumina sequencing data, including remove the sequencing adapters and NAP-seq specific adapters. Then clean paired-end reads of three biological replicates were merge and mapped to each identified napRNAs. The alignments for individual napRNAs were extracted and processed with shapemapper_mutation_parser, shapemapper_mutation_counter and the python script make_reactivity_profiles.py from ShapeMapper2 to produce the reactivity profiles. The SHAPE reactivity profiles were used to guide RNA structure prediction using RNA RNAalifold. We further manually adjust the RNA structure for some polyA-pocket-ACA-napRNAs to make it more more consistent to the hairpin-pocket-hinge-hairpin structure. For CAP-seq data, Cutadapt (v2.8) (Martin, 2011) was first used with the following parameters to remove the sequencing adapters from the paired-end reads: cutadapt -a TGGAATTCTCGGGTGCCAAGG -A GATCGTCGGACTGTAGAACTCTGAAC -m 3 -e 0.15. Then the reads were mapped to the reference genome (hg38) with STAR (Dobin et al., 2013) software. To identify pronounced high-confidence napRNAs, we firstly assembled the continuous reads to the contigs, and then calculated the numbers of start reads containing specific 5’-adapter (startReadNum) and end reads containing specific 3’-adapter (endReadNum). We reasoned that the startReadNum and endReadNum of a candidate napRNA should be significantly higher over the sequence upstream and downstream, while the coverage of the contigs should also be significantly higher over the regions around. Finally, we developed the computational software napSeeker to calculate the numbers of start reads containing specific 5’-adapter (startReadNum) and end reads containing specific 3’-adapter (endReadNum). We calculated the fold change between startReadNum and numbers of reads containing specific 5’-adapter within 100nt upstream and downstream (startFC), in a similar way, the fold change between endReadNum and the numbers of reads containing specific 3’-adapter upstream and downstream within 100nt (endFC). Next, we calculated the fold change between coverage of the contigs and the regions within 20nt upstream/downstream (up20ntFC/down20ntFC). A high-confidence napRNA had to meet the following criteria: (1) startReadNum, endReadNum ≥ 7; (2) startFold, endFold ≥ 2. (2) up20ntFold, down20ntFold ≥ 2. (4) length≥100. What’s more, the candidate napRNAs had to express in at least 2 samples of all the human (or mouse) samples. Finally, we only retained the napRNA with the summary counts ≥ 20. These stringent parameters allowed us to identify the highest-confidence candidate napRNAs. To identify the highest-confidence candidate 5’-Cap sites, we calculated the 5’-end sites coverage of the sequencing read (endCov), the one nucleotide upstream (upCov) and the one nucleotide downstream (downCov). A positive 5’-Cap sites had to meet the following criteria: (1) upFC = endCov/upCov≥2; (2) downFC = endCov/downCov≥2; (3) read counts≥10; (4) P-value<0.05; (5) existing in two replicates; (6) located within mRNA 5`UTR region. Assembly: hg38, mm10 Supplementary files format and content: bigwig
|
|
|
Submission date |
Mar 24, 2023 |
Last update date |
Feb 19, 2024 |
Contact name |
Jian-Hua Yang |
E-mail(s) |
yangjh7@mail.sysu.edu.cn
|
Phone |
86-20-84112517
|
Organization name |
Sun Yat-sen University
|
Department |
School of Life Sciences
|
Street address |
No. 135, Xingang Xi Road
|
City |
Guangzhou |
ZIP/Postal code |
510275 |
Country |
China |
|
|
Platform ID |
GPL21273 |
Series (1) |
GSE228168 |
Full-length Sequencing of Non-capped RNAs Reveals the Heterogeneity of RNA Processing and Novel Classes of Noncoding RNAs |
|
Relations |
BioSample |
SAMN33905462 |
SRA |
SRX19766630 |
Supplementary file |
Size |
Download |
File type/resource |
GSM7115938_C2C12_D0_RNAseq_rep2.rpm.minus.coverage.bw |
274.2 Mb |
(ftp)(http) |
BW |
GSM7115938_C2C12_D0_RNAseq_rep2.rpm.minus.end.bw |
18.0 Mb |
(ftp)(http) |
BW |
GSM7115938_C2C12_D0_RNAseq_rep2.rpm.minus.start.bw |
20.7 Mb |
(ftp)(http) |
BW |
GSM7115938_C2C12_D0_RNAseq_rep2.rpm.plus.coverage.bw |
281.3 Mb |
(ftp)(http) |
BW |
GSM7115938_C2C12_D0_RNAseq_rep2.rpm.plus.end.bw |
18.2 Mb |
(ftp)(http) |
BW |
GSM7115938_C2C12_D0_RNAseq_rep2.rpm.plus.start.bw |
18.8 Mb |
(ftp)(http) |
BW |
SRA Run Selector |
Raw data are available in SRA |
Processed data provided as supplementary file |
|
|
|
|
|