NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM6173954 Query DataSets for GSM6173954
Status Public on May 10, 2023
Title heads_Isoseq_1
Sample type SRA
 
Source name heads
Organism Drosophila melanogaster
Characteristics tissue: heads
developmental stage: adults
genotype: wild type (w1118)
Treatment protocol untreated
Growth protocol Flies were grown in 25C on standard apple juice based food
Extracted molecule total RNA
Extraction protocol 3-day-old w1118 flies were collected and flash-frozen in liquid nitrogen and heads were used for RNA extraction.
Iso-seq libraries were prepared using 0.5µg total RNA from 3-day-old w1118 fly heads, processed with the Iso-seq express 2.0 workflow (PacBio) with 14 cycles of PCR amplification and size selection with the BluePippin system for transcripts larger than 3kb according to the manufacturer’s protocol. After SMRTbell adapter addition, libraries were sequenced on 3 SMRTcells on a Sequel I PacBio sequencer. The raw data files were processed with SMRT Link v8 software to generate CCS fastq files. Data analysis was performed using the Iso-seq3 pipeline to generate consensus reads.
Iso-seq express 2.0
 
Library strategy OTHER
Library source transcriptomic
Library selection other
Instrument model Sequel
 
Data processing CIA assembly workflow: FLAIR (Tang et al., 2020) was used for transcriptome assembly. Samples were pooled to perform the assembly. For FLAIR collapse, the Eukaryotic Promoter Database EPD (Dreos et al., 2017) was provided to retain only reads with a supported TSS at their 5' end, using --max_ends 5 to allow for multiple 5’-3’ end identification. A minimum of three full-length reads were required for an isoform to be collapsed in the assembly. Assemblies were filtered for 3' ends that likely originated from internal priming or truncation during library preparation. We used FLAM-seq and RNA Direct data, as both of these methods allow for poly(A) tail detection, to perform poly(A) tail calling. Only reads containing a poly(A) tail were retained, and were trimmed to a single nucleotide preceding the poly(A) tail. Single nucleotide reads were clustered in 20-nt windows; clusters supported by at least two reads were included in the PAS database. The following filtering parameters were considered: 1) All isoforms overlapping a 3' end in a window of 100 nt were retained, 2) 3' ends found in the assembly more distal than the 3' ends found in the database were retained only if they were within the reference annotation and contained a AATAAA signal. For 3' end Correction: 1) 3’ UTR bins were created using the PAS database, starting from the end of the open reading frame, between each consecutive PAS, to the most distal PAS. Isoform 3’ ends falling within the last bin of the 3’ UTR (between the two distal-most PASs) were corrected to the most distal bin, provided the isoform covered more than 10% of the last bin.
Assembly: D. melanogaster Assembly: dm6_ensembl ; D. melanogaster Annotation: ensembl release 96
Supplementary files format and content: Processed gtf (General Transfer Format) output of FLAIR assembly: each row represents an exon. Columns are: chromosome name, source, feature type, start, end, score, strand. In the 9th column metadata assigned to each exon: gene_id, transcript_id, exon number
Library strategy: Iso-seq
 
Submission date May 20, 2022
Last update date May 10, 2023
Contact name Valérie Hilgers
Organization name MPI of Immunobiology and Epigenetics
Street address Stübeweg 51
City Freiburg
ZIP/Postal code 79108
Country Germany
 
Platform ID GPL28354
Series (2)
GSE203497 Sites of transcription initiation drive mRNA isoform selection [PacBio]
GSE203583 Sites of transcription initiation drive mRNA isoform selection
Relations
BioSample SAMN28572102
SRA SRX15403484

Supplementary data files not provided
SRA Run SelectorHelp
Raw data are available in SRA
Processed data are available on Series record

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap