NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM4468575 Query DataSets for GSM4468575
Status Public on May 28, 2020
Title URT1D491/3A replicate 2 Dataset#1
Sample type SRA
 
Source name leaves
Organism Nicotiana benthamiana
Characteristics construction: URT1D491/3A
biological replicate: replicate 2
dataset: Dataset#1
reference sequence: GFP|NbPR2(Niben101Scf04869g03002.1)
Treatment protocol Different URT1 constructions were co-inflitrated with a GFP reporter mRNA and P19 (silencing inhibitor) in leaves.
Growth protocol Nicotiana benthamiana infiltrated leaves of plants grown during 4 weeks on soil with 16 hr light (22 °C) /8 hr (18 °C) darkness cycles.
Extracted molecule total RNA
Extraction protocol Total RNA was extracted from flowers with TRI Reagent® (Molecular Research Center) according to manufacturer’s instructions
3'RACEseq protocol is based on the ligation of a primer at the 3’ end of RNA, and the subsequent targeted amplification by PCR of amplicons suitable for Illumina sequencing.
 
Library strategy RNA-Seq
Library source transcriptomic
Library selection RACE
Instrument model Illumina MiSeq
 
Description Processed_data_dataset1.txt
Data processing After initial data processing by the MiSeq Control Software v 2.5. (Illumina), base calls were retrieved and further analysed by a suite of home made python scripts (v2.7) using biopython (v1.63) and regex (v2.4) libraries
Data processing pipeline was adapted from (Sikorska et al., 2017). Reads with low quality bases (=<Q10) within the 15 –base random sequence of the read 2 or within the 30 bases downstream the delimiter sequence, were filtered out
Sequences with identical nucleotides in 15 –base random sequence were deduplicated.
Next, 20 nucleotides sequences corresponding to nucleotides of the transcript were searched into reads 1 to identify the corresponding target mRNAs. One mismatch was tolerated. Matched reads 1 and their corresponding reads 2 were extracted and annotated.
Reads 2 that contain the delimiter sequence were selected and subsequently trimmed from their random and delimiter sequences.
Then, the analysis was divided into two steps.
The aim of the first step was to identify the position of mRNA 3’ extremities and to detect untemplated nucleotides. To do this, the 30 nucleotide sequences downstream of the read 2 delimiter sequence were mapped to the corresponding reference sequence, which goes from the first nucleotide of the transcript that maps the forward PCR2 primer to the end of the mRNA. Up to four mismatches were tolerated, with the exception of the first five nucleotides downstream of the mapping site that had to perfectly map. To map the 3’ end position of reads 2 with untemplated tails, the sequences of the unmatched reads 2 were successively trimmed from their 3’ end, with a one nucleotide trimming step, until they could be mapped to the reference sequence or until a maximum of 30 nucleotide has been removed. For each successfully mapped read 2, untemplated nucleotides at the 3’ end were extracted
The aim of the second step was to analyze long mRNA poly(A) tail. Sequencing of long homopolymeric stretches causes a rapid decrease of sequencing quality, making it impossible to exactly map the 3’ end of mRNA with long poly(A). We thus looked for long T stretches of at least 10 Ts in the read 2 that failed to map the reference sequence. Poly(A) tails were searched with the constraint that it must begin in the first 30 cycles, which means that the maximal length of the added 3’ end modification is limited to 29 nucleotides.
Finally, results from step 1 and 2 were compiled and 3’ extensions were analyzed.
Supplementary_files_format_and_content: One processed data file is given for each of the tree datasets generated in individual Miseq runs. It includes the processed data for each construction and biological replicate. Each line corresponds to one individual reads.
Supplementary_files_format_and_content: For each read, we indicate the read ID (read.ID), the gene AGI (Gene), poly(A) tail sequence (polyA), poly(A) tail length (polyA.size), non-A extension sequence (modification), length of the non-A extension (modification.size), A + non-A tail sequence (extension), A + non-A tail size (extension.size), tag (classification) indicating the category of the tail, the 15N random sequence used for deduplication (random), the biological replicate (rep) and the construction (construction).
 
Submission date Apr 09, 2020
Last update date May 28, 2020
Contact name Dominique Gagliardi
E-mail(s) dominique.gagliardi@ibmp-cnrs.unistra.fr
Organization name CNRS
Department IBMP
Street address 12, rue du General Zimmer
City Strasbourg
ZIP/Postal code 67084
Country France
 
Platform ID GPL22072
Series (2)
GSE148409 Ectopic expression of URT1 remodels poly(A) tail profiles in Nicotiana benthamiana
GSE148449 Molecular connection between the TUTase URT1 and decapping activators
Relations
BioSample SAMN14569606
SRA SRX8092635

Supplementary data files not provided
SRA Run SelectorHelp
Raw data are available in SRA
Processed data are available on Series record

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap