Sample GSM4152124
Status Public on Jan 07, 2022
Title Av-RNA-Rep1
Sample type SRA
 
Source name whole animal
Organism Adineta vaga
Characteristics strain: MBL
tissue: whole animal
Extracted molecule total RNA
Extraction protocol RNA was harvested using Trizol reagent. PolyA selection.
RNA libraries were prepared for sequencing using standard Illumina protocols
 
Library strategy RNA-Seq
Library source transcriptomic
Library selection cDNA
Instrument model Illumina NextSeq 500
 
Data processing Illumina Casava 1.7 software was used for basecalling.
Sequenced reads were trimmed for adaptor sequence (cutadapt v1.9.2) and masked for short sequences (<15 nt) or low-quality scores (<20), then mapped to the reference genome using TopHat v2.1.1 with default parameters and --max-intron-length 100.
Prediction of protein-coding genes in AvL1 strain: BRAKER, a combination of GeneMark-ET and AUGUSTUS, was used to predict protein-coding genes in the AvL1 genome using aligned RNA-Seq data. TopHat alignments were used to generate UTR training examples so that AUGUSTUS could train UTR parameters and predict genes. This procedure was run with --softmasking enabled, after masking the genome with RepeatMasker. Total predictions comprised 74,569 gene models, saved in FASTA and GFF formats.
Predictions of protein-coding genes in the Av reference genome (MBL) were taken from Flot et al., Nature 2013.
Aligned sequence reads were counted by genomic feature with HTSeq-count, using the TopHat RNA-seq alignment output BAM file and the gene annotation GFF file.
Supplementary_files_format_and_content: read counts
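The trimming, mapping, and counting steps above can be sketched as three command lines. This is a minimal sketch under stated assumptions, not the submitters' actual invocation: the adapter sequence, all file names, and the Bowtie 2 index prefix are hypothetical, and reading the "<15 nt" and "<20" thresholds as cutadapt's `-m 15` and `-q 20` options is an interpretation of the record. Only `--max-intron-length 100` and the tool names come from the record itself. The script builds and prints the commands without running them.

```python
import shlex

# Hypothetical inputs: none of these names or sequences appear in the record.
ADAPTER = "AGATCGGAAGAGC"            # assumed adapter; the record does not state it
RAW_FASTQ = "Av-RNA-Rep1.fastq.gz"
TRIMMED_FASTQ = "Av-RNA-Rep1.trimmed.fastq.gz"
BOWTIE2_INDEX = "Av_MBL_genome"      # index prefix for the reference genome
TOPHAT_OUT = "tophat_Av-RNA-Rep1"
ANNOTATION_GFF = "Av_MBL_genes.gff"


def cutadapt_cmd():
    """Adapter trimming. -q 20 and -m 15 are an assumed mapping of the
    record's thresholds onto cutadapt's quality and minimum-length options."""
    return ["cutadapt", "-a", ADAPTER, "-q", "20", "-m", "15",
            "-o", TRIMMED_FASTQ, RAW_FASTQ]


def tophat_cmd():
    """Spliced alignment with defaults plus the one option the record names."""
    return ["tophat", "--max-intron-length", "100",
            "-o", TOPHAT_OUT, BOWTIE2_INDEX, TRIMMED_FASTQ]


def htseq_count_cmd():
    """Per-feature read counting on TopHat's BAM output. Feature type and ID
    attribute are left at defaults because the record does not state them."""
    bam = f"{TOPHAT_OUT}/accepted_hits.bam"
    return ["htseq-count", "-f", "bam", bam, ANNOTATION_GFF]


def build_pipeline():
    """Return the three commands in the order the record describes."""
    return [cutadapt_cmd(), tophat_cmd(), htseq_count_cmd()]


for cmd in build_pipeline():
    # shlex.join quotes arguments so each line can be pasted into a shell.
    print(shlex.join(cmd))
```

htseq-count writes its counts table to standard output, so in practice the last command would be redirected to a file. With a GFF annotation, the feature type (`-t`) and ID attribute (`-i`) often need to be set to match the file, which this record does not specify.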
 
Submission date Nov 06, 2019
Last update date Jan 07, 2022
Contact name Irina Arkhipova
E-mail(s) iarkhipova@mbl.edu
Organization name Marine Biological Laboratory
Department Josephine Bay Paul Center
Lab Arkhipova Lab
Street address 7 MBL St
City Woods Hole
State/province MA
ZIP/Postal code 02543
Country USA
 
Platform ID GPL27727
Series (2)
GSE140051 Non-canonical base modifications of bacterial origin in a eukaryotic genome [RNA-seq]
GSE140052 Non-canonical base modifications of bacterial origin in a eukaryotic genome
Relations
BioSample SAMN13226618
SRA SRX7107236

Supplementary data files not provided
Raw data are available in SRA
Processed data are available on Series record
