NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM5400803 Query DataSets for GSM5400803
Status Public on Jun 26, 2021
Title LCC_Omen
Sample type SRA
 
Source name Clinical sample
Organism Homo sapiens
Characteristics tissue: CaOv tumor
Extracted molecule total RNA
Extraction protocol RNeasy® PowerLyzer® Tissue & Cells Kit was used for the isolation of total RNA from ovarian tumor tissues according to the manufacturer's protocol.
RNA libraries were prepared for sequencing using standard Illumina protocols.
 
Library strategy RNA-Seq
Library source transcriptomic
Library selection cDNA
Instrument model Illumina HiSeq 2000
 
Data processing Illumina Casava1.7 software used for basecalling.
Raw data (raw reads) of fastq format were processed with fastp, an ultra-fast FASTQ preprocessor with useful quality. Clean data (clean reads) were obtained after quality control, adapter trimming, quality filtering and per-read quality cutting. All the downstream analyses were based on the clean data with high quality.
Reference genome and gene model annotation files were downloaded from genome website directly. Paired-end clean reads were aligned to the reference genome using HISAT2 v2.1.0 (hierarchical indexing for spliced alignment of transcripts), which is a highly efficient system for aligning reads from RNA sequencing experiments. HISAT2 uses an indexing scheme based on the Burrows-Wheeler transform and the Ferragina-Manzini (FM) index, employing two types of indexes for alignment: a whole_x0002_genome FM index to anchor each alignment and numerous local FM indexes for very rapid extensions of these alignments. HISAT2 is the fastest system currently available, with equal or better accuracy than any other method. HISAT2 was run with the default parameters.
Featurecount was used to count the reads numbers mapped to each gene. And then FPKM of each gene was calculated based on the length of the gene and reads count mapped to this gene. FPKM, expected number of Fragments Per Kilobase of transcript sequence per Millions base pairs sequenced, considers the effect of sequencing depth and gene length for the reads count at the same time, and is currently the most commonly used method for estimating gene expression levels (Trapnell, Cole, et al., 2010).
Differential expression analysis of two conditions/groups (two biological replicates per condition) was performed using the DESeq R package (1.18.1). DESeq provide statistical routines for determining differential expression in digital gene expression data using a model based on the negative binomial distribution. The resulting P-values were adjusted using the Benjamini and Hochberg’s approach for controlling the false discovery rate. Genes with |log2(FoldChange)| > 1 and adjusted P_x0002_value
Genome_build: Homo sapiens reference genome (GRCh38)
Supplementary_files_format_and_content: Matrix table with raw gene counts for every gene and every sample.
 
Submission date Jun 25, 2021
Last update date Jun 26, 2021
Contact name Mingo Ming Ho YUNG
E-mail(s) h1094157@connect.hku.hk
Organization name HKU
Department O&G
Street address 21 Sassoon
City Hong Kong
ZIP/Postal code 00
Country Hong Kong
 
Platform ID GPL11154
Series (1)
GSE178913 Transcriptomic analysis of ovarian tumor pairs from primary and metastatic lesions
Relations
SRA SRX11230185
BioSample SAMN19896459

Supplementary data files not provided
SRA Run SelectorHelp
Raw data are available in SRA
Processed data are available on Series record

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap