NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Series GSE71576 Query DataSets for GSE71576
Status Public on Aug 04, 2015
Title Integration analysis of three omics data using penalized regression methods: An application to bladder cancer (expression)
Organism Homo sapiens
Experiment type Expression profiling by array
Summary Omics data integration is becoming necessary to investigate the still unknown genomic mechanisms of complex diseases. During the integration process, many challenges arise such as data heterogeneity, the smaller number of individuals in comparison to the number of parameters, multicollinearity, and interpretation and validation of results due to their complexity and lack of knowledge about biological mechanisms. To overcome some of these issues, innovative statistical approaches are being developed. In this work, we applied penalized regression methods (LASSO and ENET) to explore relationships between common genetic variants, DNA methylation and gene expression measured in bladder tumor samples and have proposed a permutation-based method to concomitantly assess significance and correct by multiple testing with the MaxT algorithm. The overall analysis flow consisted of three steps: (1) SNPs/CpGs were selected per each gene probe within 1Mb window upstream and downstream the gene; (2) LASSO and ENET were applied to assess the association between each expression probe and the selected SNPs/CpGs in three multivariable models (SNP, CPG, and Global models, the latter integrating SNPs and CPGs); and (3) the significance of each model was assessed using the permutation-based MaxT method. We identified 48 genes whom expression levels were associated with both SNPs and GPGs. Importantly, we replicated results for 36 (75%) of them in an independent data set (TCGA). We checked the performance of the proposed method with a simulation study and further supported our results with a biological interpretation based on an enrichment analysis. The approach we propose allows reducing computational time and is flexibly and easy to implement when analyzing several omics data. Our results highlight the importance of integrating omics data by applying appropriate statistical strategies to discover new insights into the complexity of disease genetic mechanisms.
 
Overall design Gene expression data were obtained from 44 tumor samples using the Affymetrix DNA Microarray Human Gene 1.0 ST Array with 32,321 probes.
Web link http://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1005689
 
Contributor(s) Pineda S, Real FX, Kogevinas M, Carrato A, Chanock SJ, Malats N, Van Steen K
Citation(s) 26646822
Submission date Jul 30, 2015
Last update date Jul 26, 2018
Contact name Nuria Malats
E-mail(s) nmalats@cnio.es
Organization name CNIO
Street address C/ Melchor Fernández Almagro 3
City Madrid
State/province Madrid
ZIP/Postal code 28029
Country Spain
 
Platforms (1)
GPL6244 [HuGene-1_0-st] Affymetrix Human Gene 1.0 ST Array [transcript (gene) version]
Samples (44)
GSM1838823 10091310
GSM1838824 10094010
GSM1838825 30110214
This SubSeries is part of SuperSeries:
GSE71669 Integration analysis of three omics data using penalized regression methods: An application to bladder cancer
Relations
BioProject PRJNA291765

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE71576_RAW.tar 204.3 Mb (http)(custom) TAR (of CEL)
Processed data included within Sample table

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap