Analysis of genome-wide association studies with multiple outcomes using penalization

PLoS One. 2012;7(12):e51198. doi: 10.1371/journal.pone.0051198. Epub 2012 Dec 14.

Abstract

Genome-wide association studies have been extensively conducted, searching for markers for biologically meaningful outcomes and phenotypes. Penalization methods have been adopted in the analysis of the joint effects of a large number of SNPs (single nucleotide polymorphisms) and marker identification. This study is partly motivated by the analysis of heterogeneous stock mice dataset, in which multiple correlated phenotypes and a large number of SNPs are available. Existing penalization methods designed to analyze a single response variable cannot accommodate the correlation among multiple response variables. With multiple response variables sharing the same set of markers, joint modeling is first employed to accommodate the correlation. The group Lasso approach is adopted to select markers associated with all the outcome variables. An efficient computational algorithm is developed. Simulation study and analysis of the heterogeneous stock mice dataset show that the proposed method can outperform existing penalization methods.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Animals
  • Computer Simulation
  • Databases, Genetic
  • Genetic Markers
  • Genetic Predisposition to Disease
  • Genetic Techniques
  • Genome-Wide Association Study*
  • Humans
  • Linear Models
  • Mice
  • Models, Genetic
  • Models, Statistical
  • Multivariate Analysis
  • Phenotype
  • Regression Analysis
  • Risk Factors

Substances

  • Genetic Markers