Joint modeling of genetically correlated diseases and functional annotations increases accuracy of polygenic risk prediction

PLoS Genet. 2017 Jun 9;13(6):e1006836. doi: 10.1371/journal.pgen.1006836. eCollection 2017 Jun.

Abstract

Accurate prediction of disease risk based on genetic factors is an important goal in human genetics research and precision medicine. Advanced prediction models will lead to more effective disease prevention and treatment strategies. Despite the identification of thousands of disease-associated genetic variants through genome-wide association studies (GWAS) in the past decade, accuracy of genetic risk prediction remains moderate for most diseases, which is largely due to the challenges in both identifying all the functionally relevant variants and accurately estimating their effect sizes. In this work, we introduce PleioPred, a principled framework that leverages pleiotropy and functional annotations in genetic risk prediction for complex diseases. PleioPred uses GWAS summary statistics as its input, and jointly models multiple genetically correlated diseases and a variety of external information including linkage disequilibrium and diverse functional annotations to increase the accuracy of risk prediction. Through comprehensive simulations and real data analyses on Crohn's disease, celiac disease and type-II diabetes, we demonstrate that our approach can substantially increase the accuracy of polygenic risk prediction and risk population stratification, i.e. PleioPred can significantly better separate type-II diabetes patients with early and late onset ages, illustrating its potential clinical application. Furthermore, we show that the increment in prediction accuracy is significantly correlated with the genetic correlation between the predicted and jointly modeled diseases.

MeSH terms

  • Genetic Diseases, Inborn*
  • Genome-Wide Association Study*
  • Genotype
  • Humans
  • Linkage Disequilibrium
  • Models, Genetic*
  • Multifactorial Inheritance / genetics*
  • Phenotype
  • Polymorphism, Single Nucleotide / genetics
  • Risk Factors