Identifying gene-gene interactions using penalized tensor regression

Stat Med. 2018 Feb 20;37(4):598-610. doi: 10.1002/sim.7523. Epub 2017 Oct 16.

Abstract

Gene-gene (G×G) interactions have been shown to be critical for the fundamental mechanisms and development of complex diseases beyond main genetic effects. The commonly adopted marginal analysis is limited by considering only a small number of G factors at a time. With the "main effects, interactions" hierarchical constraint, many of the existing joint analysis methods suffer from prohibitively high computational cost. In this study, we propose a new method for identifying important G×G interactions under joint modeling. The proposed method adopts tensor regression to accommodate high data dimensionality and the penalization technique for selection. It naturally accommodates the strong hierarchical structure without imposing additional constraints, making optimization much simpler and faster than in the existing studies. It outperforms multiple alternatives in simulation. The analysis of The Cancer Genome Atlas (TCGA) data on lung cancer and melanoma demonstrates that it can identify markers with important implications and better prediction performance.

Keywords: gene-gene interactions; penalized selection; tensor regression.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adenocarcinoma of Lung / genetics
  • Algorithms
  • Biomarkers, Tumor / genetics
  • Biostatistics
  • Computer Simulation
  • Databases, Genetic
  • Epistasis, Genetic*
  • Humans
  • Linear Models
  • Lung Neoplasms / genetics
  • Melanoma / genetics
  • Models, Genetic*
  • Models, Statistical
  • Polymorphism, Single Nucleotide
  • Proportional Hazards Models
  • Regression Analysis
  • Skin Neoplasms / genetics

Substances

  • Biomarkers, Tumor