Analysis of cancer gene expression data with an assisted robust marker identification approach

Genet Epidemiol. 2017 Dec;41(8):779-789. doi: 10.1002/gepi.22066. Epub 2017 Sep 14.

Abstract

Gene expression (GE) studies have been playing a critical role in cancer research. Despite tremendous effort, the analysis results are still often unsatisfactory, because of the weak signals and high data dimensionality. Analysis is often further challenged by the long-tailed distributions of the outcome variables. In recent multidimensional studies, data have been collected on GEs as well as their regulators (e.g., copy number alterations (CNAs), methylation, and microRNAs), which can provide additional information on the associations between GEs and cancer outcomes. In this study, we develop an ARMI (assisted robust marker identification) approach for analyzing cancer studies with measurements on GEs as well as regulators. The proposed approach borrows information from regulators and can be more effective than analyzing GE data alone. A robust objective function is adopted to accommodate long-tailed distributions. Marker identification is effectively realized using penalization. The proposed approach has an intuitive formulation and is computationally much affordable. Simulation shows its satisfactory performance under a variety of settings. TCGA (The Cancer Genome Atlas) data on melanoma and lung cancer are analyzed, which leads to biologically plausible marker identification and superior prediction.

Keywords: assisted analysis; cancer; gene expression; robustness.

MeSH terms

  • Biomarkers, Tumor / genetics*
  • Biomarkers, Tumor / metabolism
  • Gene Expression Regulation, Neoplastic
  • Genes, Neoplasm
  • Humans
  • Melanoma / genetics
  • Melanoma / metabolism
  • Melanoma / pathology
  • Models, Genetic*
  • Neoplasms / genetics*
  • Neoplasms / metabolism
  • Neoplasms / pathology
  • Phenotype
  • Skin Neoplasms / genetics
  • Skin Neoplasms / metabolism
  • Skin Neoplasms / pathology

Substances

  • Biomarkers, Tumor