Regularized gene selection in cancer microarray meta-analysis

BMC Bioinformatics. 2009 Jan 1:10:1. doi: 10.1186/1471-2105-10-1.

Abstract

Background: In cancer studies, it is common that multiple microarray experiments are conducted to measure the same clinical outcome and expressions of the same set of genes. An important goal of such experiments is to identify a subset of genes that can potentially serve as predictive markers for cancer development and progression. Analyses of individual experiments may lead to unreliable gene selection results because of the small sample sizes. Meta analysis can be used to pool multiple experiments, increase statistical power, and achieve more reliable gene selection. The meta analysis of cancer microarray data is challenging because of the high dimensionality of gene expressions and the differences in experimental settings amongst different experiments.

Results: We propose a Meta Threshold Gradient Descent Regularization (MTGDR) approach for gene selection in the meta analysis of cancer microarray data. The MTGDR has many advantages over existing approaches. It allows different experiments to have different experimental settings. It can account for the joint effects of multiple genes on cancer, and it can select the same set of cancer-associated genes across multiple experiments. Simulation studies and analyses of multiple pancreatic and liver cancer experiments demonstrate the superior performance of the MTGDR.

Conclusion: The MTGDR provides an effective way of analyzing multiple cancer microarray studies and selecting reliable cancer-associated genes.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Bayes Theorem
  • Biomarkers, Tumor
  • Cluster Analysis
  • Computational Biology / methods*
  • Computer Simulation
  • Female
  • Gene Expression Profiling / methods
  • Gene Expression Regulation, Neoplastic*
  • Humans
  • Liver Neoplasms / genetics
  • Liver Neoplasms / metabolism
  • Male
  • Neoplasms / genetics*
  • Neoplasms / metabolism*
  • Oligonucleotide Array Sequence Analysis / methods*
  • Pancreatic Neoplasms / genetics
  • Pancreatic Neoplasms / metabolism

Substances

  • Biomarkers, Tumor