Tree-based analysis of microarray data for classifying breast cancer

Front Biosci. 2002 May 1:7:c63-7. doi: 10.2741/A759.

Abstract

DNA microarray data have provided us with the opportunity to assess the expression levels for thousands of genes simultaneously. One of the uses of this information is to classify cancer tumors. A noted challenge in using microarray information is analytical. Following the work of Zhang et al. (1), we further pursue the use of recursive partitioning in analyses of microarray data for cancer classification. Not only does the recursive partitioning technique create intuitive classification rules, but also it is most flexible as to the handling of a massive number of genes, missing expressions, and multi-class tissues. Using a published data set (2), we demonstrate that the recursive partitioning technique creates a more precise and simpler classification rule than other commonly used approaches. In particular, we introduce the concept of A-tree and propose a procedure to assess a large number of A-trees. One of the identified genes (ERBB2) is in the close region of BRCA1 (17q21.1) and has been shown by others to have altered expression levels in breast cancer. Nonetheless, our identified genes warrant further investigation as to whether they play a role in the etiology of breast cancer.

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Breast Neoplasms / classification
  • Breast Neoplasms / genetics*
  • Gene Expression Profiling*
  • Gene Expression Regulation, Neoplastic
  • Models, Biological
  • Oligonucleotide Array Sequence Analysis / methods*