Classification of breast cancer using genetic algorithms and tissue microarrays

Clin Cancer Res. 2006 Nov 1;12(21):6459-68. doi: 10.1158/1078-0432.CCR-06-1383.

Abstract

Purpose: A multitude of breast cancer mRNA profiling studies has stratified breast cancer and defined gene sets that correlate with outcome. However, the number of genes used to predict patient outcome or define tumor subtypes by RNA expression studies is variable, nonoverlapping, and generally requires specialized technologies that are beyond those used in the routine pathology laboratory. It would be ideal if the familiarity and streamlined nature of immunohistochemistry could be combined with the rigorously quantitative and highly specific properties of nucleic acid-based analysis to predict patient outcome.

Experimental design: We have used AQUA-based objective quantitative analysis of tissue microarrays toward the goal of discovery of a minimal number of markers with maximal prognostic or predictive value that can be applied to the conventional formalin-fixed, paraffin-embedded tissue section.

Results: The minimal discovered multiplexed set of tissue biomarkers was GATA3, NAT1, and estrogen receptor. Genetic algorithms were then applied after division of our cohort into a training set of 223 breast cancer patients to discover a prospectively applicable solution that can define a subset of patients with 5-year survival of 96%. This algorithm was then validated on an internal validation set (n=223, 5-year survival=95.8%) and further validated on an independent cohort from Sweden, which showed 5-year survival of 92.7% (n=149).

Conclusions: With further validation, this test has both the familiarity and specificity for widespread use in management of breast cancer. More generally, this work illustrates the potential for multiplexed biomarker discovery on the tissue microarray platform.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Validation Study

MeSH terms

  • Algorithms*
  • Biomarkers, Tumor / analysis*
  • Breast Neoplasms / classification*
  • Breast Neoplasms / metabolism
  • Breast Neoplasms / mortality
  • Female
  • Fluorescent Antibody Technique
  • Formaldehyde
  • Gene Expression
  • Humans
  • Paraffin Embedding
  • Predictive Value of Tests
  • Prognosis
  • Survival Analysis
  • Tissue Array Analysis*
  • Tissue Fixation

Substances

  • Biomarkers, Tumor
  • Formaldehyde