Identification of non-Hodgkin's lymphoma prognosis signatures using the CTGDR method

Bioinformatics. 2010 Jan 1;26(1):15-21. doi: 10.1093/bioinformatics/btp604. Epub 2009 Oct 22.

Abstract

Motivation: Although NHL (non-Hodgkin's lymphoma) is the fifth leading cause of cancer incidence and mortality in the USA, it remains poorly understood and is largely incurable. Biomedical studies have shown that genomic variations, measured with SNPs (single nucleotide polymorphisms) in genes, may have independent predictive power for disease-free survival in NHL patients beyond clinical measurements.

Results: We apply the CTGDR (clustering threshold gradient directed regularization) method to genetic association studies using SNPs, analyze data from an association study of NHL and identify prognosis signatures to diffuse large B cell lymphoma (DLBCL) and follicular lymphoma (FL), the two most common subtypes of NHL. With the CTGDR method, we are able to account for the joint effects of multiple genes/SNPs, whereas most existing studies are single-marker based. In addition, we are able to account for the 'gene and SNP-within-gene' hierarchical structure and identify not only predictive genes but also predictive SNPs within identified genes. In contrast, existing studies are limited to either gene or SNP identification, but not both. We propose using resampling methods to evaluate the predictive power and reproducibility of identified genes and SNPs. Simulation study and data analysis suggest satisfactory performance of the CTGDR method.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Cluster Analysis
  • DNA Mutational Analysis / methods*
  • DNA, Neoplasm / genetics*
  • Genetic Predisposition to Disease / genetics*
  • Humans
  • Lymphoma, Non-Hodgkin / diagnosis*
  • Lymphoma, Non-Hodgkin / genetics*
  • Polymorphism, Single Nucleotide / genetics*
  • Prognosis
  • Reproducibility of Results
  • Risk Assessment / methods
  • Risk Factors
  • Sensitivity and Specificity

Substances

  • DNA, Neoplasm