Bayesian hierarchical modeling and selection of differentially expressed genes for the EST data

Biometrics. 2011 Mar;67(1):142-50. doi: 10.1111/j.1541-0420.2010.01447.x.

Abstract

Expressed sequence tag (EST) sequencing is a one-pass sequencing reading of cloned cDNAs derived from a certain tissue. The frequency of unique tags among different unbiased cDNA libraries is used to infer the relative expression level of each tag. In this article, we propose a hierarchical multinomial model with a nonlinear Dirichlet prior for the EST data with multiple libraries and multiple types of tissues. A novel hierarchical prior is developed and the properties of the proposed prior are examined. An efficient Markov chain Monte Carlo algorithm is developed for carrying out the posterior computation. We also propose a new selection criterion for detecting which genes are differentially expressed between two tissue types. Our new method with the new gene selection criterion is demonstrated via several simulations to have low false negative and false positive rates. A real EST data set is used to motivate and illustrate the proposed method.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms*
  • Bayes Theorem*
  • Biometry / methods
  • Computer Simulation
  • Data Interpretation, Statistical*
  • Escherichia coli Proteins / genetics*
  • Expressed Sequence Tags*
  • Gene Expression Profiling / methods*
  • Gene Library
  • Models, Statistical*
  • Pattern Recognition, Automated / methods

Substances

  • Escherichia coli Proteins