Evaluating phylogenetic informativeness as a predictor of phylogenetic signal for metazoan, fungal, and mammalian phylogenomic data sets

Biomed Res Int. 2013:2013:621604. doi: 10.1155/2013/621604. Epub 2013 Jun 26.

Abstract

Phylogenetic research is often stymied by selection of a marker that leads to poor phylogenetic resolution despite considerable cost and effort. Profiles of phylogenetic informativeness provide a quantitative measure for prioritizing gene sampling to resolve branching order in a particular epoch. To evaluate the utility of these profiles, we analyzed phylogenomic data sets from metazoans, fungi, and mammals, thus encompassing diverse time scales and taxonomic groups. We also evaluated the utility of profiles created based on simulated data sets. We found that genes selected via their informativeness dramatically outperformed haphazard sampling of markers. Furthermore, our analyses demonstrate that the original phylogenetic informativeness method can be extended to trees with more than four taxa. Thus, although the method currently predicts phylogenetic signal without specifically accounting for the misleading effects of stochastic noise, it is robust to the effects of homoplasy. The phylogenetic informativeness rankings obtained will allow other researchers to select advantageous genes for future studies within these clades, maximizing return on effort and investment. Genes identified might also yield efficient experimental designs for phylogenetic inference for many sister clades and outgroup taxa that are closely related to the diverse groups of organisms analyzed.

MeSH terms

  • Animals
  • Chromosome Mapping / methods*
  • Computer Simulation
  • Conserved Sequence
  • Databases, Genetic*
  • Evolution, Molecular*
  • Fungi / genetics
  • Genetic Markers / genetics*
  • Genome / genetics*
  • Mammals
  • Models, Genetic*
  • Phylogeny*
  • Sequence Analysis, DNA / methods

Substances

  • Genetic Markers