Global perspectives on proteins: comparing genomes in terms of folds, pathways and beyond

Pharmacogenomics J. 2001;1(2):115-25. doi: 10.1038/sj.tpj.6500021.

Abstract

The sequencing of complete genomes provides us with a global view of all the proteins in an organism. Proteomic analysis can be done on a purely sequence-based level, with a focus on finding homologues and grouping them into families and clusters of orthologs. However, incorporating protein structure into this analysis provides valuable simplification; it allows one to collect together very distantly related sequences, thus condensing the proteome into a minimal number of 'parts.' We describe issues related to surveying proteomes in terms of structural parts, including methods for fold assignment and formats for comparisons (eg top-10 lists and whole-genome trees), and show how biases in the databases and in sampling can affect these surveys. We illustrate our main points through a case study on the unique protein properties evident in many thermophile genomes (eg more salt bridges). Finally, we discuss metabolic pathways as an even greater simplification of genomes. In comparison to folds these allow the organization of many more genes into coherent systems, yet can nevertheless be understood in many of the same terms.

Publication types

  • Review

MeSH terms

  • Animals
  • Databases, Factual
  • Genome*
  • Genomics*
  • Humans
  • Protein Conformation
  • Protein Folding*
  • Proteins / genetics*

Substances

  • Proteins