Amino acid substitution matrices from protein blocks

Proc Natl Acad Sci U S A. 1992 Nov 15;89(22):10915-9. doi: 10.1073/pnas.89.22.10915.

Abstract

Methods for alignment of protein sequences typically measure similarity by using a substitution matrix with scores for all possible exchanges of one amino acid with another. The most widely used matrices are based on the Dayhoff model of evolutionary rates. Using a different approach, we have derived substitution matrices from about 2000 blocks of aligned sequence segments characterizing more than 500 groups of related proteins. This led to marked improvements in alignments and in searches using queries from each of the groups.

Publication types

  • Comparative Study
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Algorithms
  • Amino Acid Sequence*
  • Animals
  • Caenorhabditis elegans / genetics
  • Drosophila / genetics
  • Lod Score
  • Mathematics
  • Molecular Sequence Data
  • Probability
  • Proteins / chemistry
  • Proteins / genetics*
  • Sequence Homology, Amino Acid*
  • Software*

Substances

  • Proteins