Prediction of dual protein targeting to plant organelles

New Phytol. 2009;183(1):224-236. doi: 10.1111/j.1469-8137.2009.02832.x.

Abstract

* Dual targeting of proteins to more than one subcellular localization has been found in animals, in fungi and in plants. In the latter, ambiguous N-terminal targeting signals have been described that result in the protein being located in both mitochondria and plastids. We have developed ambiguous targeting predictor (ATP), a machine-learning implementation that classifies such ambiguous targeting signals. * Ambiguous targeting predictor is based on a support vector machine implementation that makes use of 12 different amino acid features. Prediction results were validated using fluorescent protein fusion. * Both in silico and in vivo evaluations demonstrate that ambiguous targeting predictor is useful for predicting dual targeting to mitochondria and plastids. Proteins that are targeted to both organelles by tandemly arrayed signals (so-called twin targeting) can be predicted by both ambiguous targeting predictor and a combination of single targeting prediction tools. Comparison of ambiguous targeting predictor with previous experimental approaches, as well as in silico approaches, shows good congruence. * Based on the prediction results, land plant genomes are expected to encode, on average, > 400 proteins that are located in mitochondria and plastids. Ambiguous targeting predictor is helpful for functional genome annotation and can be used as a tool to further our understanding about dual protein targeting and its evolution.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Artificial Intelligence*
  • Bryopsida / metabolism
  • Chloroplasts / metabolism*
  • Computer Simulation*
  • Genome, Plant*
  • Green Fluorescent Proteins
  • Mitochondria / metabolism*
  • Plant Proteins / metabolism*
  • Plants / genetics
  • Plants / metabolism*
  • ROC Curve
  • Signal Transduction

Substances

  • Plant Proteins
  • Green Fluorescent Proteins