MOTIPS: automated motif analysis for predicting targets of modular protein domains

Hugo Y K Lam; Philip M Kim; Janine Mok; Raffi Tonikian; Sachdev S Sidhu; Benjamin E Turk; Michael Snyder; Mark B Gerstein

doi:10.1186/1471-2105-11-243

BMC Bioinformatics. 2010 May 11;11:243. doi: 10.1186/1471-2105-11-243.

Authors

Hugo Y K Lam¹, Philip M Kim, Janine Mok, Raffi Tonikian, Sachdev S Sidhu, Benjamin E Turk, Michael Snyder, Mark B Gerstein

Affiliation

¹ Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA.

Abstract

Background: Many protein interactions, especially those involved in signaling, are mediated by short linear motifs of 5-10 amino acid residues that bind modular protein domains such as SH3 binding domains and kinase catalytic domains. One straightforward way to identify these interactions is to scan every sequence in a target proteome for matches to the motif. However, predicting domain targets from motif sequence alone, without considering other genomic and structural information, has been shown to lack accuracy.
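The sequence-only scan described above can be sketched in a few lines. This is an illustration, not the MOTIPS implementation: the pattern `[RK]xxPxxP` is a commonly cited consensus for class I SH3 ligands and is used here only as an example, and the function name `scan_proteome` and the toy sequences are hypothetical.

```python
import re

# Illustrative consensus only (assumption): class I SH3 ligands are often
# summarized as [RK]xxPxxP. The lookahead lets overlapping matches be reported.
MOTIF = re.compile(r"(?=([RK]..P..P))")

def scan_proteome(proteome, pattern=MOTIF):
    """Return (protein_id, start, peptide) for every motif match in every sequence."""
    hits = []
    for protein_id, sequence in proteome.items():
        for match in pattern.finditer(sequence):
            hits.append((protein_id, match.start(1), match.group(1)))
    return hits

# Toy sequences invented for this example; they are not real proteins.
toy_proteome = {
    "protA": "MSTRPLPPLPGGA",
    "protB": "MAAAGGGSSS",
    "protC": "KAAPAAPKLLPQQP",
}

hits = scan_proteome(toy_proteome)
for protein_id, start, peptide in hits:
    print(protein_id, start, peptide)
```

A scan like this reports every syntactic match, which is why sequence matching alone tends to produce many false positives and motivates adding further evidence to each hit.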

Results: We developed an efficient search algorithm that scans the target proteome for potential domain targets and increases the accuracy of each hit by integrating a variety of pre-computed features, such as conservation, surface propensity, and disorder. The features are integrated with a naïve Bayes classifier trained on a set of validated experiments.
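As a rough illustration of how naïve Bayes can combine per-hit features, the sketch below fits a Gaussian naïve Bayes model in plain Python. It is a minimal sketch under stated assumptions, not the authors' code: the Gaussian likelihood, the function names `fit_gaussian_nb` and `posterior`, the feature order (conservation, surface propensity, disorder), and all numeric values are invented for the example.

```python
import math

def fit_gaussian_nb(rows, labels):
    """Estimate per-class prior, feature means, and feature variances."""
    model = {}
    for cls in set(labels):
        members = [r for r, y in zip(rows, labels) if y == cls]
        n = len(members)
        columns = list(zip(*members))
        means = [sum(col) / n for col in columns]
        # A small floor keeps variances positive on tiny training sets.
        variances = [
            sum((v - mu) ** 2 for v in col) / n + 1e-6
            for col, mu in zip(columns, means)
        ]
        model[cls] = (n / len(rows), means, variances)
    return model

def posterior(model, row):
    """Return P(class | features), treating features as independent given the class."""
    log_scores = {}
    for cls, (prior, means, variances) in model.items():
        score = math.log(prior)
        for x, mu, var in zip(row, means, variances):
            score += -0.5 * math.log(2 * math.pi * var) - (x - mu) ** 2 / (2 * var)
        log_scores[cls] = score
    # Normalize in log space to avoid underflow.
    top = max(log_scores.values())
    exp_scores = {c: math.exp(s - top) for c, s in log_scores.items()}
    total = sum(exp_scores.values())
    return {c: v / total for c, v in exp_scores.items()}

# Made-up training rows: [conservation, surface propensity, disorder].
# Label 1 = validated target, 0 = non-target. Values are illustrative only.
train = [
    [0.90, 0.80, 0.70], [0.80, 0.90, 0.80], [0.85, 0.70, 0.90],
    [0.20, 0.30, 0.10], [0.30, 0.20, 0.20], [0.10, 0.40, 0.30],
]
labels = [1, 1, 1, 0, 0, 0]

model = fit_gaussian_nb(train, labels)
p_hit = posterior(model, [0.8, 0.8, 0.8])[1]
p_miss = posterior(model, [0.2, 0.3, 0.2])[1]
print(p_hit > p_miss)
```

The independence assumption is what makes the model "naïve": each feature contributes its own likelihood term, so new pre-computed features can be added without refitting the joint distribution.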

Conclusions: By integrating a variety of biologically relevant features, we demonstrated notably improved prediction of modular protein domain targets. Combined with emerging high-resolution data on domain specificities, we believe that our approach can assist in the reconstruction of many signaling pathways.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Amino Acid Motifs
Binding Sites
Models, Molecular
Protein Conformation
Protein Structure, Tertiary*
Proteins / chemistry*
Proteins / metabolism
Proteome / chemistry
Proteome / metabolism
Proteomics / methods*
Software*

Substances

Proteins
Proteome
