Algorithms for challenging motif problems

J Bioinform Comput Biol. 2006 Feb;4(1):43-58. doi: 10.1142/s0219720006001692.

Abstract

Pevzner and Sze(19) have introduced the Planted (l,d)-Motif Problem to find similar patterns (motifs) in sequences which represent the promoter regions of co-regulated genes, where l is the length of the motif and d is the maximum Hamming distance around the similar patterns. Many algorithms have been developed to solve this motif problem. However, these algorithms either have long running times or do not guarantee the motif can be found. In this paper, we introduce new algorithms to solve this motif problem. Our algorithms can find motifs in reasonable time for not only the challenging (9, 2), (11, 3), (15, 5)-motif problems but for even longer motifs, say (20, 7), (30, 11) and (40, 15), which have never been seriously attempted by other researchers because of the large time and space required. Besides, our algorithms can be extended to find more complicated motifs structure called cis-regulatory modules (CRM).

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Binding Sites / genetics
  • Computational Biology
  • DNA / genetics*
  • DNA / metabolism*
  • Promoter Regions, Genetic
  • Transcription Factors / metabolism

Substances

  • Transcription Factors
  • DNA