Memory efficient algorithms for structural alignment of RNAs with pseudoknots

IEEE/ACM Trans Comput Biol Bioinform. 2012 Jan-Feb;9(1):161-8. doi: 10.1109/TCBB.2011.66. Epub 2011 Mar 30.

Abstract

In this paper, we consider the problem of structural alignment of a target RNA sequence of length n and a query RNA sequence of length m with known secondary structure that may contain simple pseudoknots or embedded simple pseudoknots. The best known algorithm for solving this problem runs in O(mn3) time for simple pseudoknot or O(mn4) time for embedded simple pseudoknot with space complexity of O(mn3) for both structures, which require too much memory making it infeasible for comparing noncoding RNAs (ncRNAs) with length several hundreds or more. We propose memory efficient algorithms to solve the same problem. We reduce the space complexity to O(n3) for simple pseudoknot and O(mn2 + n3) for embedded simple pseudoknot while maintaining the same time complexity. We also show how to modify our algorithm to handle a restricted class of recursive simple pseudoknot which is found abundant in real data with space complexity of O(mn2 + n3) and time complexity of O(mn4). Experimental results show that our algorithms are feasible for comparing ncRNAs of length more than 500.

MeSH terms

  • Algorithms*
  • Nucleic Acid Conformation*
  • RNA, Untranslated / chemistry*
  • Sequence Alignment / methods*
  • Sequence Analysis, RNA / methods*

Substances

  • RNA, Untranslated