Search for 5'-leader regulatory RNA structures based on gene annotation aided by the RiboGap database

Methods. 2017 Mar 15:117:3-13. doi: 10.1016/j.ymeth.2017.02.009. Epub 2017 Mar 6.

Abstract

The discovery of noncoding RNAs (ncRNAs) and their importance for gene regulation led us to develop bioinformatics tools to pursue the discovery of novel ncRNAs. Finding ncRNAs de novo is challenging, first due to the difficulty of retrieving large numbers of sequences for given gene activities, and second due to exponential demands on calculation needed for comparative genomics on a large scale. Recently, several tools for the prediction of conserved RNA secondary structure were developed, but many of them are not designed to uncover new ncRNAs, or are too slow for conducting analyses on a large scale. Here we present various approaches using the database RiboGap as a primary tool for finding known ncRNAs and for uncovering simple sequence motifs with regulatory roles. This database also can be used to easily extract intergenic sequences of eubacteria and archaea to find conserved RNA structures upstream of given genes. We also show how to extend analysis further to choose the best candidate ncRNAs for experimental validation.

Keywords: Bioinformatics; CsrA; GraphClust; Infernal; Methyltransferase; RNA secondary structure; Rfam; Riboswitch; RsmA; TRAP; ncRNA.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms*
  • Animals
  • Archaea / genetics
  • Bacteria / genetics
  • Base Pairing
  • Base Sequence
  • Computational Biology / methods*
  • Databases, Genetic
  • Humans
  • Molecular Sequence Annotation
  • Nucleic Acid Conformation
  • RNA, Untranslated / chemistry
  • RNA, Untranslated / classification
  • RNA, Untranslated / genetics*
  • Riboswitch
  • Sequence Alignment
  • Sequence Analysis, RNA / methods*

Substances

  • RNA, Untranslated
  • Riboswitch