Alns: a new searchable and filterable sequence alignment format

Int J Data Min Bioinform. 2013;7(2):135-45. doi: 10.1504/ijdmb.2013.053193.

Abstract

Nucleotides and amino acids are the basic building units of RNA, DNA and protein. Although studies of how changes in these building blocks affect the phenotypes of these biopolymers are ever increasing, many popular alignment formats are still those generated by pair-wise comparison tools such as the Basic Local Alignment Search Tool (BLAST). These alignments are easy for researchers to read but are not convenient for searching, filtering and storage, particularly when thousands of alignments are generated from highly conserved sequences. Here, we introduce a new alignment format, alns, to facilitate rapid and convenient association of genetic changes and similarity with other sources of information such as phenotype, disease state, time, geography and taxonomy via simple spreadsheet functions. The format should assist biologists from a wide range of disciplines in knowledge discovery.
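
The abstract does not describe the actual layout of an alns file, so the following is only a hypothetical illustration of the general idea it motivates: flattening a pair-wise alignment into one row per difference, so that the result can be searched and filtered like a spreadsheet. The function names, the column names (ref_pos, ref, query, type) and the example sequences are all assumptions made for this sketch and are not taken from the alns specification.

```python
import csv
import io


def alignment_differences(ref_aligned, query_aligned, ref_start=1):
    """Return one row per alignment column where reference and query differ.

    Hypothetical helper, not part of alns. '-' marks a gap. Positions are
    1-based reference coordinates; an insertion relative to the reference
    is reported at the position of the preceding reference residue.
    """
    if len(ref_aligned) != len(query_aligned):
        raise ValueError("aligned sequences must have equal length")

    rows = []
    ref_pos = ref_start - 1
    for r, q in zip(ref_aligned, query_aligned):
        if r != "-":
            ref_pos += 1  # only real reference residues advance the coordinate
        if r == q:
            continue  # identical columns carry no change to record
        if r == "-":
            kind = "insertion"
        elif q == "-":
            kind = "deletion"
        else:
            kind = "substitution"
        rows.append({"ref_pos": ref_pos, "ref": r, "query": q, "type": kind})
    return rows


def to_csv(rows):
    """Serialise the rows as CSV text that a spreadsheet can open."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=["ref_pos", "ref", "query", "type"])
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


# Made-up aligned sequences, for illustration only.
rows = alignment_differences("ACGT-ACGT", "ACCTTAC-T")

# The kind of filter a spreadsheet would apply: keep substitutions only.
substitutions = [row for row in rows if row["type"] == "substitution"]
```

Storing only the differing columns keeps the table small when sequences are highly conserved, and each row can then be joined to other columns such as phenotype or sampling date; whether alns itself takes this approach is not stated in the abstract.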

MeSH terms

  • Amino Acid Sequence
  • Conserved Sequence
  • DNA / chemistry*
  • Phenotype
  • Proteins / chemistry*
  • Sequence Alignment
  • Software*

Substances

  • Proteins
  • DNA