Comparative analysis of CRISPR loci in lactic acid bacteria genomes

Int J Food Microbiol. 2009 Apr 30;131(1):62-70. doi: 10.1016/j.ijfoodmicro.2008.05.030. Epub 2008 Jul 16.

Abstract

Clustered regularly interspaced short palindromic repeats (CRISPR) are hypervariable loci widely distributed in bacteria and archaea, that provide acquired immunity against foreign genetic elements. Here, we investigate the occurrence of CRISPR loci in the genomes of lactic acid bacteria (LAB), including members of the Firmicutes and Actinobacteria phyla. A total of 102 complete and draft genomes across 11 genera were studied and 66 CRISPR loci were identified in 26 species. We provide a comparative analysis of the CRISPR/cas content and diversity across LAB genera and species for 37 sets of CRISPR loci. We analyzed CRISPR repeats, CRISPR spacers, leader sequences, and cas gene content, sequences and architecture. Interestingly, multiple CRISPR families were identified within Bifidobacterium, Lactobacillus and Streptococcus, and similar CRISPR loci were found in distant organisms. Overall, eight distinct CRISPR families were identified consistently across CRISPR repeats, cas gene content and architecture, and sequences of the universal cas1 gene. Since the clustering of the CRISPR families does not correlate with the classical phylogenetic tree, we hypothesize that CRISPR loci have been subjected to horizontal gene transfer and further evolved independently in select lineages, in part due to selective pressure resulting from phage predation. Globally, we provide additional insights into the origin and evolution of CRISPR loci and discuss their contribution to microbial adaptation.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • 5' Untranslated Regions
  • DNA, Intergenic
  • Evolution, Molecular*
  • Genes, Bacterial*
  • Genome, Bacterial*
  • Gram-Positive Bacteria / genetics*
  • Inverted Repeat Sequences / genetics*
  • Lactobacillaceae / genetics*

Substances

  • 5' Untranslated Regions
  • DNA, Intergenic