Cytosine deamination and selection of CpG suppressed clones are the two major independent biological forces that shape codon usage bias in coronaviruses

Virology. 2007 Dec 20;369(2):431-42. doi: 10.1016/j.virol.2007.08.010. Epub 2007 Sep 19.

Abstract

Using the complete genome sequences of 19 coronavirus genomes, we analyzed the codon usage bias, dinucleotide relative abundance and cytosine deamination in coronavirus genomes. Of the eight codons that contain CpG, six were markedly suppressed. The mean NNU/NNC ratio of the six amino acids using either NNC or NNU as codon is 3.262, suggesting cytosine deamination. Among the 16 dinucleotides, CpG was most markedly suppressed (mean relative abundance 0.509). No correlation was observed between CpG abundance and mean NNU/NNC ratio. Among the 19 coronaviruses, CoV-HKU1 showed the most extreme codon usage bias and extremely high NNU/NNC ratio of 8.835. Cytosine deamination and selection of CpG suppressed clones by the immune system are the two major independent biochemical and biological selective forces that shape codon usage bias in coronavirus genomes. The underlying mechanism for the extreme codon usage bias, cytosine deamination and G+C content in CoV-HKU1 warrants further studies.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Base Sequence
  • Codon / genetics
  • Coronaviridae / classification
  • Coronaviridae / genetics*
  • CpG Islands
  • Cytosine / chemistry
  • Genome, Viral
  • Humans
  • RNA, Viral / chemistry
  • RNA, Viral / genetics
  • Species Specificity

Substances

  • Codon
  • RNA, Viral
  • Cytosine