An ancient genomic regulatory block conserved across bilaterians and its dismantling in tetrapods by retrogene replacement

Genome Res. 2012 Apr;22(4):642-55. doi: 10.1101/gr.132233.111. Epub 2012 Jan 10.

Abstract

Developmental genes are regulated by complex, distantly located cis-regulatory modules (CRMs), often forming genomic regulatory blocks (GRBs) that are conserved among vertebrates and among insects. We have investigated GRBs associated with Iroquois homeobox genes in 39 metazoans. Despite 600 million years of independent evolution, Iroquois genes are linked to ankyrin-repeat-containing Sowah genes in nearly all studied bilaterians. We show that Iroquois-specific CRMs populate the Sowah locus, suggesting that regulatory constraints underlie the maintenance of the Iroquois-Sowah syntenic block. Surprisingly, tetrapod Sowah orthologs are intronless and not associated with Iroquois; however, teleost and elephant shark data demonstrate that this is a derived feature, and that many Iroquois-CRMs were ancestrally located within Sowah introns. Retroposition, gene, and genome duplication have allowed selective elimination of Sowah exons from the Iroquois regulatory landscape while keeping associated CRMs, resulting in large associated gene deserts. These results highlight the importance of CRMs in imposing constraints to genome architecture, even across large phylogenetic distances, and of gene duplication-mediated genetic redundancy to disentangle these constraints, increasing genomic plasticity.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Evolution, Molecular
  • Gene Duplication / genetics
  • Gene Expression Regulation, Developmental
  • Genome / genetics*
  • Genomics / methods
  • Homeodomain Proteins / classification
  • Homeodomain Proteins / genetics*
  • Insecta / classification
  • Insecta / embryology
  • Insecta / genetics
  • Invertebrates / classification
  • Invertebrates / embryology
  • Invertebrates / genetics*
  • Molecular Sequence Data
  • Phylogeny
  • Retroelements / genetics
  • Sequence Homology, Amino Acid
  • Species Specificity
  • Vertebrates / classification
  • Vertebrates / embryology
  • Vertebrates / genetics*

Substances

  • Homeodomain Proteins
  • Retroelements

Associated data

  • GENBANK/JN228895
  • GENBANK/JN609218