New insights on the sister lineage of percomorph fishes with an anchored hybrid enrichment dataset

Mol Phylogenet Evol. 2017 May:110:27-38. doi: 10.1016/j.ympev.2017.02.017. Epub 2017 Feb 27.

Abstract

Percomorph fishes represent over 17,100 species, including several model organisms and species of economic importance. Despite continuous advances in the resolution of the percomorph Tree of Life, resolution of the sister lineage to Percomorpha remains inconsistent but restricted to a small number of candidate lineages. Here we use an anchored hybrid enrichment (AHE) dataset of 132 loci with over 99,000 base pairs to identify the sister lineage of percomorph fishes. Initial analyses of this dataset failed to recover a strongly supported sister clade to Percomorpha, however, scrutiny of the AHE dataset revealed a bias towards high GC content at fast-evolving codon partitions (GC bias). By combining several existing approaches aimed at mitigating the impacts of convergence in GC bias, including RY coding and analyses of amino acids, we consistently recovered a strongly supported clade comprised of Holocentridae (squirrelfishes), Berycidae (Alfonsinos), Melamphaidae (bigscale fishes), Cetomimidae (flabby whalefishes), and Rondeletiidae (redmouth whalefishes) as the sister lineage to Percomorpha. Additionally, implementing phylogenetic informativeness (PI) based metrics as a filtration method yielded this same topology, suggesting PI based approaches will preferentially filter these fast-evolving regions and act in a manner consistent with other phylogenetic approaches aimed at mitigating GC bias. Our results provide a new perspective on a key issue for studies investigating the evolutionary history of more than one quarter of all living species of vertebrates.

Keywords: Anchored phylogenomics; Beryciformes; Codon bias; GC3; Homoplasy; Hybrid enrichment; Nucleotide saturation.

MeSH terms

  • Amino Acids / genetics
  • Animals
  • Base Composition / genetics
  • Databases, Genetic*
  • Fishes / classification*
  • Fishes / genetics*
  • Genomics
  • Hybridization, Genetic*
  • Likelihood Functions
  • Nucleotides / genetics
  • Phylogeny*
  • Species Specificity

Substances

  • Amino Acids
  • Nucleotides