Capturing sequence diversity in metagenomes with comprehensive and scalable probe design

Nat Biotechnol. 2019 Feb;37(2):160-168. doi: 10.1038/s41587-018-0006-x. Epub 2019 Feb 4.

Abstract

Metagenomic sequencing has the potential to transform microbial detection and characterization, but new tools are needed to improve its sensitivity. Here we present CATCH, a computational method to enhance nucleic acid capture for enrichment of diverse microbial taxa. CATCH designs optimal probe sets, with a specified number of oligonucleotides, that achieve full coverage of, and scale well with, known sequence diversity. We focus on applying CATCH to capture viral genomes in complex metagenomic samples. We design, synthesize, and validate multiple probe sets, including one that targets the whole genomes of the 356 viral species known to infect humans. Capture with these probe sets enriches unique viral content on average 18-fold, allowing us to assemble genomes that could not be recovered without enrichment, and accurately preserves within-sample diversity. We also use these probe sets to recover genomes from the 2018 Lassa fever outbreak in Nigeria and to improve detection of uncharacterized viral infections in human and mosquito samples. The results demonstrate that CATCH enables more sensitive and cost-effective metagenomic sequencing.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Computational Biology / methods*
  • Culicidae / virology
  • Disease Outbreaks
  • Gene Library
  • Genetic Variation
  • Genome, Viral*
  • Genomics
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Lassa Fever / virology
  • Metagenome*
  • Metagenomics*
  • Nigeria / epidemiology
  • Oligonucleotide Probes
  • Oligonucleotides / genetics
  • Sequence Analysis, DNA
  • Virus Diseases

Substances

  • Oligonucleotide Probes
  • Oligonucleotides