While population stratification has been extensively studied and quantified in sexually reproducing species, it has been only anecdotally described in others. Here, we conducted a large-scale survey of population stratification in prokaryotic species, sampled from their natural habitat, without isolation or cultivation biases. We surveyed the variation landscape of 71 prevalent microbial species in 2144 human fecal metagenomes, and show that in 44 of these, accounting for 72% of the total assigned microbial abundance, single nucleotide variation can be clustered and stratified into sub-populations (termed subspecies). Individuals are usually dominated by only one such subspecies (per species), as expected of closely related organisms from ecological theory. Geographical dispersal patterns differ between phyla, with Firmicutes subspecies being significantly more geographically restricted. To demonstrate the functional significance of the delineated subspecies, we performed a metagenomic analysis, independent of reference genomes, of the gene content that consistently differs between conspecific subspecies, allowing us to associate groups of genes to properties of the microbial community and to the host. For example, some Eubacterium rectale subspecies specifically harbor an accessory pro-inflammatory flagellum operon that is associated with lower gut community diversity, higher host BMI and higher blood fasting insulin levels. Taken together, subspecies are a natural taxonomic level in prokaryotes and facilitate the functional and ecological understanding of the human gut microbiome in conjunction with its host.
Less...