School of Civil and Environmental Engineering, Georgia Institute of Technology, 311 Ferst Dr NW, Atlanta, GA, 30332, USA.
Environ Microbiol. 2020 Aug;22(8):3394-3412. doi: 10.1111/1462-2920.15112. Epub 2020 Jul 29.
Recent advances in sequencing technology and bioinformatic pipelines have allowed unprecedented access to the genomes of yet-uncultivated microorganisms from diverse environments. However, the catalogue of freshwater genomes remains limited, and most genome recovery attempts in freshwater ecosystems have only targeted specific taxa. Here, we present a genome recovery pipeline incorporating iterative subtractive binning, and apply it to a time series of 100 metagenomic datasets from seven connected lakes and estuaries along the Chattahoochee River (Southeastern USA). Our set of metagenome-assembled genomes (MAGs) represents >400 yet-unnamed genomospecies, substantially increasing the number of high-quality MAGs from freshwater lakes. We propose names for two novel species: 'Candidatus Elulimicrobium humile' ('Ca. Elulimicrobiota', 'Patescibacteria') and 'Candidatus Aquidulcis frankliniae' ('Chloroflexi'). Collectively, our MAGs represented about half of the total microbial community at any sampling point. To evaluate the prevalence of these genomospecies in the chronoseries, we introduce methodologies to estimate relative abundance and habitat preference that control for uneven genome quality and sample representation. We demonstrate high degrees of habitat-specialization and endemicity for most genomospecies in the Chattahoochee lakes. Wider ecological ranges characterized smaller genomes with higher coding densities, indicating an overall advantage of smaller, more compact genomes for cosmopolitan distributions.
测序技术和生物信息学管道的最新进展使得人们能够以前所未有的方式访问来自不同环境的未培养微生物的基因组。然而,淡水基因组的目录仍然有限,大多数淡水生态系统中的基因组回收尝试仅针对特定的分类群。在这里,我们提出了一个包含迭代减法分箱的基因组回收管道,并将其应用于沿 Chattahoochee 河(美国东南部)的七个相连的湖泊和河口的 100 个宏基因组数据集的时间序列。我们的一组宏基因组组装基因组(MAG)代表了 >400 个尚未命名的基因组种,大大增加了来自淡水湖泊的高质量 MAG 的数量。我们为两个新物种提出了名称:“Candidatus Elulimicrobium humile”(“Ca.Elulimicrobiota”,“Patescibacteria”)和“Candidatus Aquidulcis frankliniae”(“Chloroflexi”)。总的来说,我们的 MAG 在任何采样点都代表了总微生物群落的大约一半。为了评估这些基因组种在 chronoseries 中的普遍性,我们引入了估计相对丰度和栖息地偏好的方法,这些方法可以控制基因组质量和样本代表性的不均匀性。我们证明了 Chattahoochee 湖泊中大多数基因组种都具有高度的栖息地专业化和特有性。更大的基因组具有更高的编码密度,其生态范围更广泛,这表明较小、更紧凑的基因组在全球分布中具有总体优势。