Tromas Nicolas, Fortin Nathalie, Bedrani Larbi, Terrat Yves, Cardoso Pedro, Bird David, Greer Charles W, Shapiro B Jesse
Département de Sciences Biologiques, Université de Montréal, 90 Vincent-d'Indy, Montréal, QC, Canada.
National Research Council Canada, Energy, Mining and Environment, Montréal, QC, Canada.
ISME J. 2017 Aug;11(8):1746-1763. doi: 10.1038/ismej.2017.58. Epub 2017 May 19.
Cyanobacterial blooms occur in lakes worldwide, producing toxins that pose a serious public health threat. Eutrophication caused by human activities and warmer temperatures both contribute to blooms, but it is still difficult to predict precisely when and where blooms will occur. One reason that prediction is so difficult is that blooms can be caused by different species or genera of cyanobacteria, which may interact with other bacteria and respond to a variety of environmental cues. Here we used a deep 16S amplicon sequencing approach to profile the bacterial community in eutrophic Lake Champlain over time, to characterise the composition and repeatability of cyanobacterial blooms, and to determine the potential for blooms to be predicted based on time course sequence data. Our analysis, based on 135 samples between 2006 and 2013, spans multiple bloom events. We found that bloom events significantly alter the bacterial community without reducing overall diversity, suggesting that a distinct microbial community-including non-cyanobacteria-prospers during the bloom. We also observed that the community changes cyclically over the course of a year, with a repeatable pattern from year to year. This suggests that, in principle, bloom events are predictable. We used probabilistic assemblages of OTUs to characterise the bloom-associated community, and to classify samples into bloom or non-bloom categories, achieving up to 92% classification accuracy (86% after excluding cyanobacterial sequences). Finally, using symbolic regression, we were able to predict the start date of a bloom with 78-92% accuracy (depending on the data used for model training), and found that sequence data was a better predictor than environmental variables.
蓝藻水华在世界各地的湖泊中都有发生,产生的毒素对公众健康构成严重威胁。人类活动导致的富营养化和气温升高都促使水华的形成,但仍然很难精确预测水华何时何地会发生。预测如此困难的一个原因是,水华可能由不同种类或属的蓝藻引起,这些蓝藻可能与其他细菌相互作用,并对各种环境线索做出反应。在这里,我们使用深度16S扩增子测序方法,对富营养化的尚普兰湖随时间变化的细菌群落进行分析,以表征蓝藻水华的组成和重复性,并确定基于时间序列数据预测水华的可能性。我们基于2006年至2013年期间的135个样本进行的分析涵盖了多个水华事件。我们发现,水华事件显著改变了细菌群落,但并未降低整体多样性,这表明在水华期间,一个独特的微生物群落(包括非蓝藻)繁荣发展。我们还观察到,群落每年都会周期性变化,且年复一年具有可重复的模式。这表明,原则上,水华事件是可预测的。我们使用操作分类单元(OTU)的概率组合来表征与水华相关的群落,并将样本分类为水华或非水华类别,分类准确率高达92%(排除蓝藻序列后为86%)。最后,使用符号回归,我们能够以78% - 92%的准确率预测水华的开始日期(取决于用于模型训练的数据),并且发现序列数据比环境变量是更好的预测指标。