Chen Fengxian, Zhou Bin, Yang Liqiong, Chen Xijuan, Zhuang Jie
Key Laboratory of Pollution Ecology and Environmental Engineering, Institute of Applied Ecology, Chinese Academy of Sciences, Shenyang, Liaoning, China.
Faculty of Medicine, University of Augsburg, Augsburg, Germany.
Front Microbiol. 2023 May 10;14:1152059. doi: 10.3389/fmicb.2023.1152059. eCollection 2023.
, as an indicator of fecal contamination, can move from manure-amended soil to groundwater under rainfall or irrigation events. Predicting its vertical transport in the subsurface is essential for the development of engineering solutions to reduce the risk of microbiological contamination. In this study, we collected 377 datasets from 61 published papers addressing transport through saturated porous media and trained six types of machine learning algorithms to predict bacterial transport. Eight variables, including bacterial concentration, porous medium type, median grain size, ionic strength, pore water velocity, column length, saturated hydraulic conductivity, and organic matter content were used as input variables while the first-order attachment coefficient and spatial removal rate were set as target variables. The eight input variables have low correlations with the target variables, namely, they cannot predict target variables independently. However, using the predictive models, input variables can effectively predict the target variables. For scenarios with higher bacterial retention, such as smaller median grain size, the predictive models showed better performance. Among six types of machine learning algorithms, Gradient Boosting Machine and Extreme Gradient Boosting outperformed other algorithms. In most predictive models, pore water velocity, ionic strength, median grain size, and column length showed higher importance than other input variables. This study provided a valuable tool to evaluate the transport risk of in the subsurface under saturated water flow conditions. It also proved the feasibility of data-driven methods that could be used for predicting other contaminants' transport in the environment.
作为粪便污染的一个指标,在降雨或灌溉事件下,它可以从施用过粪肥的土壤移动到地下水中。预测其在地下的垂直运移对于开发工程解决方案以降低微生物污染风险至关重要。在本研究中,我们从61篇已发表的论文中收集了377个数据集,这些论文涉及通过饱和多孔介质的运移,并训练了六种机器学习算法来预测细菌运移。八个变量,包括细菌浓度、多孔介质类型、中值粒径、离子强度、孔隙水速度(原文有误,应为pore water velocity)、柱长、饱和导水率和有机质含量被用作输入变量,而一级附着系数和空间去除率率被设置为目标变量。这八个输入变量与目标变量的相关性较低,也就是说,它们不能独立预测目标变量。然而,使用预测模型,输入变量可以有效地预测目标变量。对于细菌保留率较高的情况,如较小的中值粒径,预测模型表现出更好的性能。在六种机器学习算法中,梯度提升机和极端梯度提升比其他算法表现更好。在大多数预测模型中,孔隙水速度、离子强度、中值粒径和柱长比其他输入变量显示出更高的重要性。本研究提供了一个有价值的工具来评估在饱和水流条件下地下的运移风险。它还证明了数据驱动方法可用于预测环境中其他污染物运移的可行性。