Yang Heng, Wang Panlei, Chen Anqiang, Ye Yuanhang, Chen Qingfei, Cui Rongyang, Zhang Dan
College of Resource and Environment, Yunnan Agricultural University, Kunming, 650201, China.
Agricultural Environment and Resources Institute, Yunnan Academy of Agricultural Sciences, Kunming, 650201, China.
Chemosphere. 2023 Feb;313:137623. doi: 10.1016/j.chemosphere.2022.137623. Epub 2022 Dec 21.
Excessive accumulation of phosphorus in soil profiles has become the main source of phosphorus in groundwater due to the application of phosphorus fertilizers in intensive agricultural regions (IARs). Elevated phosphorus concentrations in groundwater have become a global phenomenon, which places enormous pressure on the safe use of water resources and the safety of the aquatic environment. Currently, the prediction of pollutant concentrations in groundwater mainly focuses on nitrate nitrogen, while research on phosphorus prediction is limited. Taking the IARs approximately 8 plateau lakes in the Yunnan-Guizhou Plateau as an example, 570 shallow groundwater samples and 28 predictor variables were collected and measured, and a machine learning approach was used to predict phosphorus concentrations in groundwater. The performance of three machine learning algorithms and different sets of variables for predicting phosphorus concentrations in shallow groundwater was evaluated. The results showed that after all variables were introduced into the model, the R, RMSE and MAE of support vector machine (SVM), random forest (RF) and neural network (NN) were 0.52-0.60, 0.101-0.108 and 0.074-0.081, respectively. Among them, the SVM model had the best prediction effect. The clay content and water-soluble phosphorus in soil and soluble organic carbon in groundwater had a high contribution to the prediction accuracy of the model. The prediction accuracy of the model with reduced number of variables showed that when the number of variables was equal to 6, the RF model had R, RMSE and MAE values of 0.53, 0.108 and 0.074, respectively, and the number of variables increased again; there were small changes in R, RMSE and MAE. Compared with the SVM and NN models, the RF model can achieve higher accuracy by inputting fewer variables.
由于集约化农业地区(IARs)磷肥的施用,土壤剖面中磷的过量积累已成为地下水中磷的主要来源。地下水中磷浓度升高已成为一种全球现象,这给水资源的安全利用和水生环境安全带来了巨大压力。目前,地下水中污染物浓度的预测主要集中在硝态氮,而对磷预测的研究有限。以云贵高原约8个高原湖泊周边的集约化农业地区为例,采集并测定了570个浅层地下水样本和28个预测变量,并采用机器学习方法预测地下水中的磷浓度。评估了三种机器学习算法和不同变量集对浅层地下水中磷浓度的预测性能。结果表明,将所有变量引入模型后,支持向量机(SVM)、随机森林(RF)和神经网络(NN)的R、RMSE和MAE分别为0.52 - 0.60、0.101 - 0.108和0.074 - 0.081。其中,SVM模型的预测效果最佳。土壤中的黏粒含量、水溶性磷以及地下水中的可溶性有机碳对模型的预测精度有较高贡献。变量数量减少的模型的预测精度表明,当变量数量等于6时,RF模型的R、RMSE和MAE值分别为0.53、0.108和0.074,变量数量再次增加时,R值、RMSE和MAE的变化较小。与SVM和NN模型相比,RF模型通过输入较少的变量即可实现更高的精度。