Center for Remote Sensing and GIS Research, Faculty of Earth Sciences, Shahid Beheshti University, Tehran, Iran.
Center for Remote Sensing and GIS Research, Faculty of Earth Sciences, Shahid Beheshti University, Tehran, Iran.
J Environ Manage. 2023 Nov 1;345:118782. doi: 10.1016/j.jenvman.2023.118782. Epub 2023 Aug 17.
Groundwater is one of the most important water resources around the world, which is increasingly exposed to contamination. As nitrate is a common pollutant of groundwater and has negative effects on human health, predicting its concentration is of particular importance. Ensemble machine learning (ML) algorithms have been widely employed for nitrate concentration prediction in groundwater. However, existing ensemble models often overlook spatial variation by combining ML models with conventional methods like averaging. The objective of this study is to enhance the spatial accuracy of groundwater nitrate concentration prediction by integrating the outputs of ML models using a local approach that accounts for spatial variation. Initially, three widely used ML models including random forest regression (RFR), k-nearest neighbor (KNN), and support vector regression (SVR) were employed to predict groundwater nitrate concentration of Qom aquifer in Iran. Subsequently, the output of these models were integrated using geographically weighted regression (GWR) as a local model. The findings demonstrated that the ensemble of ML models using GWR resulted in the highest performance (R2 = 0.75 and RMSE = 9.38 mg/l) compared to an ensemble model using averaging (R2 = 0.68 and RMSE = 10.56 mg/l), as well as individual models such as RFR (R2 = 0.70 and RMSE = 10.16 mg/l), SVR (R2 = 0.59 and RMSE = 11.95 mg/l), and KNN (R2 = 0.57 and RMSE = 12.19 mg/l). The resulting prediction map revealed that groundwater nitrate contamination is predominantly concentrated in urban areas located in the northwestern regions of the study area. The insights gained from this study have practical implications for managers, assisting them in preventing nitrate pollution in groundwater and formulating strategies to improve water quality.
地下水是全球最重要的水资源之一,但其越来越容易受到污染。由于硝酸盐是地下水的一种常见污染物,对人体健康有负面影响,因此预测其浓度尤为重要。集成机器学习(ML)算法已广泛用于地下水硝酸盐浓度预测。然而,现有的集成模型通常通过将 ML 模型与传统方法(如平均值)结合来忽略空间变化。本研究的目的是通过使用考虑空间变化的局部方法整合 ML 模型的输出,来提高地下水硝酸盐浓度预测的空间精度。最初,使用了三种广泛使用的 ML 模型,包括随机森林回归(RFR)、k-最近邻(KNN)和支持向量回归(SVR),来预测伊朗库姆含水层的地下水硝酸盐浓度。随后,使用地理加权回归(GWR)作为局部模型来整合这些模型的输出。结果表明,与使用平均值的集成模型(R2=0.68 和 RMSE=10.56mg/l)相比,使用 GWR 的 ML 模型集成的表现最佳(R2=0.75 和 RMSE=9.38mg/l),并且优于单个模型,如 RFR(R2=0.70 和 RMSE=10.16mg/l)、SVR(R2=0.59 和 RMSE=11.95mg/l)和 KNN(R2=0.57 和 RMSE=12.19mg/l)。生成的预测图显示,地下水硝酸盐污染主要集中在研究区西北部的城市地区。本研究的结果对于管理者具有实际意义,有助于他们预防地下水硝酸盐污染,并制定改善水质的策略。