Novel machine learning algorithms to predict the groundwater vulnerability index to nitrate pollution at two levels of modeling.

Author information

Elzain Hussam Eldin, Chung Sang Yong, Venkatramanan Senapathi, Selvam Sekar, Ahemd Hamdi Abdurhman, Seo Young Kyo, Bhuyan Md Simul, Yassin Mohamed A

Affiliations

Department of Environmental & Earth Sciences, Pukyong National University, Busan, 48513, South Korea; Water Research Center, Sultan Qaboos University, Muscat, Oman.

Department of Environmental & Earth Sciences, Pukyong National University, Busan, 48513, South Korea.

Publication information

Chemosphere. 2023 Feb;314:137671. doi: 10.1016/j.chemosphere.2022.137671. Epub 2022 Dec 28.

DOI:10.1016/j.chemosphere.2022.137671

PMID:36586442

Abstract

Accurate mapping and assessment of the groundwater vulnerability index are crucial for protecting groundwater resources from possible contamination. In this research, three novel machine learning (ML) regression models, k-nearest neighbors (KNN), ensemble extremely randomized trees (ERT), and ensemble bagging regression (BA), were applied at two levels of modeling to improve the DRASTIC-LU model for the Miryang aquifer in South Korea. The predicted outputs from level 1 (the KNN and ERT models) were used as inputs for the ensemble BA model in level 2. The groundwater pollution vulnerability index (GPVI) derived from the DRASTIC-LU model was adjusted with NO3-N data and used as the target data for the ML models. Hyperparameters for all models were tuned by grid search to determine the most effective model structures. Various statistical metrics and graphical representations were used to compare the predictive performance of the ML models. The ensemble BA model in level 2 predicted GPVI values more precisely than the standalone KNN and ensemble ERT models in level 1. Furthermore, the ensemble BA model produced suitable outcomes for unseen data in the testing phase, which suggests it was less prone to overfitting. Therefore, ML modeling at two levels could be an excellent approach for the proactive management of groundwater resources against contamination.
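
The two-level scheme described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes scikit-learn, uses synthetic data in place of the Miryang dataset (eight features standing in for the DRASTIC-LU parameters, a synthetic target standing in for the adjusted GPVI), and picks its own hyperparameter grids. Feeding level 2 with out-of-fold level 1 predictions is also an assumption made here to limit leakage; the abstract does not say how the level 1 outputs were generated.

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.ensemble import BaggingRegressor, ExtraTreesRegressor
from sklearn.metrics import r2_score
from sklearn.model_selection import GridSearchCV, cross_val_predict, train_test_split
from sklearn.neighbors import KNeighborsRegressor
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in data (assumption): 8 features mimic the DRASTIC-LU
# parameters and y mimics the nitrate-adjusted GPVI target.
X, y = make_regression(n_samples=300, n_features=8, noise=10.0, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0
)


def tune(estimator, grid, features, target):
    """Grid-search the estimator and return the best one, refit on all rows."""
    search = GridSearchCV(estimator, grid, cv=5, scoring="r2")
    search.fit(features, target)
    return search.best_estimator_


# Level 1: standalone KNN (with feature scaling) and ensemble ERT.
# The grids below are illustrative choices, not values from the paper.
knn = tune(
    make_pipeline(StandardScaler(), KNeighborsRegressor()),
    {"kneighborsregressor__n_neighbors": [3, 5, 7, 9]},
    X_train, y_train,
)
ert = tune(
    ExtraTreesRegressor(random_state=0),
    {"n_estimators": [50, 100], "max_depth": [None, 8]},
    X_train, y_train,
)
level1 = (knn, ert)

# Level 1 outputs become level 2 inputs. Training rows use out-of-fold
# predictions (assumption) so level 2 never sees in-sample fits.
Z_train = np.column_stack(
    [cross_val_predict(model, X_train, y_train, cv=5) for model in level1]
)
Z_test = np.column_stack([model.predict(X_test) for model in level1])

# Level 2: bagging regression (default decision-tree base learner)
# trained on the two level 1 prediction columns.
ba = tune(
    BaggingRegressor(random_state=0),
    {"n_estimators": [10, 30]},
    Z_train, y_train,
)

# Compare all three models on the held-out test split.
scores = {
    "KNN (level 1)": r2_score(y_test, knn.predict(X_test)),
    "ERT (level 1)": r2_score(y_test, ert.predict(X_test)),
    "BA (level 2)": r2_score(y_test, ba.predict(Z_test)),
}
for name, value in scores.items():
    print(f"{name}: R2 = {value:.3f}")
```

On synthetic data the ranking of the three models carries no meaning for the study area; the sketch only shows how predictions flow from level 1 into level 2 and where grid search fits in.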

Similar articles

Delimitation of groundwater zones under contamination risk using a bagged ensemble of optimized DRASTIC frameworks.

Environ Sci Pollut Res Int. 2019 Mar;26(8):8325-8339. doi: 10.1007/s11356-019-04252-9. Epub 2019 Jan 31.

Comparative study of machine learning models for evaluating groundwater vulnerability to nitrate contamination.

Ecotoxicol Environ Saf. 2022 Jan 1;229:113061. doi: 10.1016/j.ecoenv.2021.113061. Epub 2021 Dec 11.

An innovative approach for predicting groundwater TDS using optimized ensemble machine learning algorithms at two levels of modeling strategy.

J Environ Manage. 2024 Feb;351:119896. doi: 10.1016/j.jenvman.2023.119896. Epub 2024 Jan 3.

Improving groundwater nitrate concentration prediction using local ensemble of machine learning models.

J Environ Manage. 2023 Nov 1;345:118782. doi: 10.1016/j.jenvman.2023.118782. Epub 2023 Aug 17.

Geostatistical estimates of groundwater nitrate-nitrogen concentrations with spatial auxiliary information on DRASTIC-LU-based aquifer contamination vulnerability.

Environ Sci Pollut Res Int. 2023 Jul;30(33):81113-81130. doi: 10.1007/s11356-023-28208-2. Epub 2023 Jun 14.

ANFIS-MOA models for the assessment of groundwater contamination vulnerability in a nitrate contaminated area.

J Environ Manage. 2021 May 15;286:112162. doi: 10.1016/j.jenvman.2021.112162. Epub 2021 Feb 24.

Mapping groundwater contamination risk of multiple aquifers using multi-model ensemble of machine learning algorithms.

Sci Total Environ. 2018 Apr 15;621:697-712. doi: 10.1016/j.scitotenv.2017.11.185. Epub 2017 Nov 30.

Predictive modeling of groundwater nitrate pollution using Random Forest and multisource variables related to intrinsic and specific vulnerability: a case study in an agricultural setting (Southern Spain).

Sci Total Environ. 2014 Apr 1;476-477:189-206. doi: 10.1016/j.scitotenv.2014.01.001. Epub 2014 Jan 24.

Susceptibility Assessment of Groundwater Nitrate Contamination Using an Ensemble Machine Learning Approach.

Ground Water. 2023 Jul-Aug;61(4):510-516. doi: 10.1111/gwat.13258. Epub 2022 Sep 30.

Cited by

Hybrid metaheuristic optimized Catboost models for construction cost estimation of concrete solid slabs.

Sci Rep. 2025 Jul 1;15(1):21612. doi: 10.1038/s41598-025-06380-4.

Enhancement of groundwater resources quality prediction by machine learning models on the basis of an improved DRASTIC method.

Sci Rep. 2024 Dec 2;14(1):29933. doi: 10.1038/s41598-024-78812-6.
