Armanuos Asaad M, Zeleňáková Martina, Elshaarawy Mohamed Kamel
Irrigation and Hydraulics Engineering Department, Faculty of Engineering, Tanta University, Tanta, 31733, Egypt.
Institute of Environmental Engineering, Faculty of Civil Engineering, Technical University of Košice, 04200, Košice, Slovakia.
Sci Rep. 2025 Aug 10;15(1):29281. doi: 10.1038/s41598-025-12830-w.
Reliable modeling of saltwater intrusion (SWI) into freshwater aquifers is essential for the sustainable management of coastal groundwater resources and the protection of water quality. This study evaluates the performance of four Bayesian-optimized gradient boosting models in predicting the SWI wedge length ratio (L/L) in coastal sloping aquifers with underground barriers. A dataset of 456 samples was generated through numerical simulations using SEAWAT, incorporating key variables such as bed slope, hydraulic gradient, relative density, relative hydraulic conductivity, barrier wall depth ratio, and distance ratio. The dataset was divided into 70% for training and 30% for testing. Model performance was assessed using both visual and quantitative metrics. Among the models, Light Gradient Boosting (LGB) achieved the highest predictive accuracy, with RMSE values of 0.016 and 0.037 for the training and testing sets, respectively, and the highest coefficient of determination (R²). Stochastic Gradient Boosting (SGB) followed closely, while Categorical Gradient Boosting (CGB) and eXtreme Gradient Boosting (XGB) showed slightly higher error rates. SHapley Additive exPlanations (SHAP) analysis identified relative barrier wall distance and bed slope as the most influential features affecting model predictions. To support practical application, an interactive graphical user interface (GUI) was developed, allowing users to input key variables and easily estimate L/L values. Finally, the best-performing model was validated against the Akrotiri coastal aquifer in Cyprus, a realistic benchmark case derived from numerical simulations. The model's predictions showed strong agreement with reference results, achieving an RMSE of 0.04, thereby confirming its practical applicability. This study highlights the potential of interpretable, optimized ML models to enhance SWI prediction and support informed decision-making in coastal aquifer management.
可靠地模拟咸水入侵(SWI)进入淡水含水层对于沿海地下水资源的可持续管理和水质保护至关重要。本研究评估了四种贝叶斯优化梯度提升模型在预测有地下屏障的沿海倾斜含水层中SWI楔长比(L/L)方面的性能。通过使用SEAWAT进行数值模拟生成了一个包含456个样本的数据集,纳入了诸如河床坡度、水力梯度、相对密度、相对水力传导率、屏障墙深度比和距离比等关键变量。该数据集被分为70%用于训练和30%用于测试。使用视觉和定量指标评估模型性能。在这些模型中,轻量级梯度提升(LGB)实现了最高的预测精度,训练集和测试集的均方根误差(RMSE)值分别为0.016和0.037,以及最高的决定系数(R²)。随机梯度提升(SGB)紧随其后,而分类梯度提升(CGB)和极端梯度提升(XGB)显示出略高的错误率。SHapley加性解释(SHAP)分析确定相对屏障墙距离和河床坡度是影响模型预测的最具影响力的特征。为支持实际应用,开发了一个交互式图形用户界面(GUI),允许用户输入关键变量并轻松估计L/L值。最后,针对塞浦路斯的阿克罗蒂里沿海含水层对性能最佳的模型进行了验证,该含水层是一个从数值模拟得出的实际基准案例。模型的预测结果与参考结果高度一致,RMSE为0.04,从而证实了其实际适用性。本研究突出了可解释的优化机器学习模型在增强SWI预测和支持沿海含水层管理中的明智决策方面的潜力。