El Bilali Ali, Abdeslam Taleb, Ayoub Nafii, Lamane Houda, Ezzaouini Mohamed Abdellah, Elbeltagi Ahmed
Hassan University of Casablanca, Faculty of Sciences and Techniques of Mohammedia, Morocco; River Basin Agency of Bouregreg and Chaouia, Benslimane, Morocco.
Hassan University of Casablanca, Faculty of Sciences and Techniques of Mohammedia, Morocco.
J Environ Manage. 2023 Feb 1;327:116890. doi: 10.1016/j.jenvman.2022.116890. Epub 2022 Nov 29.
Evaporation is an important hydrological process in the water cycle, especially for water bodies. Machine Learning (ML) models have become accurate and powerful tools in predicting pan evaporation. Meanwhile, the "black-box" character and the consistency with the physical process can decrease the practical implication of ML models. To overcome such limitations, we attempt to develop an interpretable based-ML framework to predict daily pan evaporation using Extra Tree, XGBoost, SVR, and Deep Neural Network (DNN) ML models using hourly climate datasets. To that end, we integrated and employed the Shapely Additive explanations (SHAP), Sobol-based sensitivity analysis, and Local Interpretable Model-agnostic Explanations (LIME) to evaluate the interpretability of the models in predicting daily pan evaporation, at Sidi Mohammed Ben Abdellah (SMBA) weather station, in Morocco. The validation results of the models showed that the developed models are accurate in reproducing the daily pan evaporation with NSE ranging from 0.76 to 0.83 during the validation phase. Furthermore, the interpretability results of the ML models showed that the air temperature (Ta), solar radiation (Rs), followed by relative humidity (H) are the most important climate variables with inflection points of the Ta_median, Ta_mean, Rs_sum, H_mean, and w_std are 17.42 °C, 17.65 °C, 3.8 kw.m, 69.59%, and 1.25 m s, sequentially. Overall, the interpretability of the models showed a good consistency of the ML models with the real hydro-climatic process of evaporation in a semi-arid environment. Hence, the proposed methodology is powerful in enhancing the reliability and transparency of the developed models for predicting daily pan evaporation. Finally, the proposed approach is new insights to reduce the ''Black-Box'' character of ML models in hydrological studies.
蒸发是水循环中一个重要的水文过程,对水体而言尤其如此。机器学习(ML)模型已成为预测蒸发皿蒸发量的精确且强大的工具。与此同时,ML模型的“黑箱”特性以及与物理过程的一致性可能会降低其实际应用价值。为克服这些局限性,我们尝试开发一个基于可解释性ML的框架,利用小时气候数据集,通过Extra Tree、XGBoost、支持向量回归(SVR)和深度神经网络(DNN)ML模型来预测每日蒸发皿蒸发量。为此,我们整合并运用了Shapely加法解释(SHAP)、基于索博尔的敏感性分析以及局部可解释模型无关解释(LIME),以评估在摩洛哥西迪·穆罕默德·本·阿卜杜拉(SMBA)气象站预测每日蒸发皿蒸发量时模型的可解释性。模型的验证结果表明,所开发的模型在验证阶段能准确再现每日蒸发皿蒸发量,纳什效率系数(NSE)范围为0.76至0.83。此外,ML模型的可解释性结果表明,气温(Ta)、太阳辐射(Rs),其次是相对湿度(H)是最重要的气候变量,Ta的中位数、Ta的平均值、Rs的总和、H的平均值以及风速标准差(w_std)的拐点依次为17.42℃、17.65℃、3.8千瓦·米、69.59%和1.25米/秒。总体而言,模型的可解释性表明ML模型与半干旱环境中蒸发的实际水文气候过程具有良好的一致性。因此,所提出的方法在提高所开发的每日蒸发皿蒸发量预测模型的可靠性和透明度方面很有效。最后,所提出的方法为减少水文研究中ML模型的“黑箱”特性提供了新的见解。