State Key Laboratory of Hydrology-Water Resources and Hydraulic Engineering, Hohai University, Nanjing, Jiangsu, 210098, China; Yangtze Institute for Conservation and Development, Hohai Unversity, Nanjing, Jiangsu, 210098, China; Civil and Environmental Engineering, University of California, Merced, 95343, CA, USA.
Civil and Environmental Engineering, University of California, Merced, 95343, CA, USA.
Sci Total Environ. 2021 May 15;769:144715. doi: 10.1016/j.scitotenv.2020.144715. Epub 2021 Jan 20.
Agricultural water demand, groundwater extraction, surface water delivery and climate have complex nonlinear relationships with groundwater storage in agricultural regions. As an alternative to elaborate computationally intensive physical models, machine learning methods are often adopted as surrogate to capture such complex relationships due to their high computational efficiency. Inevitably, using only one machine learning model is prone to underestimate prediction uncertainty and subjected to poor accuracy. This study presents a novel machine learning-based groundwater ensemble modeling framework in conjunction with a Bayesian model averaging approach to predict groundwater storage change and improve overall model predicting reliability. Three different machine learning models have been developed namely artificial neural network, support vector machine and response surface regression. To explicitly quantify uncertainty from machine learning model parameter and structure, Bayesian model averaging is employed to produce a forecast distribution associated with each machine learning prediction. Model weights and variances are obtained based on model performance to construct ensemble models. Then, the developed individual and Bayesian model averaging machine learning ensemble models are applied, evaluated and validated at different spatial scales including subregional and regional scales in an overdrafted agricultural region-the San Joaquin River Basin, through independent training and testing dataset. Results shows the machine learning models have remarkable predicting capability without sacrificing accuracy but with higher computational efficiency. Compared to a single-model approach, the ensemble model is able to produce consistently reliable predictions across the basin, yet it does not always outperform the best model in the ensemble. Additionally, model results suggest that groundwater pumping for agricultural irrigation is the primary driving force of groundwater storage change across the region. The modeling framework can serve as an alternative approach to simulating groundwater response, especially in those agricultural regions where lack of subsurface data hinders physically-based modeling.
农业用水需求、地下水开采、地表水输送和气候与农业区地下水储量之间存在复杂的非线性关系。由于计算效率高,机器学习方法常被用作替代方法来捕捉这种复杂关系,而不是采用繁琐的计算密集型物理模型。不可避免的是,仅使用一种机器学习模型容易低估预测不确定性,并导致精度较差。本研究提出了一种新的基于机器学习的地下水集合建模框架,结合贝叶斯模型平均方法,以预测地下水储量变化并提高整体模型预测可靠性。开发了三种不同的机器学习模型,即人工神经网络、支持向量机和响应面回归。为了明确量化机器学习模型参数和结构的不确定性,采用贝叶斯模型平均方法生成与每个机器学习预测相关的预测分布。根据模型性能获得模型权重和方差,以构建集合模型。然后,将开发的个体和贝叶斯模型平均机器学习集合模型应用于圣华金河流域(一个超采农业区)的不同空间尺度(包括次区域和区域尺度),通过独立的训练和测试数据集进行评估和验证。结果表明,机器学习模型具有出色的预测能力,同时不牺牲准确性,但计算效率更高。与单模型方法相比,集合模型能够在整个流域产生一致可靠的预测,但并不总是优于集合中的最佳模型。此外,模型结果表明,农业灌溉地下水开采是该地区地下水储量变化的主要驱动力。该建模框架可作为模拟地下水响应的替代方法,特别是在那些缺乏地下数据阻碍基于物理的建模的农业地区。