Center for Water Resources and Environment Research, School of Civil Engineering, Sun Yat-sen University, Guangzhou, China.
Center for Water Resources and Environment Research, School of Civil Engineering, Sun Yat-sen University, Guangzhou, China.
J Environ Manage. 2021 Oct 1;295:113085. doi: 10.1016/j.jenvman.2021.113085. Epub 2021 Jun 18.
Accurate prediction of dissolved oxygen time series is important for improving the water environment and aiding water resource management. In this study, four stand-alone models including multiple linear regression (MLR), support vector machine (SVM), artificial neural network (ANN) and random forest (RF), and four hybrid models based on wavelet transform (WT) including WT-MLR, WT-SVM, WT-ANN and WT-RF were used to predict the daily dissolved oxygen (DO) at 1-5-day lead times in the Dongjiang River Basin, China. To make the prediction robust, the maximal information coefficient (MIC) was used to capture comprehensive information between DO and explanatory variables. The 5-fold cross validation grid search approach was used to optimize parameters of machine learning tools. Two types of frameworks of WT: direct framework (i.e., only the explanatory variables were decomposed) and multicomponent framework (i.e., both explanatory variables and target variables were decomposed) were used to construct hybrid models. The results show that MIC extracts four optimal explanatory variables: previous DO, water temperature, air temperature and air pressure. Four evaluation parameters including correlation coefficient (R), Nash-Sutcliffe efficiency (NSE), mean absolute error (MAE) and root mean square error (RMSE) indicate that the prediction accuracy decreases as the lead time changes from 1 to 5 days. In terms of the stand-alone models, MLR model outperforms the other three models with higher NSE values of 0.616-0.921, and lower RMSE values of 0.503-1.111. With regard to the hybrid models, WT-ANN and WT-MLR models exhibit higher performance, and multicomponent framework performs better than direct framework in all hybrid models. In general, the multicomponent framework of WT can improve the prediction accuracy of stand-alone models at a certain degree, while the direct framework shows no obvious advantage.
准确预测溶解氧时间序列对于改善水环境和辅助水资源管理至关重要。本研究采用四种独立模型(多元线性回归(MLR)、支持向量机(SVM)、人工神经网络(ANN)和随机森林(RF))和四种基于小波变换(WT)的混合模型(WT-MLR、WT-SVM、WT-ANN 和 WT-RF),预测中国东江流域未来 1-5 天的日溶解氧(DO)。为了使预测稳健,最大信息系数(MIC)用于捕捉 DO 与解释变量之间的综合信息。使用 5 倍交叉验证网格搜索方法来优化机器学习工具的参数。采用两种类型的 WT 框架:直接框架(仅对解释变量进行分解)和多分量框架(同时对解释变量和目标变量进行分解)来构建混合模型。结果表明,MIC 提取了四个最佳解释变量:前一天 DO、水温、气温和气压。四个评价参数(相关系数(R)、纳什效率系数(NSE)、平均绝对误差(MAE)和均方根误差(RMSE))表明,随着从 1 天到 5 天的时间推移,预测精度会降低。就独立模型而言,MLR 模型的性能优于其他三个模型,其 NSE 值更高(0.616-0.921),RMSE 值更低(0.503-1.111)。对于混合模型,WT-ANN 和 WT-MLR 模型表现出更高的性能,在所有混合模型中,多分量框架的性能优于直接框架。总的来说,WT 的多分量框架在一定程度上可以提高独立模型的预测精度,而直接框架则没有明显的优势。