Department of Civil and Mechanical Engineering (DICEM), University of Cassino and Southern Lazio, Via Di Biasio, 43, 03043 Cassino, Frosinone, Italy.
College of Hydraulic Science and Engineering, Yangzhou University, Yangzhou, China.
Sci Total Environ. 2023 Sep 10;890:164323. doi: 10.1016/j.scitotenv.2023.164323. Epub 2023 May 20.
Lake surface water temperature is one of the most important physical and ecological indices of lakes, which has frequently been used as the indicator to evaluate the impact of climate change on lakes. Knowing the dynamics of lake surface water temperature is thus of great significance. The past decades have witnessed the development of different modeling tools to forecast lake surface water temperature, yet, simple models with fewer input variables, while maintaining high forecasting accuracy are scarce. Impact of forecast horizons on model performance has seldom been investigated. To fill the gap, in this study, a novel machine learning algorithm by stacking multilayer perceptron and random forest (MLP-RF) was employed to forecast daily lake surface water temperature using daily air temperature as the exogenous input variable, with the Bayesian Optimization procedure applied for tuning the hyperparameters. Prediction models were developed using long-term observed data from eight Polish lakes. The MLP-RF stacked model showed very good forecasting capabilities for all lakes and forecast horizons, far better than shallow multilayer perceptron neural network, a model coupling wavelet transform and multilayer perceptron neural network, non-linear regression and air2water models. A reduction in model performance was observed as the forecast horizon increased. However, the model also performs well with a forecast horizon of several days (e.g., 7 days ahead, testing stage: R - [0.932, 0.990], RMSE °C - [0.77, 1.83], MAE °C - [0.55, 1.38]). In addition, the MLP-RF stacked model has proven to be reliable for both intermediate temperatures and minimum and maximum peaks. The model proposed in this study will be useful to the scientific community in predicting lake surface water temperature, thus contributing to studies on such sensitive aquatic ecosystems as lakes.
湖泊表面水温是湖泊最重要的物理和生态指标之一,常被用来评估气候变化对湖泊的影响。因此,了解湖泊表面水温的动态变化具有重要意义。过去几十年见证了不同的建模工具的发展,用于预测湖泊表面水温,但具有较少输入变量而保持较高预测精度的简单模型却很少。对预测范围对模型性能的影响的研究很少。为了填补这一空白,本研究采用堆叠多层感知机和随机森林(MLP-RF)的新型机器学习算法,使用每日气温作为外生输入变量来预测每日湖泊表面水温,应用贝叶斯优化程序调整超参数。使用来自波兰 8 个湖泊的长期观测数据开发预测模型。MLP-RF 堆叠模型对所有湖泊和预测范围的预测能力都非常出色,远远优于浅层多层感知机神经网络、结合小波变换和多层感知机神经网络、非线性回归和空气-水模型的模型。随着预测范围的增加,模型性能有所下降。然而,该模型在预测范围为几天(例如,提前 7 天,测试阶段:R - [0.932, 0.990],RMSE °C - [0.77, 1.83],MAE °C - [0.55, 1.38])时也表现良好。此外,MLP-RF 堆叠模型已被证明对中等温度以及最小和最大峰值都可靠。本研究提出的模型将有助于科学界预测湖泊表面水温,从而为湖泊等敏感水生生态系统的研究做出贡献。