Li Jintao, Ai Ping, Xiong Chuansheng, Song Yanhong
College of Computer Science and Software Engineering, Hohai University, Nanjing, 211100, China.
College of Hydrology and Water Resources, Hohai University, Nanjing, 210098, China.
Sci Rep. 2025 Apr 27;15(1):14732. doi: 10.1038/s41598-025-00115-1.
Accurate medium- to long-term runoff forecasting is crucial for flood control, drought resilience, water resources development, and ecological improvement. Traditional statistical methods struggle to utilize multifaceted variable information, leading to lower prediction accuracy. This study introduces two innovative coupled models-SRA-SVR and SRA-MLPR-to enhance runoff prediction by leveraging the strengths of statistical and deep learning approaches. Stepwise Regression Analysis (SRA) was employed to effectively handle high-dimensional data and multicollinearity, ensuring that only the most influential predictive variables were retained. Support Vector Regression (SVR) and Multi-Layer Perceptron Regression (MLPR) were chosen due to their strong adaptability in capturing nonlinear relationships and extracting latent hydrological patterns. The integration of these methods significantly improves prediction accuracy and model stability. By integrating 80 atmospheric circulation indices as teleconnection variables, the models tackle critical challenges such as high-dimensional data, multicollinearity, and nonlinear hydrological dynamics. The Yalong River Basin, characterized by complex hydrological processes and diverse climatic influences, serves as the case study for model validation. The results show that: (1) Compared to baseline single models, the SRA-MLPR model reduced RMSE (from 798.47 to 594.45) by 26% and MAPE (from 34.79 to 22.90%) by 34%, while achieving an NSE (from 0.67 to 0.76) improvement of 13%, particularly excelling in extreme runoff scenarios. (2) The inclusion of teleconnection indices not only enriched the predictive feature set but also improved model stability, with the SRA-MLPR demonstrating enhanced capability in capturing latent nonlinear relationships. (3) A one-month lag in atmospheric circulation indices was identified as the optimal predictor for basin-scale runoff, providing actionable insights into temporal runoff dynamics. (4) To enhance model interpretability, SHAP (SHapley Additive exPlanations) analysis was employed to quantify the contribution of atmospheric circulation indices to runoff predictions, revealing the dominant climate drivers and their nonlinear interactions. The results indicate that the Northern Hemisphere Polar Vortex and the East Asian Trough exert significant control over runoff dynamics, with their influence modulated by large-scale climate oscillations such as the North Atlantic Oscillation (NAO) and Pacific Decadal Oscillation (PDO). (5) The models' scalability is validated through their modular design, allowing seamless adaptation to diverse hydrological contexts. Applications include improved flood forecasting, optimized reservoir operations, and adaptive water resource planning. Furthermore, the study demonstrates the potential of coupled models as generalizable tools for hydrological forecasting in basins with varying climatic and geographic conditions. This study highlights the potential of coupled models as robust and generalizable tools for hydrological forecasting across diverse climatic and geographic conditions. By integrating atmospheric circulation indices, the proposed models enhance runoff prediction accuracy and stability while offering valuable insights for flood prevention, drought mitigation, and adaptive water resource management. These methodological advancements bridge the gap between statistical and deep learning approaches, providing a scalable framework for accurate and interpretable hydrological, climatological, and environmental predictions. Given the escalating challenges brought about by climate change, the findings of this study make contributions to sustainable water management, interpretable decision-making support, and disaster preparedness at a global level.
准确的中长期径流预测对于防洪、抗旱、水资源开发和生态改善至关重要。传统统计方法难以利用多方面的变量信息,导致预测准确率较低。本研究引入了两种创新的耦合模型——SRA-SVR和SRA-MLPR,通过利用统计和深度学习方法的优势来提高径流预测能力。采用逐步回归分析(SRA)有效处理高维数据和多重共线性,确保仅保留最具影响力的预测变量。选择支持向量回归(SVR)和多层感知器回归(MLPR)是因为它们在捕捉非线性关系和提取潜在水文模式方面具有很强的适应性。这些方法的整合显著提高了预测准确率和模型稳定性。通过将80个大气环流指数作为遥相关变量进行整合,模型应对了高维数据、多重共线性和非线性水文动力学等关键挑战。以水文过程复杂且受多种气候影响的雅砻江流域作为模型验证的案例研究。结果表明:(1)与基线单一模型相比,SRA-MLPR模型的均方根误差(RMSE)从798.47降至594.45,降低了26%,平均绝对百分比误差(MAPE)从34.79降至22.90%,降低了34%,同时纳什效率系数(NSE)从0.67提高到0.76,提高了13%,在极端径流情景中表现尤为出色。(2)纳入遥相关指数不仅丰富了预测特征集,还提高了模型稳定性,SRA-MLPR在捕捉潜在非线性关系方面表现出更强的能力。(3)确定大气环流指数一个月的滞后时间是流域尺度径流的最佳预测指标,为径流的时间动态提供了可操作的见解。(4)为提高模型的可解释性,采用SHAP(SHapley Additive exPlanations)分析来量化大气环流指数对径流预测的贡献,揭示主要气候驱动因素及其非线性相互作用。结果表明,北半球极地涡旋和东亚大槽对径流动态有显著控制作用,其影响受北大西洋涛动(NAO)和太平洋年代际振荡(PDO)等大尺度气候振荡的调节。(5)通过模块化设计验证了模型的可扩展性,使其能够无缝适应不同的水文环境。应用包括改进洪水预报、优化水库运行和适应性水资源规划。此外,该研究证明了耦合模型作为不同气候和地理条件流域水文预报通用工具的潜力。本研究突出了耦合模型作为跨不同气候和地理条件进行水文预报的强大且通用工具的潜力。通过整合大气环流指数,所提出的模型提高了径流预测的准确率和稳定性,同时为防洪、抗旱和适应性水资源管理提供了有价值的见解。这些方法学进展弥合了统计和深度学习方法之间的差距,为准确且可解释的水文、气候和环境预测提供了一个可扩展的框架。鉴于气候变化带来的挑战不断升级,本研究的结果为全球可持续水资源管理、可解释的决策支持和灾害准备做出了贡献。