Kumar Sanjit, Agarwal Mayank, Deshpande Vishal, Cooper James R, Khosravi Khabat, Rathnayake Namal, Hoshino Yukinobu, Kantamaneni Komali, Rathnayake Upaka
Indian Institute of Technology Patna, Patna, India.
University of Liverpool, Liverpool, UK.
Sci Rep. 2024 Jul 16;14(1):16368. doi: 10.1038/s41598-024-67269-2.
In river research, forecasting flow velocity accurately in vegetated channels is a significant challenge. The forecasting performance of various independent and hybrid machine learning (ML) models are thus quantified for the first time in this work. Utilizing flow velocity measurements in both natural and laboratory flume experiments, we assess the efficacy of four distinct standalone machine learning techniques-Kstar, M5P, reduced error pruning tree (REPT) and random forest (RF) models. In addition, we also test for eight types of hybrid ML algorithms trained with an Additive Regression (AR) and Bagging (BA) (AR-Kstar, AR-M5P, AR-REPT, AR-RF, BA-Kstar, BA-M5P, BA-REPT and BA-RF). Findings from a comparison of their predictive capabilities, along with a sensitivity analysis of the influencing factors, indicated: (1) Vegetation height emerged as the most sensitive parameter for determining the flow velocity; (2) all ML models displayed outperforming empirical equations; (3) nearly all ML algorithms worked optimal when the model was built using all of the input parameters. Overall, the findings showed that hybrid ML algorithms outperform regular ML algorithms and empirical equations at forecasting flow velocity. AR-M5P (R = 0.954, R = 0.977, NSE = 0.954, MAE = 0.042, MSE = 0.003, and PBias = 1.466) turned out to be the optimal model for forecasting of flow velocity in vegetated-rivers.
在河流研究中,准确预测植被河道中的流速是一项重大挑战。因此,在本研究中首次对各种独立和混合机器学习(ML)模型的预测性能进行了量化。利用自然和实验室水槽实验中的流速测量数据,我们评估了四种不同的独立机器学习技术——Kstar、M5P、简化误差剪枝树(REPT)和随机森林(RF)模型的有效性。此外,我们还测试了八种用加法回归(AR)和装袋法(BA)训练的混合ML算法(AR-Kstar、AR-M5P、AR-REPT、AR-RF、BA-Kstar、BA-M5P、BA-REPT和BA-RF)。通过比较它们的预测能力以及对影响因素的敏感性分析得出:(1)植被高度是确定流速最敏感的参数;(2)所有ML模型的表现均优于经验方程;(3)当使用所有输入参数构建模型时,几乎所有ML算法的效果最佳。总体而言,研究结果表明,在预测流速方面,混合ML算法优于常规ML算法和经验方程。AR-M5P(R = 0.954,R = 0.977,NSE = 0.954,MAE = 0.042,MSE = 0.003,PBias = 1.466)被证明是预测植被河道流速的最佳模型。