Tiwari Nand Kumar, Panwar Dinesh
Department of Civil Engineering, National Institute of Technology Kurukshetra, Haryana 136119, India E-mail:
Department of Civil Engineering, National Institute of Technology Kurukshetra, Haryana 136119, India.
Water Sci Technol. 2024 Dec;90(12):3210-3240. doi: 10.2166/wst.2024.393.
This study optimizes standard oxygen transfer efficiency (SOTE) in Venturi flumes investigating the impact of key parameters such as discharge per unit width (), throat width (), throat length (), upstream entrance width (), and gauge readings ( and ). To achieve this, a comprehensive experimental dataset was analyzed using multiple linear regression (MLR), multiple nonlinear regression (MNLR), gradient boosting machine (GBM), extreme gradient boosting (XRT), random forest (RF), M5 (pruned and unpruned), random tree (RT), and reduced error pruning (REP). Model performance was evaluated based on key metrics: correlation coefficient (CC), root mean square error (RMSE), and mean absolute error (MAE). Among the proposed models, M5_Unprun emerged as the top performer, exhibiting the highest CC (0.9455), the lowest RMSE (0.1918), and the lowest MAE (0.0030). GBM followed closely with a CC value of 0.9372, an RMSE value of 0.2067, and an MAE value of 0.0006. Uncertainty analysis further solidified the superior performance of M5_Unpruned (0.7522) and GBM (0.8055), with narrower prediction bands compared to other models, including MLR, which exhibited the widest band (1.4320). One-way analysis of variance confirmed the reliability and robustness of the proposed models. Sensitivity, correlation, and SHapley Additive exPlanations analyses identified and as the most influencing factors.
本研究通过研究单位宽度流量()、喉道宽度()、喉道长度()、上游入口宽度()以及仪表读数(和)等关键参数的影响,对文丘里水槽中的标准氧传递效率(SOTE)进行了优化。为此,使用多元线性回归(MLR)、多元非线性回归(MNLR)、梯度提升机(GBM)、极端梯度提升(XRT)、随机森林(RF)、M5(剪枝和未剪枝)、随机树(RT)和减少误差剪枝(REP)对一个综合实验数据集进行了分析。基于关键指标评估模型性能:相关系数(CC)、均方根误差(RMSE)和平均绝对误差(MAE)。在所提出的模型中,未剪枝M5表现最佳,具有最高的CC(0.9455)、最低的RMSE(0.1918)和最低的MAE(0.0030)。GBM紧随其后,CC值为0.9372,RMSE值为0.2067,MAE值为0.0006。不确定性分析进一步证实了未剪枝M5(0.7522)和GBM(0.8055)的卓越性能,与包括MLR在内的其他模型相比,其预测带更窄,MLR的预测带最宽(1.4320)。单因素方差分析证实了所提出模型的可靠性和稳健性。敏感性、相关性和SHapley加性解释分析确定和为最具影响的因素。