Suppr超能文献

决策树(DT)算法和极限学习机(ELM)模型在预测上格林河流域水质方面的性能比较。

Comparison of the performance of decision tree (DT) algorithms and extreme learning machine (ELM) model in the prediction of water quality of the Upper Green River watershed.

机构信息

Department of Civil Engineering, Birla Institute of Technology and Science, Pilani, Hyderabad Campus, Hyderabad, Telangana, India.

出版信息

Water Environ Res. 2021 Nov;93(11):2360-2373. doi: 10.1002/wer.1642. Epub 2021 Oct 4.

Abstract

Stream waters play a crucial role in catering to the world's needs with the required quality of water. Due to the discharges of wastewater from the various point and nonpoint sources, most of the watersheds are contaminated easily. The Upper Green River watershed in Kentucky, USA, is one such watershed that is contaminated over the years due to the runoff from rural areas and agricultural lands and combined sewer overflows (CSOs) from urban areas. Monitoring and characterizing the water quality status of streams in such watersheds has become of great importance, with multivariate statistical techniques such as regression, factor analysis, cluster analysis, and artificial intelligence methods such as artificial neural networks (ANNs). The water quality parameters, namely, fecal coliform (FC), turbidity, pH, and conductivity have been predicted quantitatively using ANNs to understand the water quality status of streams in the Upper Green River watershed elsewhere. In this study, a novel attempt has been made to predict the status of the quality of the Green River water with the predictive capabilities of a few decision tree (DT) algorithms such as classification and regression tree (CART) model, multivariate adaptive regression splines (MARS) model, random forest (RF) model, and extreme learning machine (ELM) model. The RF model's performance is better in predicting FC, turbidity, and pH than CART models in training and testing phases. Relatively, MARS and ELM models did better in testing though the performance is poorer in training. For example, we obtain the RMSE values of 2206, 2532, 1533, and 1969 using RF, CART, MARS, and ELM for FC in testing. A good correlation has been observed between conductivity and temperature, precipitation, and land-use factors for the MARS model. Overall, DT models are helpful in understanding, interpreting the outcomes, and visualizing the results compared with the other models. PRACTITIONER POINTS: The prediction of stream water quality parameters using decision trees is explored. The climate and land use parameters are used as input parameters to the modeling. The DT models of CART, MARS, RF, and ANNs such as ELM are explored to predict stream water quality. The RF model shows stable results compared with CART, MARS, and ELM for the data explored. Apart from the R value, RMSE and MAE indicate the effectiveness of DTs in prediction.

摘要

河流在提供所需水质方面发挥着至关重要的作用。由于各种点源和非点源废水的排放,大多数流域很容易受到污染。美国肯塔基州的格林河上游流域就是这样一个流域,由于农村和农业地区的径流以及城市地区的合流污水(CSO),该流域多年来一直受到污染。监测和描述这些流域中溪流的水质状况变得非常重要,可采用多元统计技术(如回归、因子分析、聚类分析)和人工智能方法(如人工神经网络 (ANNs))。已经使用 ANNs 定量预测水质参数,如粪大肠菌群 (FC)、浊度、pH 值和电导率,以了解格林河上游流域中溪流的水质状况。在本研究中,尝试使用一些决策树 (DT) 算法(如分类回归树 (CART) 模型、多元自适应回归样条 (MARS) 模型、随机森林 (RF) 模型和极限学习机 (ELM) 模型)的预测能力来预测绿河水水质状况。在训练和测试阶段,RF 模型在预测 FC、浊度和 pH 值方面的性能优于 CART 模型。相对而言,MARS 和 ELM 模型在测试阶段表现较好,尽管在训练阶段表现较差。例如,我们在测试中使用 RF、CART、MARS 和 ELM 分别获得 FC 的 RMSE 值为 2206、2532、1533 和 1969。对于 MARS 模型,观察到电导率与温度、降水和土地利用因素之间存在良好的相关性。总体而言,与其他模型相比,DT 模型有助于理解、解释结果并可视化结果。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验