利用新型混合机器学习算法提高水质指数预测精度。

Improving prediction of water quality indices using novel hybrid machine-learning algorithms.

机构信息

Geographic Information Science Research Group, Ton Duc Thang University, Ho Chi Minh City, Viet Nam; Faculty of Environment and Labour Safety, Ton Duc Thang University, Ho Chi Minh City, Viet Nam.

School of Engineering, University of Guelph, Guelph, Canada.

出版信息

Sci Total Environ. 2020 Jun 15;721:137612. doi: 10.1016/j.scitotenv.2020.137612. Epub 2020 Mar 3.

DOI:10.1016/j.scitotenv.2020.137612

PMID:32169637

Abstract

River water quality assessment is one of the most important tasks to enhance water resources management plans. A water quality index (WQI) considers several water quality variables simultaneously. Traditionally WQI calculations consume time and are often fraught with errors during derivations of sub-indices. In this study, 4 standalone (random forest (RF), M5P, random tree (RT), and reduced error pruning tree (REPT)) and 12 hybrid data-mining algorithms (combinations of standalones with bagging (BA), CV parameter selection (CVPS) and randomizable filtered classification (RFC)) were used to create Iran WQI (IRWQI) predictions. Six years (2012 to 2018) of monthly data from two water quality monitoring stations within the Talar catchment were compiled. Using Pearson correlation coefficients, 10 different input combinations were constructed. The data were divided into two groups (ratio 70:30) for model building (training dataset) and model validation (testing dataset) using a 10-fold cross-validation technique. The models were evaluated using several statistical and visual evaluation metrics. Result show that fecal coliform (FC) and total solids (TS) had the greatest and least effect on the prediction of IRWQI. The best input combinations varied among the algorithms; generally variables with very low correlations displayed weaker performance. Hybrid algorithms improved the prediction power of several of the standalone models, but not all. Hybrid BA-RT outperformed the other models (R = 0.941, RMSE = 2.71, MAE = 1.87, NSE = 0.941, PBIAS = 0.500). PBIAS indicated that all algorithms, with the exceptions of RT, BA-RT and CVPS-REPT, overestimated WQI values.

摘要

河流水质评估是增强水资源管理计划的最重要任务之一。水质指数 (WQI) 同时考虑多个水质变量。传统上，WQI 计算既耗时又容易在子指数推导过程中出错。在这项研究中，使用了 4 个独立的（随机森林 (RF)、M5P、随机树 (RT) 和简化错误修剪树 (REPT)）和 12 个混合数据挖掘算法（独立算法与袋装 (BA)、交叉验证参数选择 (CVPS) 和可随机化过滤分类 (RFC) 的组合）来创建伊朗水质指数 (IRWQI) 预测。编译了塔尔勒流域内两个水质监测站的六年（2012 年至 2018 年）的每月数据。使用皮尔逊相关系数，构建了 10 种不同的输入组合。使用 10 折交叉验证技术将数据分为两组（比例为 70:30），用于模型构建（训练数据集）和模型验证（测试数据集）。使用多种统计和可视化评估指标对模型进行评估。结果表明，粪大肠菌群 (FC) 和总固体 (TS) 对 IRWQI 的预测影响最大和最小。最佳输入组合因算法而异；通常，相关性非常低的变量表现较弱。混合算法提高了几个独立模型的预测能力，但并非全部。混合 BA-RT 优于其他模型（R=0.941、RMSE=2.71、MAE=1.87、NSE=0.941、PBIAS=0.500）。PBIAS 表明，除了 RT、BA-RT 和 CVPS-REPT 之外，所有算法都高估了 WQI 值。

相似文献

Improving prediction of water quality indices using novel hybrid machine-learning algorithms.利用新型混合机器学习算法提高水质指数预测精度。

Sci Total Environ. 2020 Jun 15;721:137612. doi: 10.1016/j.scitotenv.2020.137612. Epub 2020 Mar 3.

AI-driven predictions of geophysical river flows with vegetation.利用植被进行人工智能驱动的地球物理河流水流预测。

Sci Rep. 2024 Jul 16;14(1):16368. doi: 10.1038/s41598-024-67269-2.

Robust machine learning algorithms for predicting coastal water quality index.用于预测沿海水质指数的稳健机器学习算法。

J Environ Manage. 2022 Nov 1;321:115923. doi: 10.1016/j.jenvman.2022.115923. Epub 2022 Aug 19.

Applications of various data-driven models for the prediction of groundwater quality index in the Akot basin, Maharashtra, India.应用各种数据驱动模型预测印度马哈拉施特拉邦阿科特盆地的地下水质量指数。

Environ Sci Pollut Res Int. 2022 Mar;29(12):17591-17605. doi: 10.1007/s11356-021-17064-7. Epub 2021 Oct 20.

Prediction of white blood cell count during exercise: a comparison between standalone and hybrid intelligent algorithms.运动期间白细胞计数的预测：独立智能算法与混合智能算法的比较。

Sci Rep. 2024 Sep 5;14(1):20683. doi: 10.1038/s41598-024-71576-z.

Comparison of the performance of decision tree (DT) algorithms and extreme learning machine (ELM) model in the prediction of water quality of the Upper Green River watershed.决策树（DT）算法和极限学习机（ELM）模型在预测上格林河流域水质方面的性能比较。

Water Environ Res. 2021 Nov;93(11):2360-2373. doi: 10.1002/wer.1642. Epub 2021 Oct 4.

Predictive modeling of selected trace elements in groundwater using hybrid algorithms of iterative classifier optimizer.利用迭代分类器优化器的混合算法对地下水中选定微量元素进行预测建模。

J Contam Hydrol. 2021 Oct;242:103849. doi: 10.1016/j.jconhyd.2021.103849. Epub 2021 Jun 12.

Extreme learning machines: a new approach for modeling dissolved oxygen (DO) concentration with and without water quality variables as predictors.极限学习机：一种以水质变量作为预测因子或不使用水质变量来建模溶解氧（DO）浓度的新方法。

Environ Sci Pollut Res Int. 2017 Jul;24(20):16702-16724. doi: 10.1007/s11356-017-9283-z. Epub 2017 May 30.

A hybrid air quality early-warning framework: An hourly forecasting model with online sequential extreme learning machines and empirical mode decomposition algorithms.一种混合空气质量预警框架：基于在线序贯极端学习机和经验模态分解算法的逐时预测模型。

Sci Total Environ. 2020 Mar 20;709:135934. doi: 10.1016/j.scitotenv.2019.135934. Epub 2019 Dec 10.

Prediction of weighted arithmetic water quality index for urban water quality using ensemble machine learning model.基于集成机器学习模型的城市水质加权算术水质指数预测

Chemosphere. 2024 Mar;352:141393. doi: 10.1016/j.chemosphere.2024.141393. Epub 2024 Feb 5.

引用本文的文献

Application of an improved LSTM model based on FECA and CEEMDAN VMD decomposition in water quality prediction.基于FECA和CEEMDAN-VMD分解的改进LSTM模型在水质预测中的应用

Sci Rep. 2025 Apr 14;15(1):12847. doi: 10.1038/s41598-025-96941-4.

Predictive modeling of climate change impacts using Artificial Intelligence: a review for equitable governance and sustainable outcome.利用人工智能对气候变化影响进行预测建模：关于公平治理与可持续成果的综述

Environ Sci Pollut Res Int. 2025 Apr;32(17):10705-10724. doi: 10.1007/s11356-025-36356-w. Epub 2025 Apr 4.

Relationship between Biological and Qualitative Indices in Surface Waters Receiving the Effluent of Fish Farms in the Northwest of Iran.伊朗西北部接收养鱼场废水的地表水中生物指标与定性指标之间的关系

J Arthropod Borne Dis. 2024 Jun 30;18(2):157-171. doi: 10.18502/jad.v18i2.17539. eCollection 2024 Jun.

Sci Rep. 2024 Sep 5;14(1):20683. doi: 10.1038/s41598-024-71576-z.

Water quality constrained adjustment planning for regional breeding management with nonlinear programming model under uncertainty in Wenchang City, China.基于非线性规划模型的不确定性条件下中国文昌市区域养殖管理水质约束调整规划

Heliyon. 2024 Aug 5;10(16):e35347. doi: 10.1016/j.heliyon.2024.e35347. eCollection 2024 Aug 30.

Assessing and predicting water quality index with key water parameters by machine learning models in coastal cities, China.利用机器学习模型通过关键水质参数评估和预测中国沿海城市的水质指数

Heliyon. 2024 Jun 27;10(13):e33695. doi: 10.1016/j.heliyon.2024.e33695. eCollection 2024 Jul 15.

Stacked hybridization to enhance the performance of artificial neural networks (ANN) for prediction of water quality index in the Bagh river basin, India.堆叠杂交以增强人工神经网络（ANN）在印度巴格河流域水质指数预测中的性能。

Heliyon. 2024 May 11;10(10):e31085. doi: 10.1016/j.heliyon.2024.e31085. eCollection 2024 May 30.

Toward Nano- and Microplastic Sensors: Identification of Nano- and Microplastic Particles via Artificial Intelligence Combined with a Plasmonic Probe Functionalized with an Estrogen Receptor.迈向纳米和微塑料传感器：通过人工智能结合经雌激素受体功能化的等离子体探针识别纳米和微塑料颗粒

ACS Omega. 2024 Apr 18;9(17):18984-18994. doi: 10.1021/acsomega.3c09485. eCollection 2024 Apr 30.

Application of machine learning techniques to predict groundwater quality in the Nabogo Basin, Northern Ghana.运用机器学习技术预测加纳北部纳博戈盆地的地下水水质。

Heliyon. 2024 Mar 30;10(7):e28527. doi: 10.1016/j.heliyon.2024.e28527. eCollection 2024 Apr 15.

Advances in machine learning and IoT for water quality monitoring: A comprehensive review.用于水质监测的机器学习与物联网进展：全面综述

Heliyon. 2024 Mar 13;10(6):e27920. doi: 10.1016/j.heliyon.2024.e27920. eCollection 2024 Mar 30.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

利用新型混合机器学习算法提高水质指数预测精度。

Improving prediction of water quality indices using novel hybrid machine-learning algorithms.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献