通过特征选择技术和预测模型的比较分析实现人工智能驱动的废水管理。

AI-driven wastewater management through comparative analysis of feature selection techniques and predictive models.

作者信息

Dikmen Faruk, Demir Ahmet, Özkaya Bestami, Raza Muhammad Owais, Rasheed Jawad, Asuroglu Tunc, Alsubai Shtwai

机构信息

Department of Environmental Engineering, Yildiz Technical University, 34220, Istanbul, Turkey.

Department of Civil Engineering, Istinye University, 34396, Istanbul, Turkey.

出版信息

Sci Rep. 2025 Jul 14;15(1):25347. doi: 10.1038/s41598-025-07124-0.

DOI:10.1038/s41598-025-07124-0

PMID:40659650

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12259835/

Abstract

The integration of artificial intelligence (AI) in wastewater treatment management offers a promising approach to optimizing effluent quality predictions and enhancing operational efficiency. This study evaluates the performance of machine learning models in predicting key wastewater effluent parameters Chemical Oxygen Demand (COD), Biochemical Oxygen Demand (BOD), Total Suspended Solids (TSS), Total Effluent Nitrogen and Total Effluent Phosphorus. Three feature selection techniques were applied: SelectKBest, Mutual Information, and Recursive Feature Elimination (RFE) using Random Forest to identify the most significant predictors. The study leveraged ensemble learning models, including XGBoost, Random Forest, Gradient Boosting, and LightGBM, and compared them with Decision Tree models. The results demonstrate that effluent volatile suspended solids (VSS) consistently held the highest predictive importance across all feature selection methods. Ensemble models significantly outperformed Decision Trees, with Gradient Boosting achieving the best predictive accuracy for TSS and total nitrogen (Mean Absolute Error (MAE): 3.667 [Formula: see text]: 97.53), XGBoost excelling in COD prediction with MAE and [Formula: see text] of 6.251 and 83. 41%, respectively, and XGBoost showing superior performance for BOD (MAE: 1.589 [Formula: see text]:79.64%). LightGBM yielded the highest precision in predicting total phosphate with MAE and a [Formula: see text] score of 0.230 and 28. 68%, respectively. Decision tree models consistently underperformed, exhibiting the highest error rates. These findings highlight the potential of AI-driven approaches in wastewater management to improve decision-making, regulatory compliance, and resource efficiency. However, limitations such as operational irregularities and seasonal variations remain challenges for further refinement.

摘要

人工智能（AI）在污水处理管理中的整合为优化出水水质预测和提高运营效率提供了一种很有前景的方法。本研究评估了机器学习模型在预测关键废水排放参数化学需氧量（COD）、生化需氧量（BOD）、总悬浮固体（TSS）、总排放氮和总排放磷方面的性能。应用了三种特征选择技术：使用随机森林的SelectKBest、互信息和递归特征消除（RFE），以识别最重要的预测因子。该研究利用了集成学习模型，包括XGBoost、随机森林、梯度提升和LightGBM，并将它们与决策树模型进行了比较。结果表明，在所有特征选择方法中，出水挥发性悬浮固体（VSS）始终具有最高的预测重要性。集成模型明显优于决策树，梯度提升在TSS和总氮预测方面达到了最佳预测精度（平均绝对误差（MAE）：3.667 [公式：见原文]；[公式：见原文]：97.53），XGBoost在COD预测方面表现出色，MAE为6.251，[公式：见原文]为83.41%，并且在BOD预测方面表现卓越（MAE：1.589 [公式：见原文]：79.64%）。LightGBM在预测总磷方面具有最高的精度，MAE和[公式：见原文]得分分别为0.230和28.68%。决策树模型始终表现不佳，错误率最高。这些发现凸显了人工智能驱动方法在废水管理中改善决策、合规监管和资源效率的潜力。然而，诸如运行不规则和季节变化等限制仍然是进一步优化的挑战。

相似文献

AI-driven wastewater management through comparative analysis of feature selection techniques and predictive models.

Sci Rep. 2025 Jul 14;15(1):25347. doi: 10.1038/s41598-025-07124-0.

Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.

J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.

Machine learning-based optimization of biogas and methane yields in UASB reactors for treating domestic wastewater.

Biodegradation. 2025 Jun 26;36(4):55. doi: 10.1007/s10532-025-10152-2.

Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.

Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.

Explainable AI-driven prediction of APE1 inhibitors: enhancing cancer therapy with machine learning models and feature importance analysis.

Mol Divers. 2025 Feb 21. doi: 10.1007/s11030-025-11133-6.

Proposal for Using AI to Assess Clinical Data Integrity and Generate Metadata: Algorithm Development and Validation.

JMIR Med Inform. 2025 Jun 30;13:e60204. doi: 10.2196/60204.

Gaps in Artificial Intelligence Research for Rural Health in the United States: A Scoping Review.

medRxiv. 2025 Jun 27:2025.06.26.25330361. doi: 10.1101/2025.06.26.25330361.

Interpretable Machine Learning for Serum-Based Metabolomics in Breast Cancer Diagnostics: Insights from Multi-Objective Feature Selection-Driven LightGBM-SHAP Models.

Medicina (Kaunas). 2025 Jun 19;61(6):1112. doi: 10.3390/medicina61061112.

Enhanced wind power forecasting using machine learning, deep learning models and ensemble integration.

Sci Rep. 2025 Jul 1;15(1):20572. doi: 10.1038/s41598-025-05250-3.

A novel double machine learning approach for detecting early breast cancer using advanced feature selection and dimensionality reduction techniques.

Sci Rep. 2025 Jul 2;15(1):22971. doi: 10.1038/s41598-025-06426-7.

本文引用的文献

Enhancing effluent quality prediction in wastewater treatment plants through the integration of factor analysis and machine learning.

Bioresour Technol. 2024 Feb;393:130008. doi: 10.1016/j.biortech.2023.130008. Epub 2023 Nov 18.

Artificial intelligence in wastewater treatment: A data-driven analysis of status and trends.

Chemosphere. 2023 Sep;336:139163. doi: 10.1016/j.chemosphere.2023.139163. Epub 2023 Jun 6.

A relationship between phages and organic carbon in wastewater treatment plant effluents.

Water Res X. 2022 Jun 16;16:100146. doi: 10.1016/j.wroa.2022.100146. eCollection 2022 Aug 1.

Data-Driven Machine Learning in Environmental Pollution: Gains and Problems.

Environ Sci Technol. 2022 Feb 15;56(4):2124-2133. doi: 10.1021/acs.est.1c06157. Epub 2022 Jan 27.

IoT enabled environmental toxicology for air pollution monitoring using AI techniques.

Environ Res. 2022 Apr 1;205:112574. doi: 10.1016/j.envres.2021.112574. Epub 2021 Dec 15.

Application of machine learning in anaerobic digestion: Perspectives and challenges.

Bioresour Technol. 2022 Feb;345:126433. doi: 10.1016/j.biortech.2021.126433. Epub 2021 Nov 27.

Enriched Catalytic Activity of TiO Nanoparticles Supported by Activated Carbon for Noxious Pollutant Elimination.

Nanomaterials (Basel). 2021 Oct 22;11(11):2808. doi: 10.3390/nano11112808.

A machine learning framework to improve effluent quality control in wastewater treatment plants.

Sci Total Environ. 2021 Aug 25;784:147138. doi: 10.1016/j.scitotenv.2021.147138. Epub 2021 Apr 16.

Prediction of stream nitrogen and phosphorus concentrations from high-frequency sensors using Random Forests Regression.

Sci Total Environ. 2021 Apr 1;763:143005. doi: 10.1016/j.scitotenv.2020.143005. Epub 2020 Oct 20.

A Monte Carlo-based integrated model to optimize the cost and pollution reduction in wastewater treatment processes in a typical comprehensive industrial park in China.

Sci Total Environ. 2019 Jan 10;647:1-10. doi: 10.1016/j.scitotenv.2018.07.358. Epub 2018 Jul 26.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

通过特征选择技术和预测模型的比较分析实现人工智能驱动的废水管理。

AI-driven wastewater management through comparative analysis of feature selection techniques and predictive models.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献