Rios Fuck João Vitor, Cechinel Maria Alice Prado, Neves Juliana, Campos de Andrade Rodrigo, Tristão Ricardo, Spogis Nicolas, Riella Humberto Gracher, Soares Cíntia, Padoin Natan
Hydroinfo - Hydroinformatics Solutions Ltda, Florianópolis, SC, Brazil; Laboratory of Materials and Scientific Computing (LabMAC), Department of Chemical and Food Engineering, Federal University of Santa Catarina (UFSC), Florianópolis, SC, Brazil.
Hydroinfo - Hydroinformatics Solutions Ltda, Florianópolis, SC, Brazil.
Chemosphere. 2024 Mar;352:141472. doi: 10.1016/j.chemosphere.2024.141472. Epub 2024 Feb 19.
Wastewater Treatment Plants (WWTPs) present complex biochemical processes of high variability and difficult prediction. This study presents an innovative approach using Machine Learning (ML) models to predict wastewater quality parameters. In particular, the models are applied to datasets from both a simulated wastewater treatment plant (WWTP), using DHI WEST software (WEST WWTP), and a real-world WWTP database from Santa Catarina Brewery AMBEV, located in Lages/SC - Brazil (AMBEV WWTP). A distinctive aspect is the evaluation of predictive performance in continuous data scenarios and the impact of changes in WWTP operations on predictive model performance, including changes in plant layout. For both plants, three different scenarios were addressed, and the quality of predictions by random forest (RF), support vector machine (SVM), and multilayer perceptron (MLP) models were evaluated. The prediction quality by the MLP model reached an R of 0.72 for TN prediction in the WEST WWTP output, and the RF model better adapted to the real data of the AMBEV WWTP, despite the significant discrepancy observed between the real and the predicted data. Techniques such as Partial Dependence Plots (PDP) and Permutation Importance (PI) were used to assess the importance of features, particularly in the simulated WEST tool scenario, showing a strong correlation of prediction results with influent parameters related to nitrogen content. The results of this study highlight the importance of collecting and storing high-quality data and the need for information on changes in WWTP operation for predictive model performance. These contributions advance the understanding of predictive modeling for wastewater quality and provide valuable insights for future practice in wastewater treatment.
污水处理厂(WWTPs)呈现出复杂且高度可变、难以预测的生化过程。本研究提出了一种使用机器学习(ML)模型来预测废水水质参数的创新方法。具体而言,这些模型被应用于来自模拟污水处理厂(使用DHI WEST软件,即WEST WWTP)以及位于巴西圣卡塔琳娜州拉热斯市的安贝夫啤酒厂的真实污水处理厂数据库(安贝夫WWTP)的数据集。一个独特的方面是评估连续数据场景下的预测性能以及污水处理厂运营变化对预测模型性能的影响,包括工厂布局的变化。对于这两个工厂,研究了三种不同的场景,并评估了随机森林(RF)、支持向量机(SVM)和多层感知器(MLP)模型的预测质量。MLP模型在WEST WWTP输出中对总氮(TN)预测的R值达到了0.72,尽管在真实数据和预测数据之间观察到了显著差异,但RF模型更能适应安贝夫WWTP的真实数据。诸如部分依赖图(PDP)和排列重要性(PI)等技术被用于评估特征的重要性,特别是在模拟的WEST工具场景中,结果表明预测结果与与氮含量相关的进水参数有很强的相关性。本研究结果突出了收集和存储高质量数据的重要性以及获取污水处理厂运营变化信息对预测模型性能的必要性。这些贡献增进了对废水水质预测建模的理解,并为未来污水处理实践提供了有价值的见解。