Department of Civil and Environ Engineering, University of Wisconsin-Milwaukee, Milwaukee, Wisconsin, USA.
Water Environ Res. 2024 Oct;96(10):e11136. doi: 10.1002/wer.11136.
This study investigates the use of machine learning (ML) models for wastewater treatment plant (WWTP) sludge predictions and explainable artificial intelligence (XAI) techniques for understanding the impact of variables behind the prediction. Three ML models, random forest (RF), gradient boosting machine (GBM), and gradient boosting tree (GBT), were evaluated for their performance using statistical indicators. Input variable combinations were selected through different feature selection (FS) methods. XAI techniques were employed to enhance the interpretability and transparency of ML models. The results suggest that prediction accuracy depends on the choice of model and the number of variables. XAI techniques were found to be effective in interpreting the decisions made by each ML model. This study provides an example of using ML models in sludge production prediction and interpreting models applying XAI to understand the factors influencing it. Understandable interpretation of ML model prediction can facilitate targeted interventions for process optimization and improve the efficiency and sustainability of wastewater treatment processes. PRACTITIONER POINTS: Explainable artificial intelligence can play a crucial role in promoting trust between machine learning models and their real-world applications. Widely practiced machine learning models were used to predict sludge production of a United States wastewater treatment plant. Feature selection methods can reduce the required number of input variables without compromising model accuracy. Explainable artificial intelligence techniques can explain driving variables behind machine learning prediction.
本研究探讨了机器学习 (ML) 模型在污水处理厂 (WWTP) 污泥预测中的应用,以及可解释人工智能 (XAI) 技术在理解预测背后变量影响方面的应用。使用统计指标评估了随机森林 (RF)、梯度提升机 (GBM) 和梯度提升树 (GBT) 这三种 ML 模型的性能。通过不同的特征选择 (FS) 方法选择输入变量组合。XAI 技术用于提高 ML 模型的可解释性和透明度。结果表明,预测准确性取决于模型的选择和变量的数量。XAI 技术被发现可以有效地解释每个 ML 模型的决策。本研究提供了一个在污泥产量预测中使用 ML 模型的示例,并通过应用 XAI 来解释模型,以了解影响它的因素。对 ML 模型预测的可理解解释可以促进针对过程优化的有针对性干预,并提高废水处理过程的效率和可持续性。
可解释人工智能可以在促进机器学习模型与其实际应用之间的信任方面发挥关键作用。
常用的机器学习模型被用于预测美国一家污水处理厂的污泥产量。
特征选择方法可以在不影响模型准确性的情况下减少所需的输入变量数量。
可解释人工智能技术可以解释机器学习预测背后的驱动变量。