School of Civil, Mining and Environmental Engineering, University of Wollongong, Australia; Illawarra Health and Medical Research Institute (IHMRI), University of Wollongong, Wollongong, Australia.
Environmental and industrial group, Urban utilities, Queensland, Pinkenba, Australia.
Sci Total Environ. 2021 Oct 1;789:147947. doi: 10.1016/j.scitotenv.2021.147947. Epub 2021 May 23.
Wastewater-based epidemiology (WBE) has been regarded as a potential tool for the prevalence estimation of coronavirus disease 2019 (COVID-19) in the community. However, the application of the conventional back-estimation approach is currently limited due to the methodological challenges and various uncertainties. This study systematically performed meta-analysis for WBE datasets and investigated the use of data-driven models for the COVID-19 community prevalence in lieu of the conventional WBE back-estimation approach. Three different data-driven models, i.e. multiple linear regression (MLR), artificial neural network (ANN), and adaptive neuro fuzzy inference system (ANFIS) were applied to the multi-national WBE dataset. To evaluate the robustness of these models, predictions for sixteen scenarios with partial inputs were compared against the actual prevalence reports from clinical testing. The performance of models was further validated using unseen data (data sets not included for establishing the model) from different stages of the COVID-19 outbreak. Generally, ANN and ANFIS models showed better accuracy and robustness over MLR models. Air and wastewater temperature played a critical role in the prevalence estimation by data-driven models, especially MLR models. With unseen datasets, ANN model reasonably estimated the prevalence of COVID-19 (cumulative cases) at the initial phase and forecasted the upcoming new cases in 2-4 days at the post-peak phase of the COVID-19 outbreak. This study provided essential information about the feasibility and accuracy of data-driven estimation of COVID-19 prevalence through the WBE approach.
基于污水的流行病学(WBE)已被视为一种估算社区中 2019 年冠状病毒病(COVID-19)流行率的潜在工具。然而,由于方法学挑战和各种不确定性,目前常规的反推方法的应用受到限制。本研究系统地对 WBE 数据集进行了荟萃分析,并研究了使用数据驱动模型来替代常规 WBE 反推方法估算 COVID-19 社区流行率。本研究应用了三种不同的数据驱动模型,即多元线性回归(MLR)、人工神经网络(ANN)和自适应神经模糊推理系统(ANFIS),对多国 WBE 数据集进行了分析。为了评估这些模型的稳健性,将十六种部分输入预测与实际临床检测报告的流行率进行了比较。使用来自 COVID-19 爆发不同阶段的未见数据集(未用于建立模型的数据)进一步验证了模型的性能。一般来说,ANN 和 ANFIS 模型比 MLR 模型具有更好的准确性和稳健性。空气和污水温度在数据驱动模型的流行率估算中起着关键作用,尤其是在 MLR 模型中。使用未见数据集,ANN 模型在 COVID-19 爆发的初始阶段合理地估计了 COVID-19 的流行率(累积病例),并在 COVID-19 爆发的高峰期后 2-4 天预测即将出现的新病例。本研究通过 WBE 方法提供了有关 COVID-19 流行率数据驱动估计的可行性和准确性的重要信息。