Armenta-Castro Arnoldo, de la Rosa Orlando, Aguayo-Acosta Alberto, Oyervides-Muñoz Mariel Araceli, Flores-Tlacuahuac Antonio, Parra-Saldívar Roberto, Sosa-Hernández Juan Eduardo
School of Engineering and Sciences, Tecnologico de Monterrey, Monterrey 64849, Mexico.
Institute of Advanced Materials for Sustainable Manufacturing, Tecnologico de Monterrey, Monterrey 64849, Mexico.
Viruses. 2025 Jan 15;17(1):109. doi: 10.3390/v17010109.
Detection and quantification of disease-related biomarkers in wastewater samples, denominated Wastewater-based Surveillance (WBS), has proven a valuable strategy for studying the prevalence of infectious diseases within populations in a time- and resource-efficient manner, as wastewater samples are representative of all cases within the catchment area, whether they are clinically reported or not. However, analysis and interpretation of WBS datasets for decision-making during public health emergencies, such as the COVID-19 pandemic, remains an area of opportunity. In this article, a database obtained from wastewater sampling at wastewater treatment plants (WWTPs) and university campuses in Monterrey and Mexico City between 2021 and 2022 was used to train simple clustering- and regression-based risk assessment models to allow for informed prevention and control measures in high-affluence facilities, even if working with low-dimensionality datasets and a limited number of observations. When dividing weekly data points based on whether the seven-day average daily new COVID-19 cases were above a certain threshold, the resulting clustering model could differentiate between weeks with surges in clinical reports and periods between them with an 87.9% accuracy rate. Moreover, the clustering model provided satisfactory forecasts one week (80.4% accuracy) and two weeks (81.8%) into the future. However, the prediction of the weekly average of new daily cases was limited (R = 0.80, MAPE = 72.6%), likely because of insufficient dimensionality in the database. Overall, while simple, WBS-supported models can provide relevant insights for decision-makers during epidemiological outbreaks, regression algorithms for prediction using low-dimensionality datasets can still be improved.
在废水样本中检测和量化与疾病相关的生物标志物,即基于废水的监测(WBS),已被证明是一种以高效省时且资源节约的方式研究人群中传染病流行情况的有价值策略,因为废水样本代表了集水区内的所有病例,无论这些病例是否有临床报告。然而,在公共卫生紧急事件(如新冠疫情)期间,分析和解释用于决策的WBS数据集仍是一个有待发展的领域。在本文中,利用2021年至2022年期间从蒙特雷和墨西哥城的污水处理厂(WWTPs)及大学校园采集的废水样本所获得的数据库,训练基于简单聚类和回归的风险评估模型,以便在高富裕度设施中采取明智的预防和控制措施,即便处理的是低维数据集且观测数量有限。当根据7天平均每日新增新冠病例是否高于某个阈值对每周数据点进行划分时,所得聚类模型能够以87.9%的准确率区分临床报告激增的周数和其间的时间段。此外,该聚类模型对未来一周(准确率80.4%)和两周(准确率81.8%)提供了令人满意的预测。然而,对每日新增病例周均值的预测有限(R = 0.80,平均绝对百分比误差 = 72.6%),这可能是由于数据库维度不足所致。总体而言,虽然简单,但基于WBS的模型可为流行病爆发期间的决策者提供相关见解,利用低维数据集进行预测的回归算法仍有改进空间。