Botz Jonas, Valderrama Diego, Guski Jannis, Fröhlich Holger
Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Sankt Augustin, Germany.
Bonn-Aachen International Center for IT, University of Bonn, Bonn, Germany.
PLOS Glob Public Health. 2024 Aug 22;4(8):e0003058. doi: 10.1371/journal.pgph.0003058. eCollection 2024.
During the COVID-19 pandemic, many hospitals reached their capacity limits and could no longer guarantee treatment of all patients. At the same time, governments endeavored to take sensible measures to stop the spread of the virus while at the same time trying to keep the economy afloat. Many models extrapolating confirmed cases and hospitalization rate over short periods of time have been proposed, including several ones coming from the field of machine learning. However, the highly dynamic nature of the pandemic with rapidly introduced interventions and new circulating variants imposed non-trivial challenges for the generalizability of such models. In the context of this paper, we propose the use of ensemble models, which are allowed to change in their composition or weighting of base models over time and could thus better adapt to highly dynamic pandemic or epidemic situations. In that regard, we also explored the use of secondary metadata-Google searches-to inform the ensemble model. We tested our approach using surveillance data from COVID-19, Influenza, and hospital syndromic surveillance of severe acute respiratory infections (SARI). In general, we found ensembles to be more robust than the individual models. Altogether we see our work as a contribution to enhance the preparedness for future pandemic situations.
在新冠疫情期间,许多医院达到了其容量极限,无法再保证对所有患者进行治疗。与此同时,各国政府努力采取合理措施来阻止病毒传播,同时试图维持经济运转。已经提出了许多用于短期推断确诊病例和住院率的模型,包括一些来自机器学习领域的模型。然而,疫情的高度动态性,加上迅速推出的干预措施和新出现的传播变体,给此类模型的通用性带来了不小的挑战。在本文的背景下,我们建议使用集成模型,这种模型可以随着时间的推移在其基础模型的组成或权重上发生变化,从而能够更好地适应高度动态的疫情或流行病情况。在这方面,我们还探索了使用二级元数据——谷歌搜索数据——来为集成模型提供信息。我们使用来自新冠病毒、流感以及严重急性呼吸道感染(SARI)的医院症候群监测的监测数据对我们的方法进行了测试。总体而言,我们发现集成模型比单个模型更稳健。我们认为我们的工作总体上为增强对未来疫情形势的应对准备做出了贡献。