da Costa Avelar Pedro Henrique, Del Coco Natalia, Lamb Luis C, Tsoka Sophia, Cardoso-Silva Jonathan
Data Science Brigade, Porto Alegre, Rio Grande do Sul, Brazil.
Institute of Informatics, Federal University of Rio Grande do Sul, Porto Alegre, Rio Grande do Sul, Brazil.
Healthc Anal (N Y). 2022 Nov;2:100115. doi: 10.1016/j.health.2022.100115. Epub 2022 Oct 5.
Following the outbreak of the coronavirus epidemic in early 2020, municipalities, regional governments and policymakers worldwide had to plan their Non-Pharmaceutical Interventions (NPIs) amidst a scenario of great uncertainty. At this early stage of an epidemic, where no vaccine or medical treatment is in sight, algorithmic prediction can become a powerful tool to inform local policymaking. However, when we replicated one prominent epidemiological model to inform health authorities in a region in the south of Brazil, we found that this model relied too heavily on manually predetermined covariates and was too reactive to changes in data trends. Our four proposed models access data of both daily reported deaths and infections as well as take into account missing data (e.g., the under-reporting of cases) more explicitly, with two of the proposed versions also attempting to model the delay in test reporting. We simulated weekly forecasting of deaths from the period from 31/05/2020 until 31/01/2021, with first week data being used as a cold-start to the algorithm, after which we use a lighter variant of the model for faster forecasting. Because our models are significantly more proactive in identifying trend changes, this has improved forecasting, especially in long-range predictions and after the peak of an infection wave, as they were quicker to adapt to scenarios after these peaks in reported deaths. Assuming reported cases were under-reported greatly benefited the model in its stability, and modelling retroactively-added data (due to the "hot" nature of the data used) had a negligible impact on performance.
2020年初新冠病毒疫情爆发后,全球各地的市政当局、地区政府和政策制定者不得不在极具不确定性的情况下规划其非药物干预措施(NPIs)。在疫情的这个早期阶段,由于看不到疫苗或治疗方法,算法预测可以成为为地方政策制定提供信息的有力工具。然而,当我们复制一个著名的流行病学模型以向巴西南部一个地区的卫生当局提供信息时,我们发现这个模型过于依赖手动预先确定的协变量,并且对数据趋势的变化反应过度。我们提出的四个模型获取每日报告的死亡和感染数据,并更明确地考虑缺失数据(例如病例报告不足),其中两个提议版本还试图对检测报告的延迟进行建模。我们模拟了从2020年5月31日至2021年1月31日期间的每周死亡预测,第一周的数据用作算法的冷启动,之后我们使用该模型的一个更轻量级的变体进行更快的预测。由于我们的模型在识别趋势变化方面更加主动,这改进了预测,特别是在长期预测和感染波峰值之后,因为它们能更快地适应报告死亡人数峰值后的情况。假设报告的病例被严重少报对模型的稳定性有很大好处,并且对追溯添加的数据(由于所使用数据的“热”性质)进行建模对性能的影响可以忽略不计。