Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America.
PLoS Comput Biol. 2022 Jun 3;18(6):e1010115. doi: 10.1371/journal.pcbi.1010115. eCollection 2022 Jun.
Infectious disease forecasting is of great interest to the public health community and policymakers, since forecasts can provide insight into disease dynamics in the near future and inform interventions. Due to delays in case reporting, however, forecasting models may often underestimate the current and future disease burden. In this paper, we propose a general framework for addressing reporting delay in disease forecasting efforts with the goal of improving forecasts. We propose strategies for leveraging either historical data on case reporting or external internet-based data to estimate the amount of reporting error. We then describe several approaches for adapting general forecasting pipelines to account for under- or over-reporting of cases. We apply these methods to address reporting delay in data on dengue fever cases in Puerto Rico from 1990 to 2009 and to reports of influenza-like illness (ILI) in the United States between 2010 and 2019. Through a simulation study, we compare method performance and evaluate robustness to assumption violations. Our results show that forecasting accuracy and prediction coverage almost always increase when correction methods are implemented to address reporting delay. Some of these methods required knowledge about the reporting error or high quality external data, which may not always be available. Provided alternatives include excluding recently-reported data and performing sensitivity analysis. This work provides intuition and guidance for handling delay in disease case reporting and may serve as a useful resource to inform practical infectious disease forecasting efforts.
传染病预测对公共卫生界和政策制定者非常感兴趣,因为预测可以提供对近期疾病动态的洞察,并为干预措施提供信息。然而,由于病例报告的延迟,预测模型往往可能低估当前和未来的疾病负担。在本文中,我们提出了一个用于解决传染病预测中报告延迟问题的通用框架,旨在改善预测。我们提出了利用病例报告的历史数据或基于互联网的外部数据来估计报告误差量的策略。然后,我们描述了几种方法,用于调整一般的预测流程,以考虑病例的少报或多报情况。我们将这些方法应用于解决波多黎各 1990 年至 2009 年登革热病例数据和美国 2010 年至 2019 年流感样疾病(ILI)报告中的报告延迟问题。通过模拟研究,我们比较了方法的性能,并评估了对违反假设的稳健性。我们的结果表明,当实施纠正报告延迟的方法时,预测准确性和预测覆盖率几乎总是会提高。其中一些方法需要关于报告误差或高质量外部数据的知识,而这些知识可能并不总是可用的。提供的替代方案包括排除最近报告的数据和进行敏感性分析。这项工作为处理疾病病例报告的延迟提供了直觉和指导,并可能为实际的传染病预测工作提供有用的资源。