Division of International Epidemiology and Population Studies, Fogarty International Center, National Institutes of Health, Bethesda, MD, USA.
Laboratory for the Modeling of Biological and Socio-technical Systems, Northeastern University, Boston, MA, USA.
Epidemics. 2018 Mar;22:13-21. doi: 10.1016/j.epidem.2017.08.002. Epub 2017 Aug 26.
Infectious disease forecasting is gaining traction in the public health community; however, limited systematic comparisons of model performance exist. Here we present the results of a synthetic forecasting challenge inspired by the West African Ebola crisis in 2014-2015 and involving 16 international academic teams and US government agencies, and compare the predictive performance of 8 independent modeling approaches. Challenge participants were invited to predict 140 epidemiological targets across 5 different time points of 4 synthetic Ebola outbreaks, each involving different levels of interventions and "fog of war" in outbreak data made available for predictions. Prediction targets included 1-4 week-ahead case incidences, outbreak size, peak timing, and several natural history parameters. With respect to weekly case incidence targets, ensemble predictions based on a Bayesian average of the 8 participating models outperformed any individual model and did substantially better than a null auto-regressive model. There was no relationship between model complexity and prediction accuracy; however, the top performing models for short-term weekly incidence were reactive models with few parameters, fitted to a short and recent part of the outbreak. Individual model outputs and ensemble predictions improved with data accuracy and availability; by the second time point, just before the peak of the epidemic, estimates of final size were within 20% of the target. The 4th challenge scenario - mirroring an uncontrolled Ebola outbreak with substantial data reporting noise - was poorly predicted by all modeling teams. Overall, this synthetic forecasting challenge provided a deep understanding of model performance under controlled data and epidemiological conditions. We recommend such "peace time" forecasting challenges as key elements to improve coordination and inspire collaboration between modeling groups ahead of the next pandemic threat, and to assess model forecasting accuracy for a variety of known and hypothetical pathogens.
传染病预测在公共卫生领域越来越受到关注;然而,针对模型性能的系统比较有限。在这里,我们展示了一项受 2014-2015 年西非埃博拉危机启发的综合预测挑战的结果,该挑战涉及 16 个国际学术团队和美国政府机构,并比较了 8 种独立建模方法的预测性能。挑战参与者被邀请预测 4 种合成埃博拉疫情中的 140 个流行病学指标,每个疫情涉及不同水平的干预措施和可用于预测的爆发数据的“战争迷雾”。预测目标包括 4 个不同时间点的未来 1-4 周的病例发生率、疫情规模、峰值时间和几个自然史参数。就每周病例发生率目标而言,基于 8 个参与模型的贝叶斯平均的综合预测优于任何单个模型,并且显著优于空自回归模型。模型复杂性与预测准确性之间没有关系;然而,短期每周发病率的表现最佳的模型是具有少量参数的反应性模型,适用于疫情的短期和近期部分。个体模型输出和综合预测随着数据准确性和可用性的提高而提高;在第二个时间点,即疫情高峰期之前,最终规模的估计值与目标值相差不到 20%。第四个挑战场景——模拟了一个数据报告噪音较大的不受控制的埃博拉疫情——所有建模团队的预测都很差。总体而言,这项综合预测挑战深入了解了在受控数据和流行病学条件下的模型性能。我们建议将此类“和平时期”预测挑战作为提高协调能力的关键要素,并在下次大流行威胁之前激发建模团队之间的合作,以及评估针对各种已知和假设病原体的模型预测准确性。