Department of Medical Psychology, University Medical Center Hamburg-Eppendorf, Martinistr. 52, D-20246, Hamburg, Germany.
BMC Med Res Methodol. 2020 Nov 16;20(1):278. doi: 10.1186/s12874-020-01160-2.
Infectious disease predictions models, including virtually all epidemiological models describing the spread of the SARS-CoV-2 pandemic, are rarely evaluated empirically. The aim of the present study was to investigate the predictive accuracy of a prognostic model for forecasting the development of the cumulative number of reported SARS-CoV-2 cases in countries and administrative regions worldwide until the end of May 2020.
The cumulative number of reported SARS-CoV-2 cases was forecasted in 251 regions with a horizon of two weeks, one month, and two months using a hierarchical logistic model at the end of March 2020. Forecasts were compared to actual observations by using a series of evaluation metrics.
On average, predictive accuracy was very high in nearly all regions at the two weeks forecast, high in most regions at the one month forecast, and notable in the majority of the regions at the two months forecast. Higher accuracy was associated with the availability of more data for estimation and with a more pronounced cumulative case growth from the first case to the date of estimation. In some strongly affected regions, cumulative case counts were considerably underestimated.
With keeping its limitations in mind, the investigated model may be used for the preparation and distribution of resources during the initial phase of epidemics. Future research should primarily address the model's assumptions and its scope of applicability. In addition, establishing a relationship with known mechanisms and traditional epidemiological models of disease transmission would be desirable.
传染病预测模型,包括几乎所有描述 SARS-CoV-2 大流行传播的流行病学模型,很少进行实证评估。本研究旨在调查一个预测模型对预测全球各国和地区截至 2020 年 5 月底报告的 SARS-CoV-2 累计病例数发展的预测准确性。
在 2020 年 3 月底,使用分层逻辑模型对 251 个地区进行了两周、一个月和两个月的累计报告 SARS-CoV-2 病例数预测。通过一系列评估指标将预测与实际观测进行比较。
平均而言,在两周预测中,几乎所有地区的预测准确性都非常高,在一个月预测中,大多数地区的预测准确性较高,在两个月预测中,大多数地区的预测准确性显著。更高的准确性与更多的估计数据可用性以及从首例病例到估计日期更明显的累计病例增长有关。在一些受影响严重的地区,累计病例数被大大低估。
考虑到其局限性,该模型可用于在传染病的初始阶段进行资源准备和分配。未来的研究应主要解决模型的假设及其适用范围。此外,建立与已知机制和疾病传播的传统流行病学模型的关系将是可取的。