Marshall Maximilian, Parker Felix, Gardner Lauren M
Department of Civil and Systems Engineering, Johns Hopkins University, Baltimore, MD, USA.
BMC Glob Public Health. 2024 Oct 3;2(1):67. doi: 10.1186/s44263-024-00098-7.
COVID-19 will not be the last pandemic of the twenty-first century. To better prepare for the next one, it is essential that we make honest appraisals of the utility of different responses to COVID. In this paper, we focus specifically on epidemiologic forecasting. Characterizing forecast efficacy over the history of the pandemic is challenging, especially given its significant spatial, temporal, and contextual variability. In this light, we introduce the Weighted Contextual Interval Score (WCIS), a new method for retrospective interval forecast evaluation.
The central tenet of the WCIS is a direct incorporation of contextual utility into the evaluation. This necessitates a specific characterization of forecast efficacy depending on the use case for predictions, accomplished via defining a utility threshold parameter. This idea is generalized to probabilistic interval-form forecasts, which are the preferred prediction format for epidemiological modeling, as an extension of the existing Weighted Interval Score (WIS).
We apply the WCIS to two forecasting scenarios: facility-level hospitalizations for a single state, and state-level hospitalizations for the whole of the United States. We observe that an appropriately parameterized application of the WCIS captures both the relative quality and the overall frequency of useful forecasts. Since the WCIS represents the utility of predictions using contextual normalization, it is easily comparable across highly variable pandemic scenarios while remaining intuitively representative of the in-situ quality of individual forecasts.
The WCIS provides a pragmatic utility-based characterization of probabilistic predictions. This method is expressly intended to enable practitioners and policymakers who may not have expertise in forecasting but are nevertheless essential partners in epidemic response to use and provide insightful analysis of predictions. We note that the WCIS is intended specifically for retrospective forecast evaluation and should not be used as a minimized penalty in a competitive context as it lacks statistical propriety. Code and data used for our analysis are available at https://github.com/maximilian-marshall/wcis .
2019冠状病毒病(COVID-19)不会是21世纪的最后一场大流行。为了更好地应对下一次大流行,我们必须诚实地评估对COVID-19的不同应对措施的效用。在本文中,我们特别关注流行病学预测。鉴于大流行历史上预测效果存在显著的空间、时间和背景差异,描述其预测效果具有挑战性。有鉴于此,我们引入了加权背景区间得分(WCIS),这是一种用于回顾性区间预测评估的新方法。
WCIS的核心原则是将背景效用直接纳入评估。这需要根据预测的用例对预测效果进行特定描述,通过定义一个效用阈值参数来实现。这个想法被推广到概率区间形式的预测,这是流行病学建模的首选预测格式,作为现有加权区间得分(WIS)的扩展。
我们将WCIS应用于两种预测场景:单个州的设施级住院人数预测,以及美国全国的州级住院人数预测。我们观察到,对WCIS进行适当的参数化应用,可以捕捉到有用预测的相对质量和总体频率。由于WCIS通过背景归一化来表示预测的效用,因此它在高度可变的大流行场景中易于比较,同时仍然直观地代表了单个预测的实际质量。
WCIS为概率预测提供了一种基于实用程序的务实描述。该方法明确旨在使可能没有预测专业知识但却是疫情应对中重要合作伙伴的从业者和政策制定者能够使用并对预测提供有见地的分析。我们注意到,WCIS专门用于回顾性预测评估,不应在竞争环境中用作最小化惩罚,因为它缺乏统计适当性。我们分析中使用的代码和数据可在https://github.com/maximilian-marshall/wcis上获取。