Harp Ryan D, Holcomb Karen M, Retkute Renata, Prusokiene Alisa, Prusokas Augustinas, Ertem Zeynep, Ajelli Marco, Kummer Allisandra G, Litvinova Maria, Merler Stefano, Piontti Ana Pastore Y, Poletti Piero, Vespignani Alessandro, Wilke Andre B B, Zardini Agnese, Smith Kelly Helm, Armstrong Philip, DeFelice Nicholas, Keyel Alexander, Shepard John, Smith Rebecca, Tyre Andrew, Humphreys John, Cohnstaedt Lee W, Hosseini Saman, Scoglio Caterina, Gorris Morgan E, Barnard Martha, Moser S Kane, Spencer Julie A, McCarter Maggie S J, Lee Christopher, Nolan Melissa S, Barker Christopher M, Staples J Erin, Nett Randall J, Johansson Michael A
Division of Vector-Borne Diseases, Centers for Disease Control and Prevention, Fort Collins, CO, USA.
Global Systems Laboratory, National Oceanic and Atmospheric Administration, Boulder, CO, USA.
Parasit Vectors. 2025 Apr 23;18(1):152. doi: 10.1186/s13071-025-06767-2.
West Nile virus (WNV) is the most common cause of mosquito-borne disease in the continental USA, with an average of ~1200 severe, neuroinvasive cases reported annually from 2005 to 2021 (range 386-2873). Despite this burden, efforts to forecast WNV disease to inform public health measures to reduce disease incidence have had limited success. Here, we analyze forecasts submitted to the 2022 WNV Forecasting Challenge, a follow-up to the 2020 WNV Forecasting Challenge.
Forecasting teams submitted probabilistic forecasts of annual West Nile virus neuroinvasive disease (WNND) cases for each county in the continental USA for the 2022 WNV season. We assessed the skill of team-specific forecasts, baseline forecasts, and an ensemble created from team-specific forecasts. We then characterized the impact of model characteristics and county-specific contextual factors (e.g., population) on forecast skill.
Ensemble forecasts for 2022 anticipated a season at or below median long-term WNND incidence for nearly all (> 99%) counties. More counties reported higher case numbers than anticipated by the ensemble forecast median, but national caseload (826) was well below the 10-year median (1386). Forecast skill was highest for the ensemble forecast, though the historical negative binomial baseline model and several team-submitted forecasts had similar forecast skill. Forecasts utilizing regression-based frameworks tended to have more skill than those that did not and models using climate, mosquito surveillance, demographic, or avian data had less skill than those that did not, potentially due to overfitting. County-contextual analysis showed strong relationships with the number of years that WNND had been reported and permutation entropy (historical variability). Evaluations based on weighted interval score and logarithmic scoring metrics produced similar results.
The relative success of the ensemble forecast, the best forecast for 2022, suggests potential gains in community ability to forecast WNV, an improvement from the 2020 Challenge. Similar to the previous challenge, however, our results indicate that skill was still limited with general underprediction despite a relative low incidence year. Potential opportunities for improvement include refining mechanistic approaches, integrating additional data sources, and considering different approaches for areas with and without previous cases.
西尼罗河病毒(WNV)是美国大陆由蚊子传播疾病的最常见病因,2005年至2021年期间,平均每年报告约1200例严重的神经侵袭性病例(范围为386 - 2873例)。尽管有如此负担,但为预测西尼罗河病毒疾病以指导公共卫生措施来降低疾病发病率所做的努力取得的成功有限。在此,我们分析了提交给2022年西尼罗河病毒预测挑战赛的预测结果,该挑战赛是2020年西尼罗河病毒预测挑战赛的后续活动。
预测团队提交了2022年西尼罗河病毒季节美国大陆各县年度西尼罗河病毒神经侵袭性疾病(WNND)病例的概率预测。我们评估了特定团队预测、基线预测以及由特定团队预测创建的集合预测的技能。然后,我们描述了模型特征和特定县的背景因素(如人口)对预测技能的影响。
2022年的集合预测预计几乎所有(>99%)县的季节发病率处于或低于长期WNND发病率中位数。报告的病例数高于集合预测中位数预期的县更多,但全国病例数(826例)远低于10年中位数(1386例)。集合预测的预测技能最高,尽管历史负二项式基线模型和几个团队提交的预测具有相似的预测技能。使用基于回归框架的预测往往比未使用的预测技能更高,而使用气候、蚊子监测、人口统计或鸟类数据的模型比未使用这些数据的模型技能更低,这可能是由于过度拟合。县背景分析显示与报告WNND的年数和排列熵(历史变异性)有很强的关系。基于加权区间得分和对数得分指标进行的评估产生了相似的结果。
2022年最佳预测——集合预测的相对成功表明社区预测西尼罗河病毒的能力有潜在提升,相较于2020年挑战赛有所改进。然而,与之前的挑战赛类似,我们的结果表明尽管发病率相对较低,但预测技能仍然有限,总体预测偏低。改进的潜在机会包括完善机制方法、整合更多数据源以及考虑对有既往病例和无既往病例的地区采用不同方法。