Hosseini Saman, Cohnstaedt Lee W, Humphreys John M, Scoglio Caterina
Department of Electrical and Computer Engineering, Kansas State University, Manhattan, KS, USA.
Foreign Arthropod-Borne Animal Diseases Research Unit National Bio- and Agro-defense Facility, USDA ARS, Manhattan, KS, USA.
Infect Dis Model. 2024 Jun 28;9(4):1175-1197. doi: 10.1016/j.idm.2024.06.004. eCollection 2024 Dec.
Upon researching predictive models related to West Nile virus disease, it is discovered that there are numerous parameters and extensive information in most models, thus contributing to unnecessary complexity. Another challenge frequently encountered is the lead time, which refers to the period for which predictions are made and often is too short. This paper addresses these issues by introducing a parsimonious method based on ICC curves, offering a logistic distribution model derived from the vector-borne SEIR model. Unlike existing models relying on diverse environmental data, our approach exclusively utilizes historical and present infected human cases (number of new cases). With a year-long lead time, the predictions extend throughout the 12 months, gaining precision as new data emerge. Theoretical conditions are derived to minimize Bayesian loss, enhancing predictive precision. We construct a Bayesian forecasting probability density function using carefully selected prior distributions. Applying these functions, we predict month-specific infections nationwide, rigorously evaluating accuracy with probabilistic metrics. Additionally, HPD credible intervals at 90%, 95%, and 99% levels is performed. Precision assessment is conducted for HPD intervals, measuring the proportion of intervals that does not include actual reported cases for 2020-2022.
在研究与西尼罗河病毒病相关的预测模型时,发现大多数模型中有众多参数和大量信息,从而导致不必要的复杂性。经常遇到的另一个挑战是提前期,即进行预测的时间段,且该时间段往往过短。本文通过引入基于ICC曲线的简约方法来解决这些问题,提供一种从媒介传播的SEIR模型推导而来的逻辑分布模型。与依赖各种环境数据的现有模型不同,我们的方法仅利用历史和当前感染人类病例(新病例数)。提前期为一年,预测涵盖全年12个月,并随着新数据的出现而提高精度。推导理论条件以最小化贝叶斯损失,提高预测精度。我们使用精心选择的先验分布构建贝叶斯预测概率密度函数。应用这些函数,我们预测全国特定月份的感染情况,并使用概率指标严格评估准确性。此外,还进行了90%、95%和99%水平的最高后验密度(HPD)可信区间分析。对HPD区间进行精度评估,测量2020 - 2022年不包含实际报告病例的区间比例。