Division of Biostatistics and Bioinformatics, Department of Public Health Sciences, Medical University of South Carolina, 135 Cannon Street, Charleston, 29425, USA.
School of Medicine, Usher Institute, University of Edinburgh, Edinburgh, UK.
BMC Med Res Methodol. 2023 Aug 11;23(1):182. doi: 10.1186/s12874-023-01997-3.
Bayesian models have been applied throughout the Covid-19 pandemic especially to model time series of case counts or deaths. Fewer examples exist of spatio-temporal modeling, even though the spatial spread of disease is a crucial factor in public health monitoring. The predictive capabilities of infectious disease models is also important.
In this study, the ability of Bayesian hierarchical models to recover different parts of the variation in disease counts is the focus. It is clear that different measures provide different views of behavior when models are fitted prospectively. Over a series of time horizons one step predictions have been generated and compared for different models (for case counts and death counts). These Bayesian SIR models were fitted using MCMC at 28 time horizons to mimic prospective prediction. A range of goodness of prediction measures were analyzed across the different time horizons.
A particularly important result is that the peak intensity of case load is often under-estimated, while random spikes in case load can be mimicked using time dependent random effects. It is also clear that during the early wave of the pandemic simpler model forms are favored, but subsequently lagged spatial dependence models for cases are favored, even if the sophisticated models perform better overall.
The models fitted mimic the situation where at a given time the history of the process is known but the future must be predicted based on the current evolution which has been observed. Using an overall 'best' model for prediction based on retrospective fitting of the complete pandemic waves is an assumption. However it is also clear that this case count model is well favored over other forms. During the first wave a simpler time series model predicts case counts better for counties than a spatially dependent one. The picture is more varied for morality.
From a predictive point of view it is clear that spatio-temporal models applied to county level Covid-19 data within the US vary in how well they fit over time and also how well they predict future events. At different times, SIR case count models and also mortality models with cumulative counts perform better in terms of prediction. A fundamental result is that predictive capability of models varies over time and using the same model could lead to poor predictive performance. In addition it is clear that models addressing the spatial context for case counts (i.e. with lagged neighborhood terms) and cumulative case counts for mortality data are clearly better at modeling spatio-temporal data which is commonly available for the Covid-19 pandemic in different areas of the globe.
贝叶斯模型在整个新冠疫情期间都得到了应用,特别是用于对病例或死亡的时间序列进行建模。尽管疾病的空间传播是公共卫生监测的一个关键因素,但很少有空间建模的例子。传染病模型的预测能力也很重要。
在本研究中,重点关注贝叶斯层次模型恢复疾病计数变化的不同部分的能力。显然,当模型前瞻性拟合时,不同的措施提供了不同的行为视图。对于不同的模型(病例计数和死亡计数),已经生成并比较了一系列时间跨度的一步预测。这些贝叶斯 SIR 模型使用 MCMC 在 28 个时间跨度进行拟合,以模拟前瞻性预测。在不同的时间跨度上分析了一系列预测效果的度量标准。
一个特别重要的结果是,病例负荷的峰值强度往往被低估,而病例负荷的随机尖峰可以通过随时间变化的随机效应来模拟。也很明显,在大流行的早期阶段,更简单的模型形式受到青睐,但随后病例的滞后空间相关性模型受到青睐,即使复杂模型总体上表现更好。
拟合的模型模拟了在给定时间已知过程历史但必须根据已观察到的当前演变来预测未来的情况。基于对整个大流行波的回溯拟合来选择一个总体上“最佳”的预测模型是一个假设。然而,很明显,这种病例计数模型比其他形式更受欢迎。在第一波中,与空间相关的模型相比,更简单的时间序列模型更能准确预测县一级的病例数。对于死亡率,情况则更加多样化。
从预测的角度来看,应用于美国县一级新冠病毒数据的时空模型在拟合随时间变化的情况以及预测未来事件的能力方面存在差异。在不同的时间点,SIR 病例计数模型和死亡率模型(带有累计计数)在预测方面表现更好。一个基本的结果是,模型的预测能力随时间而变化,使用相同的模型可能会导致预测性能不佳。此外,很明显,针对病例计数的空间背景(即带有滞后邻域项)和死亡率数据的累积病例计数的模型更能准确地对大流行期间不同地区的全球常见的时空数据进行建模。