Mutiso Fedelis, Pearce John L, Benjamin-Neelon Sara E, Mueller Noel T, Li Hong, Neelon Brian
Division of Biostatistics, Department of Public Health Sciences, Medical University of South Carolina, Charleston, South Carolina, USA.
Division of Environmental Health, Department of Public Health Sciences, Medical University of South Carolina, Charleston, South Carolina, USA.
Biom J. 2024 Jul;66(5):e202300182. doi: 10.1002/bimj.202300182.
Spatial count data with an abundance of zeros arise commonly in disease mapping studies. Typically, these data are analyzed using zero-inflated models, which comprise a mixture of a point mass at zero and an ordinary count distribution, such as the Poisson or negative binomial. However, due to their mixture representation, conventional zero-inflated models are challenging to explain in practice because the parameter estimates have conditional latent-class interpretations. As an alternative, several authors have proposed marginalized zero-inflated models that simultaneously model the excess zeros and the marginal mean, leading to a parameterization that more closely aligns with ordinary count models. Motivated by a study examining predictors of COVID-19 death rates, we develop a spatiotemporal marginalized zero-inflated negative binomial model that directly models the marginal mean, thus extending marginalized zero-inflated models to the spatial setting. To capture the spatiotemporal heterogeneity in the data, we introduce region-level covariates, smooth temporal effects, and spatially correlated random effects to model both the excess zeros and the marginal mean. For estimation, we adopt a Bayesian approach that combines full-conditional Gibbs sampling and Metropolis-Hastings steps. We investigate features of the model and use the model to identify key predictors of COVID-19 deaths in the US state of Georgia during the 2021 calendar year.
在疾病地图绘制研究中,大量零值的空间计数数据很常见。通常,这些数据使用零膨胀模型进行分析,零膨胀模型由零处的点质量和普通计数分布(如泊松分布或负二项分布)的混合组成。然而,由于其混合表示,传统的零膨胀模型在实际中难以解释,因为参数估计具有条件潜在类别解释。作为一种替代方法,一些作者提出了边际化零膨胀模型,该模型同时对过多的零值和边际均值进行建模,从而得到一种与普通计数模型更紧密对齐的参数化方法。受一项研究COVID-19死亡率预测因素的启发,我们开发了一种时空边际化零膨胀负二项模型,该模型直接对边际均值进行建模,从而将边际化零膨胀模型扩展到空间设置。为了捕捉数据中的时空异质性,我们引入区域水平协变量、平滑时间效应和空间相关随机效应,以对过多的零值和边际均值进行建模。对于估计,我们采用一种结合完全条件吉布斯采样和梅特罗波利斯-黑斯廷斯步骤的贝叶斯方法。我们研究了该模型的特征,并使用该模型识别2