Xu Mingchi, Douwes-Schultz Dirk, Schmidt Alexandra M
Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, QC, Canada.
Stat Med. 2025 Jun;44(13-14):e70135. doi: 10.1002/sim.70135.
In epidemiological studies, zero-inflated and hurdle models are commonly used to handle excess zeros in reported infectious disease cases. However, they cannot model the persistence (transition from presence to presence) and reemergence (transition from absence to presence) of a disease separately. Covariates can sometimes have different effects on the reemergence and persistence of a disease. Recently, a zero-inflated Markov switching negative binomial model was proposed to accommodate this issue. We introduce a Markov switching negative binomial hurdle model as a competitor of that approach, as hurdle models are often also used as alternatives to zero-inflated models for accommodating excess zeroes. We begin the comparison by inspecting the underlying assumptions made by both models. Hurdle models assume perfect detection of the disease cases while zero-inflated models implicitly assume the case counts can be under-reported, thus, we investigate when a negative binomial distribution can approximate the true distribution of reported counts. A comparison of the fit of the two types of Markov switching models is undertaken on chikungunya cases across the neighborhoods of Rio de Janeiro. We find that, among the fitted models, the Markov switching negative binomial zero-inflated model produces the best predictions, and both Markov switching models produce remarkably better predictions than more traditional negative binomial hurdle and zero-inflated models.
在流行病学研究中,零膨胀模型和门槛模型通常用于处理报告的传染病病例中过多的零值。然而,它们无法分别对疾病的持续存在(从存在到存在的转变)和再次出现(从不存在到存在的转变)进行建模。协变量有时对疾病的再次出现和持续存在会有不同的影响。最近,有人提出了一种零膨胀马尔可夫切换负二项式模型来解决这个问题。我们引入一种马尔可夫切换负二项式门槛模型作为该方法的竞争模型,因为门槛模型也经常被用作零膨胀模型的替代方法来处理过多的零值。我们通过检查这两种模型所做的基本假设来开始比较。门槛模型假设对疾病病例有完美的检测,而零膨胀模型隐含地假设病例数可能被漏报,因此,我们研究负二项式分布何时可以近似报告病例数的真实分布。我们对里约热内卢各社区的基孔肯雅热病例进行了两种马尔可夫切换模型拟合的比较。我们发现,在拟合模型中,零膨胀马尔可夫切换负二项式模型产生了最佳预测,并且两种马尔可夫切换模型都比更传统的负二项式门槛模型和零膨胀模型产生了明显更好的预测。