Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
Caries Res. 2017;51(3):198-208. doi: 10.1159/000452675. Epub 2017 Mar 15.
Marginalized zero-inflated count regression models have recently been introduced for the statistical analysis of dental caries indices and other zero-inflated count data as alternatives to traditional zero-inflated and hurdle models. Unlike the standard approaches, the marginalized models directly estimate overall exposure or treatment effects by relating covariates to the marginal mean count. This article discusses model interpretation and model class choice according to the research question being addressed in caries research. Two data sets, one consisting of fictional dmft counts in 2 groups and the other on DMFS among schoolchildren from a randomized clinical trial comparing 3 toothpaste formulations to prevent incident dental caries, are analyzed with negative binomial hurdle, zero-inflated negative binomial, and marginalized zero-inflated negative binomial models. In the first example, estimates of treatment effects vary according to the type of incidence rate ratio (IRR) estimated by the model. Estimates of IRRs in the analysis of the randomized clinical trial were similar despite their distinctive interpretations. The choice of statistical model class should match the study's purpose, while accounting for the broad decline in children's caries experience, such that dmft and DMFS indices more frequently generate zero counts. Marginalized (marginal mean) models for zero-inflated count data should be considered for direct assessment of exposure effects on the marginal mean dental caries count in the presence of high frequencies of zero counts.
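The distinction between the two zero-inflated IRRs can be illustrated with a minimal sketch. It assumes the usual zero-inflated mixture, in which a group has an excess-zero probability (called `psi` here) and an at-risk-class mean count (called `mu` here), so that the marginal mean is (1 - psi) * mu. The notation, the `Group` class, and all numeric values are hypothetical illustrations and are not taken from the article's data sets. The negative binomial dispersion parameter is omitted because it does not enter the mean.

```python
from dataclasses import dataclass


@dataclass
class Group:
    """Hypothetical zero-inflated count parameters for one study group."""
    psi: float  # probability of an excess zero (assumed notation)
    mu: float   # mean count in the at-risk latent class (assumed notation)

    def marginal_mean(self) -> float:
        # Overall mean count under the zero-inflated mixture: (1 - psi) * mu
        return (1.0 - self.psi) * self.mu


# Illustrative values only; not estimates from the article.
control = Group(psi=0.30, mu=4.0)
treated = Group(psi=0.50, mu=3.0)

# IRR targeted by a traditional zero-inflated model: ratio of at-risk-class means.
latent_class_irr = treated.mu / control.mu

# IRR targeted by a marginalized model: ratio of overall (marginal) means.
marginal_irr = treated.marginal_mean() / control.marginal_mean()

print(f"latent-class IRR: {latent_class_irr:.3f}")
print(f"marginal IRR:     {marginal_irr:.3f}")
```

With these made-up values the latent-class IRR is 0.75, while the marginal IRR is about 0.54, because the treated group also has a higher share of excess zeros. When the excess-zero probabilities are equal across groups, the two ratios coincide.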