School of Human Movement and Sport Sciences, University of Ballarat, Mt Helen, VIC 3353, Australia.
Accid Anal Prev. 2010 Mar;42(2):384-92. doi: 10.1016/j.aap.2009.08.018. Epub 2009 Oct 1.
Falls and their injury outcomes have count distributions that are highly skewed toward the right with clumping at zero, posing analytical challenges. Different modelling approaches have been used in the published literature to describe falls count distributions, often without consideration of the underlying statistical and modelling assumptions. This paper compares the use of modified Poisson and negative binomial (NB) models as alternatives to Poisson (P) regression, for the analysis of fall outcome counts. Four different count-based regression models (P, NB, zero-inflated Poisson (ZIP), zero-inflated negative binomial (ZINB)) were each individually fitted to four separate fall count datasets from Australia, New Zealand and United States. The finite mixtures of P and NB regression models were also compared to the standard NB model. Both analytical (F, Vuong and bootstrap tests) and graphical approaches were used to select and compare models. Simulation studies assessed the size and power of each model fit. This study confirms that falls count distributions are over-dispersed, but not dispersed due to excess zero counts or heterogeneous population. Accordingly, the P model generally provided the poorest fit to all datasets. The fit improved significantly with NB and both zero-inflated models. The fit was also improved with the NB model, compared to finite mixtures of both P and NB regression models. Although there was little difference in fit between NB and ZINB models, in the interests of parsimony it is recommended that future studies involving modelling of falls count data routinely use the NB models in preference to the P or ZINB or finite mixture distribution. The fact that these conclusions apply across four separate datasets from four different samples of older people participating in studies of different methodology, adds strength to this general guiding principle.
跌倒及其伤害结果的分布呈高度右偏态,集中在零处,这给分析带来了挑战。已发表的文献中使用了不同的建模方法来描述跌倒次数的分布,而通常没有考虑到潜在的统计和建模假设。本文比较了使用修正泊松和负二项(NB)模型替代泊松(P)回归,用于分析跌倒结果计数。将四种不同的基于计数的回归模型(P、NB、零膨胀泊松(ZIP)、零膨胀负二项(ZINB))分别单独拟合到来自澳大利亚、新西兰和美国的四个独立的跌倒计数数据集。还将 P 和 NB 回归模型的有限混合与标准 NB 模型进行了比较。使用分析(F、Vuong 和自举检验)和图形方法来选择和比较模型。模拟研究评估了每个模型拟合的大小和功效。本研究证实,跌倒次数分布是过离散的,但不是由于过剩的零计数或异质人群造成的离散。因此,P 模型通常对所有数据集的拟合效果最差。NB 和两种零膨胀模型的拟合效果显著改善。与 P 和 NB 回归模型的有限混合相比,NB 模型的拟合效果也得到了改善。虽然 NB 和 ZINB 模型的拟合效果差异不大,但为了简约起见,建议未来涉及跌倒计数数据建模的研究通常使用 NB 模型而不是 P 或 ZINB 或有限混合分布。这些结论适用于来自四个不同的老年人样本的四个独立数据集,这些样本参与了不同方法学的研究,这为这一普遍指导原则增加了力度。