Zheng Han, Kimber Alan, Goodwin Victoria A, Pickering Ruth M
Medical Statistics Group, Faculty of Medicine, University of Southampton, Southampton, England.
Mathematical Sciences, University of Southampton, Southampton, England.
Biom J. 2018 Jan;60(1):66-78. doi: 10.1002/bimj.201700103. Epub 2017 Oct 25.
A common design for a falls prevention trial is to assess falling at baseline, randomize participants into an intervention or control group, and ask them to record the number of falls they experience during a follow-up period of time. This paper addresses how best to include the baseline count in the analysis of the follow-up count of falls in negative binomial (NB) regression. We examine the performance of various approaches in simulated datasets where both counts are generated from a mixed Poisson distribution with shared random subject effect. Including the baseline count after log-transformation as a regressor in NB regression (NB-logged) or as an offset (NB-offset) resulted in greater power than including the untransformed baseline count (NB-unlogged). Cook and Wei's conditional negative binomial (CNB) model replicates the underlying process generating the data. In our motivating dataset, a statistically significant intervention effect resulted from the NB-logged, NB-offset, and CNB models, but not from NB-unlogged, and large, outlying baseline counts were overly influential in NB-unlogged but not in NB-logged. We conclude that there is little to lose by including the log-transformed baseline count in standard NB regression compared to CNB for moderate to larger sized datasets.
预防跌倒试验的常见设计是在基线时评估跌倒情况,将参与者随机分为干预组或对照组,并要求他们记录在随访期间经历的跌倒次数。本文探讨了在负二项式(NB)回归中,如何最好地将基线计数纳入跌倒随访计数的分析中。我们在模拟数据集中检验了各种方法的性能,其中两个计数均由具有共享随机个体效应的混合泊松分布生成。在NB回归(NB-logged)中将对数转换后的基线计数作为回归变量包含在内,或者作为偏移量(NB-offset)包含在内,比包含未转换的基线计数(NB-unlogged)具有更大的功效。Cook和Wei的条件负二项式(CNB)模型复制了生成数据的潜在过程。在我们的激励数据集中,NB-logged、NB-offset和CNB模型产生了具有统计学意义的干预效果,但NB-unlogged模型没有,并且大的、离群的基线计数在NB-unlogged中影响过大,但在NB-logged中并非如此。我们得出结论,对于中等至较大规模的数据集,与CNB相比,在标准NB回归中纳入对数转换后的基线计数几乎没有损失。