Department of Statistical Sciences, University of Padova, Padova, Italy.
Stat Med. 2022 Jun 15;41(13):2403-2416. doi: 10.1002/sim.9361. Epub 2022 Mar 11.
Negative binomial regression is commonly employed to analyze overdispersed count data. With small to moderate sample sizes, the maximum likelihood estimator of the dispersion parameter may be subject to a significant bias, that in turn affects inference on mean parameters. This article proposes inference for negative binomial regression based on adjustments of the score function aimed at mean or median bias reduction. The resulting estimating equations generalize those available for improved inference in generalized linear models and can be solved using a suitable extension of iterative weighted least squares. Simulation studies confirm the good properties of the new methods, which are also found to solve in many cases numerical problems of maximum likelihood estimation. The methods are illustrated and evaluated using two case studies: an Ames salmonella assay data set and data on epileptic seizures. Inference based on adjusted scores turns out to generally improve on maximum likelihood, and even on explicit bias correction, with median bias reduction being overall preferable.
负二项回归常用于分析过离散的计数数据。在小到中等样本量的情况下,离散参数的最大似然估计可能存在显著的偏差,这反过来又影响了对均值参数的推断。本文提出了基于分数函数调整的负二项回归推断方法,旨在减少均值或中位数偏差。得到的估计方程推广了广义线性模型中用于改进推断的那些方程,并且可以使用迭代加权最小二乘法的适当扩展来求解。模拟研究证实了新方法的良好性质,这些方法还解决了最大似然估计中许多情况下的数值问题。使用两个案例研究来说明和评估这些方法:Ames 沙门氏菌检测数据集和癫痫发作数据。基于调整后的分数的推断通常优于最大似然推断,甚至优于显式偏差修正,总体而言,中位数偏差减少更优。