Huang Yijian
Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, 1518 Clifton Rd. NE, Atlanta, GA 30322, U.S.A.
Stat Sin. 2014 Jan 1;24(1):357-374. doi: 10.5705/ss.2012.181.
Corrected score (Nakamura, 1990; Stefanski, 1989) is an important consistent functional modeling method for covariate measurement error in nonlinear regression. Although its pathological behaviors are known to exacerbate with increasing error contamination, neither their nature nor severity is well understood. In this article, we conduct a detailed investigation with the log-linear model for count data in the presence of sizable measurement error. Our study reveals that multiple roots, estimate-finding failure, and skewness in distribution are common and they may persist even when the sample size is practically large. Furthermore, these pathological behaviors are attributed to a surprising fact that desirable trend of the corrected score always goes astray as the parameter space approaches extremes. A novel remedy is proposed to constrain the derivatives with additional estimating functions. The resulting trend-constrained corrected score may also substantially improve the estimation efficiency. These findings and estimation strategy shed light on the developments for other nonlinear models as well, including logistic and Cox regression models, and for nonparametric correction.
校正得分(中村,1990年;斯特凡斯基,1989年)是用于非线性回归中协变量测量误差的一种重要的一致性函数建模方法。尽管已知其病态行为会随着误差污染的增加而加剧,但其本质和严重程度都尚未得到很好的理解。在本文中,我们针对存在相当大测量误差的计数数据的对数线性模型进行了详细研究。我们的研究表明,多重根、估计失败以及分布中的偏度很常见,并且即使样本量实际上很大,它们也可能持续存在。此外,这些病态行为归因于一个令人惊讶的事实,即随着参数空间接近极值,校正得分的理想趋势总是会出错。我们提出了一种新颖的补救方法,用额外的估计函数来约束导数。由此产生的趋势约束校正得分也可能显著提高估计效率。这些发现和估计策略也为其他非线性模型的发展提供了启示,包括逻辑回归和Cox回归模型,以及非参数校正。