Wu Wenqi, Stamey James, Kahle David
Department of Statistical Science, Baylor University, One Bear Place #97140, Waco, TX, 76706, USA.
Int J Environ Res Public Health. 2015 Aug 28;12(9):10648-61. doi: 10.3390/ijerph120910648.
Count data are subject to considerable sources of what is often referred to as non-sampling error. Errors such as misclassification, measurement error and unmeasured confounding can lead to substantially biased estimators. It is strongly recommended that epidemiologists not only acknowledge these sorts of errors in data, but incorporate sensitivity analyses into part of the total data analysis. We extend previous work on Poisson regression models that allow for misclassification by thoroughly discussing the basis for the models and allowing for extra-Poisson variability in the form of random effects. Via simulation we show the improvements in inference that are brought about by accounting for both the misclassification and the overdispersion.
计数数据存在大量通常被称为非抽样误差的来源。诸如错误分类、测量误差和未测量的混杂因素等误差可能导致估计值出现严重偏差。强烈建议流行病学家不仅要认识到数据中的这类误差,还要将敏感性分析纳入整体数据分析的一部分。我们扩展了先前关于允许错误分类的泊松回归模型的工作,通过全面讨论模型的基础并以随机效应的形式考虑超泊松变异。通过模拟,我们展示了通过同时考虑错误分类和过度离散所带来的推断改进。