Fearn T, Hill D C, Darby S C
Department of Statistical Science, University College London, London, U.K.
Stat Med. 2008 May 30;27(12):2159-76. doi: 10.1002/sim.3163.
In epidemiology, one approach to investigating the dependence of disease risk on an explanatory variable in the presence of several confounding variables is by fitting a binary regression using a conditional likelihood, thus eliminating the nuisance parameters. When the explanatory variable is measured with error, the estimated regression coefficient is biased usually towards zero. Motivated by the need to correct for this bias in analyses that combine data from a number of case-control studies of lung cancer risk associated with exposure to residential radon, two approaches are investigated. Both employ the conditional distribution of the true explanatory variable given the measured one. The method of regression calibration uses the expected value of the true given measured variable as the covariate. The second approach integrates the conditional likelihood numerically by sampling from the distribution of the true given measured explanatory variable. The two approaches give very similar point estimates and confidence intervals not only for the motivating example but also for an artificial data set with known properties. These results and some further simulations that demonstrate correct coverage for the confidence intervals suggest that for studies of residential radon and lung cancer the regression calibration approach will perform very well, so that nothing more sophisticated is needed to correct for measurement error.
在流行病学中,在存在多个混杂变量的情况下,研究疾病风险对一个解释变量的依赖性的一种方法是使用条件似然拟合二元回归,从而消除讨厌的参数。当解释变量存在测量误差时,估计的回归系数通常会偏向于零。出于在结合多项与接触住宅氡相关的肺癌风险病例对照研究数据的分析中校正这种偏差的需要,研究了两种方法。两种方法都利用给定测量值的真实解释变量的条件分布。回归校准方法使用给定测量变量的真实变量的期望值作为协变量。第二种方法通过从给定测量解释变量的真实分布中抽样来对条件似然进行数值积分。这两种方法不仅对于激励示例,而且对于具有已知属性的人工数据集,都给出了非常相似的点估计和置信区间。这些结果以及一些进一步的模拟表明置信区间具有正确的覆盖率,这表明对于住宅氡与肺癌的研究,回归校准方法将表现得非常好,因此无需更复杂的方法来校正测量误差。