Schill W, Drescher K
Bremen Institute for Prevention Research and Social Medicine, Germany.
Stat Med. 1997;16(1-3):117-32. doi: 10.1002/(sici)1097-0258(19970130)16:2<117::aid-sim475>3.0.co;2-5.
This paper discusses the analysis of two-stage studies where covariates are missing or measured with error at the first stage of sampling and are validated at the second stage in a subsample. Four recently developed approaches, the weighted pseudo-likelihood method of Flanders and Greenland (1991), the pseudo-conditional likelihood methods of Breslow and Cain (1988) and Schill et al. (1993) and the maximum likelihood estimate obtained via the EM-algorithm (Wacholder and Weinberg, 1994) are reviewed, and some connections between them are established. It is shown that, with respect to odds ratio estimation, case-control designs can be analysed as if first-stage sampling had been prospective. The procedures are numerically compared with respect to asymptotic relative efficiency in a missing value setting.
本文讨论了两阶段研究的分析方法,其中协变量在第一阶段抽样时缺失或测量有误差,并在第二阶段的子样本中进行验证。回顾了四种最近开发的方法,即弗兰德斯和格林兰(1991年)的加权伪似然法、布雷斯洛和凯恩(1988年)以及希尔等人(1993年)的伪条件似然法,以及通过期望最大化算法(瓦霍尔德和温伯格,1994年)获得的最大似然估计,并建立了它们之间的一些联系。结果表明,在估计比值比方面,病例对照设计可以像第一阶段抽样是前瞻性的那样进行分析。在缺失值设定下,对这些程序的渐近相对效率进行了数值比较。