Kim Soyoung, Zeng Donglin, Cai Jianwen
Division of Biostatistics, Medical College of Wisconsin, Milwaukee, Wisconsin, U.S.A.
Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, U.S.A.
Biometrics. 2018 Dec;74(4):1250-1260. doi: 10.1111/biom.12923. Epub 2018 Jul 10.
Generalized case-cohort design has been proposed to assess the effects of exposures on survival outcomes when measuring exposures is expensive and events are not rare in the cohort. In such design, expensive exposure information is collected from both a (stratified) randomly selected subcohort and a subset of individuals with events. In this article, we consider extension of such design to study multiple types of survival events by selecting a proportion of cases for each type of event. We propose a general weighting scheme to analyze data. Furthermore, we examine the optimal choice of weights and show that this optimal weighting yields much improved efficiency gain both asymptotically and in simulation studies. Finally, we apply our proposed methods to data from the Atherosclerosis Risk in Communities study.
当测量暴露因素成本高昂且队列中事件并不罕见时,有人提出采用广义病例队列设计来评估暴露因素对生存结局的影响。在这种设计中,从一个(分层)随机选择的子队列和有事件发生的个体子集中收集昂贵的暴露信息。在本文中,我们考虑将这种设计扩展到通过为每种类型的事件选择一定比例的病例来研究多种类型的生存事件。我们提出一种通用的加权方案来分析数据。此外,我们研究了权重的最优选择,并表明这种最优加权在渐近情况下和模拟研究中都能显著提高效率增益。最后,我们将所提出的方法应用于社区动脉粥样硬化风险研究的数据。