Schildcrout Jonathan S, Rathouz Paul J, Zelnick Leila R, Garbett Shawn P, Heagerty Patrick J
Departments of Biostatistics, Vanderbilt Univeristy School of Medicine.
Department of Biostatistics and Medical Informatics, University of Wisconsin School of Medicine and Public Health.
Ann Appl Stat. 2015 Jun;9(2):731-753. doi: 10.1214/15-AOAS826.
Substudies of the Childhood Asthma Management Program (CAMP Research Group, 1999, 2000) seek to identify patient characteristics associated with asthma symptoms and lung function. To determine if genetic measures are associated with trajectories of lung function as measured by forced vital capacity (FVC), children in the primary cohort study retrospectively had candidate loci evaluated. Given participant burden and constraints on financial resources, it is often desirable to target a sub-sample for ascertainment of costly measures. Methods that can leverage the longitudinal outcome on the full cohort to selectively measure informative individuals have been promising, but have been restricted in their use to analysis of the targeted sub-sample. In this paper we detail two multiple imputation analysis strategies that exploit outcome and partially observed covariate data on the non-sampled subjects, and we characterize alternative design and analysis combinations that could be used for future studies of pulmonary function and other outcomes. Candidate predictor (e.g. IL10 cytokine polymorphisms) associations obtained from targeted sampling designs can be estimated with very high efficiency compared to standard designs. Further, even though multiple imputation can dramatically improve estimation efficiency for covariates available on all subjects (e.g., gender and baseline age), only modest efficiency gains were observed in parameters associated with predictors that are exclusive to the targeted sample. Our results suggest that future studies of longitudinal trajectories can be efficiently conducted by use of outcome-dependent designs and associated full cohort analysis.
儿童哮喘管理项目的子研究(儿童哮喘管理项目研究组,1999年,2000年)旨在确定与哮喘症状和肺功能相关的患者特征。为了确定基因检测指标是否与通过用力肺活量(FVC)测量的肺功能轨迹相关,主要队列研究中的儿童对候选基因座进行了回顾性评估。考虑到参与者的负担和资金资源的限制,通常希望针对一个子样本进行昂贵检测指标的确证。能够利用整个队列的纵向结果来选择性测量信息丰富个体的方法很有前景,但仅限于对目标子样本的分析。在本文中,我们详细介绍了两种多重填补分析策略,这些策略利用了未抽样个体的结果和部分观测到的协变量数据,并且我们描述了可用于未来肺功能及其他结果研究的替代设计和分析组合。与标准设计相比,从目标抽样设计中获得的候选预测指标(如IL10细胞因子多态性)关联可以非常高效地进行估计。此外,尽管多重填补可以显著提高对所有个体都可用的协变量(如性别和基线年龄)的估计效率,但在与目标样本独有的预测指标相关的参数中,仅观察到适度的效率提升。我们的结果表明,未来对纵向轨迹的研究可以通过使用依赖结果的设计和相关的全队列分析来高效开展。