Suppr超能文献

具有缺失数据的多层模型的协变量选择

Covariate Selection for Multilevel Models with Missing Data.

作者信息

Marino Miguel, Buxton Orfeu M, Li Yi

机构信息

Department of Family Medicine, Department of Public Health, Division of Biostatistics, Oregon Health and Science University, Portland, OR 97239 USA.

Associate Professor, Department of Biobehavioral Health, Pennsylvania State University, University Park, PA 16802. Lecturer on Medicine, Division of Sleep Medicine, Harvard Medical School, Boston, MA 02115. Associate Neuroscientist, Department of Medicine, Brigham and Women's Hospital, Boston, MA 02115. Adjunct Associate Professor, Department of Social and Behavioral Sciences, Harvard T.H. Chan School of Public Health, Boston, MA 02115.

出版信息

Stat (Int Stat Inst). 2017;6(1):31-46. doi: 10.1002/sta4.133. Epub 2017 Jan 8.

Abstract

Missing covariate data hampers variable selection in multilevel regression settings. Current variable selection techniques for multiply-imputed data commonly address missingness in the predictors through list-wise deletion and stepwise-selection methods which are problematic. Moreover, most variable selection methods are developed for independent linear regression models and do not accommodate multilevel mixed effects regression models with incomplete covariate data. We develop a novel methodology that is able to perform covariate selection across multiply-imputed data for multilevel random effects models when missing data is present. Specifically, we propose to stack the multiply-imputed data sets from a multiple imputation procedure and to apply a group variable selection procedure through group lasso regularization to assess the overall impact of each predictor on the outcome across the imputed data sets. Simulations confirm the advantageous performance of the proposed method compared with the competing methods. We applied the method to reanalyze the Healthy Directions-Small Business cancer prevention study, which evaluated a behavioral intervention program targeting multiple risk-related behaviors in a working-class, multi-ethnic population.

摘要

缺失的协变量数据会妨碍多级回归设置中的变量选择。当前用于多重填补数据的变量选择技术通常通过存在问题的逐行删除和逐步选择方法来处理预测变量中的缺失值。此外,大多数变量选择方法是为独立线性回归模型开发的,不适用于具有不完整协变量数据的多级混合效应回归模型。我们开发了一种新颖的方法,当存在缺失数据时,该方法能够对多级随机效应模型的多重填补数据进行协变量选择。具体而言,我们建议将多重填补程序中的多重填补数据集堆叠起来,并通过组套索正则化应用组变量选择程序,以评估每个预测变量对整个填补数据集结果的总体影响。模拟结果证实了所提出的方法与竞争方法相比具有优势性能。我们应用该方法重新分析了健康方向 - 小企业癌症预防研究,该研究评估了一项针对工人阶级多民族人群中多种与风险相关行为的行为干预计划。

相似文献

1
Covariate Selection for Multilevel Models with Missing Data.具有缺失数据的多层模型的协变量选择
Stat (Int Stat Inst). 2017;6(1):31-46. doi: 10.1002/sta4.133. Epub 2017 Jan 8.

本文引用的文献

2
A Perturbation Method for Inference on Regularized Regression Estimates.一种用于正则化回归估计推断的摄动方法。
J Am Stat Assoc. 2011 Jan 1;106(496):1371-1382. doi: 10.1198/jasa.2011.tm10382. Epub 2012 Jan 24.
4
Fixed and random effects selection in mixed effects models.混合效应模型中的固定效应和随机效应选择
Biometrics. 2011 Jun;67(2):495-503. doi: 10.1111/j.1541-0420.2010.01463.x. Epub 2010 Jul 21.
10
Variable selection for semiparametric mixed models in longitudinal studies.纵向研究中半参数混合模型的变量选择
Biometrics. 2010 Mar;66(1):79-88. doi: 10.1111/j.1541-0420.2009.01240.x. Epub 2009 Apr 13.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验