Koch Brandon, Vock David M, Wolfson Julian
Division of Biostatistics, University of Minnesota, Minneapolis, Minnesota, U.S.A.
Biometrics. 2018 Mar;74(1):8-17. doi: 10.1111/biom.12736. Epub 2017 Jun 21.
The efficiency of doubly robust estimators of the average causal effect (ACE) of a treatment can be improved by including in the treatment and outcome models only those covariates which are related to both treatment and outcome (i.e., confounders) or related only to the outcome. However, it is often challenging to identify such covariates among the large number that may be measured in a given study. In this article, we propose GLiDeR (Group Lasso and Doubly Robust Estimation), a novel variable selection technique for identifying confounders and predictors of outcome using an adaptive group lasso approach that simultaneously performs coefficient selection, regularization, and estimation across the treatment and outcome models. The selected variables and corresponding coefficient estimates are used in a standard doubly robust ACE estimator. We provide asymptotic results showing that, for a broad class of data generating mechanisms, GLiDeR yields a consistent estimator of the ACE when either the outcome or treatment model is correctly specified. A comprehensive simulation study shows that GLiDeR is more efficient than doubly robust methods using standard variable selection techniques and has substantial computational advantages over a recently proposed doubly robust Bayesian model averaging method. We illustrate our method by estimating the causal treatment effect of bilateral versus single-lung transplant on forced expiratory volume in one year after transplant using an observational registry.
通过仅在治疗模型和结果模型中纳入那些与治疗和结果均相关(即混杂因素)或仅与结果相关的协变量,可以提高治疗平均因果效应(ACE)的双稳健估计量的效率。然而,在给定研究中可能测量的大量协变量中识别此类协变量通常具有挑战性。在本文中,我们提出了GLiDeR(组套索和双稳健估计),这是一种新颖的变量选择技术,用于使用自适应组套索方法识别混杂因素和结果预测因子,该方法同时在治疗模型和结果模型中进行系数选择、正则化和估计。所选变量和相应的系数估计用于标准的双稳健ACE估计量。我们提供了渐近结果,表明对于广泛的数据生成机制,当结果模型或治疗模型正确设定时,GLiDeR会产生ACE的一致估计量。一项全面的模拟研究表明,GLiDeR比使用标准变量选择技术的双稳健方法更有效,并且与最近提出的双稳健贝叶斯模型平均方法相比具有显著的计算优势。我们使用一个观察性登记处的数据,通过估计双侧与单肺移植对移植后一年用力呼气量的因果治疗效应来说明我们的方法。