Suppr超能文献

结果自适应套索:用于因果推断的变量选择

Outcome-adaptive lasso: Variable selection for causal inference.

作者信息

Shortreed Susan M, Ertefaie Ashkan

机构信息

Biostatistics Unit, Group Health Research Institute, Seattle, Washington, U.S.A.

Department of Biostatistics, University of Washington, School of Public Health, Seattle, Washington, U.S.A.

出版信息

Biometrics. 2017 Dec;73(4):1111-1122. doi: 10.1111/biom.12679. Epub 2017 Mar 8.

Abstract

Methodological advancements, including propensity score methods, have resulted in improved unbiased estimation of treatment effects from observational data. Traditionally, a "throw in the kitchen sink" approach has been used to select covariates for inclusion into the propensity score, but recent work shows including unnecessary covariates can impact both the bias and statistical efficiency of propensity score estimators. In particular, the inclusion of covariates that impact exposure but not the outcome, can inflate standard errors without improving bias, while the inclusion of covariates associated with the outcome but unrelated to exposure can improve precision. We propose the outcome-adaptive lasso for selecting appropriate covariates for inclusion in propensity score models to account for confounding bias and maintaining statistical efficiency. This proposed approach can perform variable selection in the presence of a large number of spurious covariates, that is, covariates unrelated to outcome or exposure. We present theoretical and simulation results indicating that the outcome-adaptive lasso selects the propensity score model that includes all true confounders and predictors of outcome, while excluding other covariates. We illustrate covariate selection using the outcome-adaptive lasso, including comparison to alternative approaches, using simulated data and in a survey of patients using opioid therapy to manage chronic pain.

摘要

方法学的进步,包括倾向得分方法,已使从观察性数据中对治疗效果进行无偏估计得到改善。传统上,一直采用“把所有东西都扔进厨房水槽”的方法来选择纳入倾向得分的协变量,但最近的研究表明,纳入不必要的协变量会影响倾向得分估计量的偏差和统计效率。特别是,纳入影响暴露但不影响结局的协变量,会在不改善偏差的情况下使标准误膨胀,而纳入与结局相关但与暴露无关的协变量则可提高精度。我们提出了结局自适应套索法,用于选择纳入倾向得分模型的合适协变量,以解决混杂偏差并保持统计效率。这种提出的方法可以在存在大量虚假协变量(即与结局或暴露无关的协变量)的情况下进行变量选择。我们给出了理论和模拟结果,表明结局自适应套索法选择的倾向得分模型包含所有真正的混杂因素和结局预测因子,同时排除其他协变量。我们使用结局自适应套索法展示协变量选择,包括与其他方法的比较,使用模拟数据以及在一项使用阿片类药物疗法治疗慢性疼痛的患者调查中进行展示。

相似文献

3
Variable selection for causal mediation analysis using LASSO-based methods.基于 LASSO 的方法进行因果中介分析的变量选择。
Stat Methods Med Res. 2021 Jun;30(6):1413-1427. doi: 10.1177/0962280221997505. Epub 2021 Mar 23.

引用本文的文献

1
Variable selection for doubly robust causal inference.双重稳健因果推断的变量选择
Stat Interface. 2025;18(1):93-105. doi: 10.4310/sii.241023040813. Epub 2024 Oct 22.
6
Estimating wage disparities using foundation models.使用基础模型估计工资差距。
Proc Natl Acad Sci U S A. 2025 Jun 3;122(22):e2427298122. doi: 10.1073/pnas.2427298122. Epub 2025 May 30.
10
Robust propensity score estimation via loss function calibration.通过损失函数校准进行稳健的倾向得分估计。
Stat Methods Med Res. 2025 Mar;34(3):457-472. doi: 10.1177/09622802241308709. Epub 2025 Feb 12.

本文引用的文献

6
Estimation and Accuracy after Model Selection.模型选择后的估计与准确性。
J Am Stat Assoc. 2014 Jul 1;109(507):991-1007. doi: 10.1080/01621459.2013.823775.
7
Confounder selection via penalized credible regions.通过惩罚可信区域进行混杂因素选择。
Biometrics. 2014 Dec;70(4):852-61. doi: 10.1111/biom.12203. Epub 2014 Aug 14.
9
Prescription opioid analgesics increase the risk of depression.处方类阿片类镇痛药会增加患抑郁症的风险。
J Gen Intern Med. 2014 Mar;29(3):491-9. doi: 10.1007/s11606-013-2648-1. Epub 2013 Oct 29.
10
Model feedback in Bayesian propensity score estimation.贝叶斯倾向得分估计中的模型反馈。
Biometrics. 2013 Mar;69(1):263-73. doi: 10.1111/j.1541-0420.2012.01830.x. Epub 2013 Feb 4.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验