Suppr超能文献

使用连续尖峰和平板先验的高维混杂调整

High-Dimensional Confounding Adjustment Using Continuous Spike and Slab Priors.

作者信息

Antonelli Joseph, Parmigiani Giovanni, Dominici Francesca

机构信息

Department of Statistics, University of Florida, 102 Griffin-Floyd Hall, P.O. Box 118545, Gainesville, Fl, 32611, USA.

Department of Biostatistics and Computational Biology, CLS 11007, Dana-Farber Cancer Institute, 450 Brookline Ave, Boston, MA, 02215, USA.

出版信息

Bayesian Anal. 2019 Sep;14(3):805-828. doi: 10.1214/18-ba1131. Epub 2019 Jun 11.

Abstract

In observational studies, estimation of a causal effect of a treatment on an outcome relies on proper adjustment for confounding. If the number of the potential confounders () is larger than the number of observations (), then direct control for all potential confounders is infeasible. Existing approaches for dimension reduction and penalization are generally aimed at predicting the outcome, and are less suited for estimation of causal effects. Under standard penalization approaches (e.g. Lasso), if a variable is strongly associated with the treatment but weakly with the outcome , the coefficient will be shrunk towards zero thus leading to confounding bias. Under the assumption of a linear model for the outcome and sparsity, we propose continuous spike and slab priors on the regression coefficients corresponding to the potential confounders . Specifically, we introduce a prior distribution that does not heavily shrink to zero the coefficients ( s) of the s that are strongly associated with but weakly associated with . We compare our proposed approach to several state of the art methods proposed in the literature. Our proposed approach has the following features: 1) it reduces confounding bias in high dimensional settings; 2) it shrinks towards zero coefficients of instrumental variables; and 3) it achieves good coverages even in small sample sizes. We apply our approach to the National Health and Nutrition Examination Survey (NHANES) data to estimate the causal effects of persistent pesticide exposure on triglyceride levels.

摘要

在观察性研究中,对治疗对结果的因果效应进行估计依赖于对混杂因素的适当调整。如果潜在混杂因素的数量()大于观察值的数量(),那么直接控制所有潜在混杂因素是不可行的。现有的降维和惩罚方法通常旨在预测结果,不太适合估计因果效应。在标准惩罚方法(如套索回归)下,如果一个变量与治疗强烈相关但与结果弱相关,系数将向零收缩,从而导致混杂偏差。在结果的线性模型和稀疏性假设下,我们针对与潜在混杂因素对应的回归系数提出连续尖峰和平板先验。具体来说,我们引入一种先验分布,该分布不会将与强烈相关但与弱相关的的系数()严重收缩至零。我们将我们提出的方法与文献中提出的几种先进方法进行比较。我们提出的方法具有以下特点:1)它在高维设置中减少混杂偏差;2)它将工具变量的系数向零收缩;3)即使在小样本量下也能实现良好的覆盖率。我们将我们的方法应用于国家健康和营养检查调查(NHANES)数据,以估计持续接触农药对甘油三酯水平的因果效应。

相似文献

10
Bayesian semiparametric multiple shrinkage.贝叶斯半参数多重收缩法
Biometrics. 2010 Jun;66(2):455-62. doi: 10.1111/j.1541-0420.2009.01275.x. Epub 2009 Jun 8.

引用本文的文献

6
Data Science in Environmental Health Research.环境健康研究中的数据科学
Curr Epidemiol Rep. 2019 Sep;6(3):291-299. doi: 10.1007/s40471-019-00205-5. Epub 2019 Jul 15.

本文引用的文献

2
Bayesian factorizations of big sparse tensors.大稀疏张量的贝叶斯因式分解
J Am Stat Assoc. 2015;110(512):1562-1576. doi: 10.1080/01621459.2014.983233. Epub 2016 Jan 15.
3
Post-Selection Inference for -Penalized Likelihood Models.用于惩罚似然模型的选择后推断
Can J Stat. 2018 Mar;46(1):41-61. doi: 10.1002/cjs.11313. Epub 2017 Mar 6.
7
Model averaged double robust estimation.模型平均双稳健估计
Biometrics. 2017 Jun;73(2):410-421. doi: 10.1111/biom.12622. Epub 2016 Nov 28.
9
Dirichlet-Laplace priors for optimal shrinkage.用于最优收缩的狄利克雷-拉普拉斯先验
J Am Stat Assoc. 2015 Dec 1;110(512):1479-1490. doi: 10.1080/01621459.2014.960967. Epub 2014 Sep 25.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验