Suppr超能文献

倾向得分模型的变量选择

Variable selection for propensity score models.

作者信息

Brookhart M Alan, Schneeweiss Sebastian, Rothman Kenneth J, Glynn Robert J, Avorn Jerry, Stürmer Til

机构信息

Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02120, USA.

出版信息

Am J Epidemiol. 2006 Jun 15;163(12):1149-56. doi: 10.1093/aje/kwj149. Epub 2006 Apr 19.

Abstract

Despite the growing popularity of propensity score (PS) methods in epidemiology, relatively little has been written in the epidemiologic literature about the problem of variable selection for PS models. The authors present the results of two simulation studies designed to help epidemiologists gain insight into the variable selection problem in a PS analysis. The simulation studies illustrate how the choice of variables that are included in a PS model can affect the bias, variance, and mean squared error of an estimated exposure effect. The results suggest that variables that are unrelated to the exposure but related to the outcome should always be included in a PS model. The inclusion of these variables will decrease the variance of an estimated exposure effect without increasing bias. In contrast, including variables that are related to the exposure but not to the outcome will increase the variance of the estimated exposure effect without decreasing bias. In very small studies, the inclusion of variables that are strongly related to the exposure but only weakly related to the outcome can be detrimental to an estimate in a mean squared error sense. The addition of these variables removes only a small amount of bias but can increase the variance of the estimated exposure effect. These simulation studies and other analytical results suggest that standard model-building tools designed to create good predictive models of the exposure will not always lead to optimal PS models, particularly in small studies.

摘要

尽管倾向评分(PS)方法在流行病学中越来越受欢迎,但流行病学文献中关于PS模型变量选择问题的论述相对较少。作者展示了两项模拟研究的结果,旨在帮助流行病学家深入了解PS分析中的变量选择问题。模拟研究说明了PS模型中纳入的变量选择如何影响估计暴露效应的偏差、方差和均方误差。结果表明,与暴露无关但与结局相关的变量应始终纳入PS模型。纳入这些变量将降低估计暴露效应的方差,而不会增加偏差。相比之下,纳入与暴露相关但与结局无关的变量会增加估计暴露效应的方差,而不会降低偏差。在非常小的研究中,纳入与暴露强相关但与结局弱相关的变量在均方误差意义上可能对估计不利。添加这些变量只能消除少量偏差,但会增加估计暴露效应的方差。这些模拟研究和其他分析结果表明,旨在创建良好暴露预测模型的标准模型构建工具并不总是能产生最优的PS模型,尤其是在小型研究中。

相似文献

1
Variable selection for propensity score models.
Am J Epidemiol. 2006 Jun 15;163(12):1149-56. doi: 10.1093/aje/kwj149. Epub 2006 Apr 19.
5
Effects of adjusting for instrumental variables on bias and precision of effect estimates.
Am J Epidemiol. 2011 Dec 1;174(11):1213-22. doi: 10.1093/aje/kwr364. Epub 2011 Oct 24.
6
Variable selection for propensity score models when estimating treatment effects on multiple outcomes: a simulation study.
Pharmacoepidemiol Drug Saf. 2013 Jan;22(1):77-85. doi: 10.1002/pds.3356. Epub 2012 Oct 16.
7
The implications of propensity score variable selection strategies in pharmacoepidemiology: an empirical illustration.
Pharmacoepidemiol Drug Saf. 2011 Jun;20(6):551-9. doi: 10.1002/pds.2098. Epub 2011 Mar 10.
8
Measuring balance and model selection in propensity score methods.
Pharmacoepidemiol Drug Saf. 2011 Nov;20(11):1115-29. doi: 10.1002/pds.2188. Epub 2011 Jul 29.
10
Magnitude and direction of missing confounders had different consequences on treatment effect estimation in propensity score analysis.
J Clin Epidemiol. 2017 Jul;87:87-97. doi: 10.1016/j.jclinepi.2017.04.001. Epub 2017 Apr 12.

引用本文的文献

1
Variable selection for doubly robust causal inference.
Stat Interface. 2025;18(1):93-105. doi: 10.4310/sii.241023040813. Epub 2024 Oct 22.
7
The link between child dietary diversity and child anemia: The power of colorful plates.
PLOS Glob Public Health. 2025 Jul 30;5(7):e0005001. doi: 10.1371/journal.pgph.0005001. eCollection 2025.
8
Dynamic versus fixed cerebral perfusion pressure targets in paediatric traumatic brain injury: a STARSHIP analysis.
EClinicalMedicine. 2025 Jul 17;86:103370. doi: 10.1016/j.eclinm.2025.103370. eCollection 2025 Aug.
9
Propensity score analysis revisited.
Ann Clin Epidemiol. 2025 Mar 14;7(3):99-104. doi: 10.37737/ace.25012. eCollection 2025 Jul 1.

本文引用的文献

1
The use of propensity scores in pharmacoepidemiologic research.
Pharmacoepidemiol Drug Saf. 2000 Mar;9(2):93-101. doi: 10.1002/(SICI)1099-1557(200003/04)9:2<93::AID-PDS474>3.0.CO;2-I.
3
On principles for modeling propensity scores in medical research.
Pharmacoepidemiol Drug Saf. 2004 Dec;13(12):855-7. doi: 10.1002/pds.968.
4
Principles for modeling propensity scores in medical research: a systematic literature review.
Pharmacoepidemiol Drug Saf. 2004 Dec;13(12):841-53. doi: 10.1002/pds.969.
5
Marginal structural models and causal inference in epidemiology.
Epidemiology. 2000 Sep;11(5):550-60. doi: 10.1097/00001648-200009000-00011.
6
Invited commentary: propensity scores.
Am J Epidemiol. 1999 Aug 15;150(4):327-33. doi: 10.1093/oxfordjournals.aje.a010011.
7
Propensity score methods for bias reduction in the comparison of a treatment to a non-randomized control group.
Stat Med. 1998 Oct 15;17(19):2265-81. doi: 10.1002/(sici)1097-0258(19981015)17:19<2265::aid-sim918>3.0.co;2-b.
8
Estimating causal effects from large data sets using propensity scores.
Ann Intern Med. 1997 Oct 15;127(8 Pt 2):757-63. doi: 10.7326/0003-4819-127-8_part_2-199710151-00064.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验