Considerations for Using Multiple Imputation in Propensity Score-Weighted Analysis - A Tutorial with Applied Example.

Authors

Eiset Andreas Halgreen, Frydenberg Morten

Affiliations

Department of Affective Disorders, Aarhus University Hospital-Psychiatry, Aarhus, Denmark.

Department of Public Health, Aarhus University, Aarhus, Denmark.

Publication information

Clin Epidemiol. 2022 Jul 7;14:835-847. doi: 10.2147/CLEP.S354733. eCollection 2022.

DOI: 10.2147/CLEP.S354733
PMID: 35832574
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9272848/
Abstract

PURPOSE

Propensity score weighting for confounder control and multiple imputation to counter missing data are both widely used methods in epidemiological research. Combining the two is not trivial and requires a number of decisions to produce valid inference. In this tutorial, we outline the assumptions underlying each method, present our considerations in combining the two, discuss the methodological and practical implications of our choices, and briefly point to alternatives. Throughout, we apply the theory to a research project about post-traumatic stress disorder in Syrian refugees.

PATIENTS AND METHODS

We detail how we used logistic regression-based propensity scores to produce "standardized mortality ratio" (SMR) weights, and Substantive Model Compatible-Full Conditional Specification (SMC-FCS) for multiple imputation of missing data, to obtain the estimate of association. Finally, a percentile confidence interval was produced by bootstrapping.
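As an illustration of the weighting step only, the following is a minimal sketch of how SMR weights are conventionally derived from propensity scores: exposed units receive weight 1 and unexposed units receive e/(1 - e), where e is the estimated probability of exposure. It assumes the propensity scores have already been estimated (for example by logistic regression); the function name `smr_weights` and the toy data are hypothetical and are not taken from the paper.

```python
def smr_weights(exposed, propensity):
    """Return SMR weights: 1 for exposed units, e / (1 - e) for unexposed units.

    `exposed` holds 0/1 exposure indicators and `propensity` holds the
    already-estimated probabilities of exposure (assumed inputs).
    """
    weights = []
    for a, e in zip(exposed, propensity):
        if not 0.0 < e < 1.0:
            raise ValueError("propensity scores must lie strictly between 0 and 1")
        # Exposed units keep weight 1; unexposed units are reweighted by the odds.
        weights.append(1.0 if a == 1 else e / (1.0 - e))
    return weights


# Toy data, invented purely for illustration.
exposed = [1, 0, 0, 1, 0]
propensity = [0.8, 0.5, 0.2, 0.6, 0.75]
weights = smr_weights(exposed, propensity)
```

Weights of this form reweight the unexposed group to resemble the exposed group on the modelled covariates, so the resulting contrast refers to the exposed population.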

RESULTS

A simple propensity score model with weight truncation at the 1st and 99th percentiles obtained acceptable balance on all covariates and was chosen as our model. Due to computational issues in the multiple imputation, two levels of one of the substantive model covariates and two levels of one of the auxiliary covariates were collapsed. This slightly modified propensity score model was the substantive model in the SMC-FCS multiple imputation, and regression models were set up for all partially observed covariates. We set the number of imputations to 10 and the number of iterations to 40. We produced 999 bootstrap estimates to compute the 95% percentile confidence interval.
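Two of the steps named above, truncating weights at the 1st and 99th percentiles and forming a percentile confidence interval from 999 bootstrap estimates, can be sketched generically as follows. This is a sketch under stated assumptions, not the authors' code: the helper names (`percentile`, `truncate_weights`, `bootstrap_percentile_ci`), the linear-interpolation percentile rule, and the toy data are all hypothetical. The `estimator` callback stands in for whatever analysis is rerun on each resample, which in a real analysis would include the imputation and weighting steps.

```python
import random
import statistics


def percentile(values, q):
    """Percentile of a list by linear interpolation, with q in [0, 100]."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)


def truncate_weights(weights, lower_q=1.0, upper_q=99.0):
    """Set weights below/above the given percentiles to those percentile values."""
    lo, hi = percentile(weights, lower_q), percentile(weights, upper_q)
    return [min(max(w, lo), hi) for w in weights]


def bootstrap_percentile_ci(data, estimator, n_boot=999, level=0.95, seed=0):
    """Percentile confidence interval from n_boot resamples drawn with replacement."""
    rng = random.Random(seed)
    n = len(data)
    estimates = []
    for _ in range(n_boot):
        sample = [data[rng.randrange(n)] for _ in range(n)]
        estimates.append(estimator(sample))
    alpha = (1.0 - level) / 2.0
    return (percentile(estimates, 100.0 * alpha),
            percentile(estimates, 100.0 * (1.0 - alpha)))


# Toy usage with invented numbers: truncate weights, then bootstrap a mean.
toy_weights = [float(i) for i in range(1, 102)]
truncated = truncate_weights(toy_weights)
toy_data = [float(i) for i in range(1, 51)]
ci_low, ci_high = bootstrap_percentile_ci(toy_data, statistics.mean)
```

Truncation trades a small amount of bias for lower variance by limiting the influence of extreme weights; the percentile interval simply reads the lower and upper 2.5% points off the sorted bootstrap estimates.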

CONCLUSION

Combining propensity score weighting and multiple imputation is not a trivial task. We present the considerations necessary to do so, recognizing that it is demanding in terms of both workload and computational time. However, we do not consider the former a drawback, since it makes some of the underlying assumptions explicit; the latter may be a nuisance that will diminish with faster computers and better implementations.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/96f8/9272848/9759b523e12a/CLEP-14-835-g0001.jpg
