不同方法在倾向评分分析中处理缺失数据的比较。

A comparison of different methods to handle missing data in the context of propensity score analysis.

机构信息

Department of Clinical Epidemiology, Leiden University Medical Center, Albinusdreef 2, C7-P, 2333 ZA, Leiden, The Netherlands.

Department of Endocrinology and Metabolism, Leiden University Medical Center, Albinusdreef 2, C7-P, 2333 ZA, Leiden, The Netherlands.

出版信息

Eur J Epidemiol. 2019 Jan;34(1):23-36. doi: 10.1007/s10654-018-0447-z. Epub 2018 Oct 19.

DOI:10.1007/s10654-018-0447-z

PMID:30341708

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6325992/

Abstract

Propensity score analysis is a popular method to control for confounding in observational studies. A challenge in propensity methods is missing values in confounders. Several strategies for handling missing values exist, but guidance in choosing the best method is needed. In this simulation study, we compared four strategies of handling missing covariate values in propensity matching and propensity weighting. These methods include: complete case analysis, missing indicator method, multiple imputation and combining multiple imputation and missing indicator method. Concurrently, we aimed to provide guidance in choosing the optimal strategy. Simulated scenarios varied regarding missing mechanism, presence of effect modification or unmeasured confounding. Additionally, we demonstrated how missingness graphs help clarifying the missing structure. When no effect modification existed, complete case analysis yielded valid causal treatment effects even when data were missing not at random. In some situations, complete case analysis was also able to partially correct for unmeasured confounding. Multiple imputation worked well if the data were missing (completely) at random, and if the imputation model was correctly specified. In the presence of effect modification, more complex imputation models than default options of commonly used statistical software were required. Multiple imputation may fail when data are missing not at random. Here, combining multiple imputation and the missing indicator method reduced the bias as the missing indicator variable can be a proxy for unobserved confounding. The optimal way to handle missing values in covariates of propensity score models depends on the missing data structure and the presence of effect modification. When effect modification is present, default settings of imputation methods may yield biased results even if data are missing at random.

摘要

倾向评分分析是控制观察性研究中混杂因素的一种常用方法。在倾向评分方法中，混杂因素存在缺失值是一个挑战。目前存在几种处理缺失值的策略，但需要指导如何选择最佳方法。在这项模拟研究中，我们比较了在倾向匹配和倾向评分加权中处理缺失协变量值的四种策略。这些方法包括：完全案例分析、缺失指示符方法、多重插补和结合多重插补和缺失指示符方法。同时，我们旨在提供选择最佳策略的指导。模拟场景在缺失机制、存在效应修饰或未测量混杂因素方面存在差异。此外，我们展示了缺失图如何帮助澄清缺失结构。当不存在效应修饰时，即使数据不是随机缺失，完全案例分析也能得出有效的因果治疗效果。在某些情况下，完全案例分析也能够部分纠正未测量的混杂因素。如果数据是随机缺失的，并且插补模型正确指定，多重插补效果良好。当存在效应修饰时，需要比常用统计软件的默认选项更复杂的插补模型。当数据不是随机缺失时，多重插补可能会失败。在这里，结合多重插补和缺失指示符方法可以减少偏差，因为缺失指示变量可以作为未观察到的混杂因素的替代物。在倾向评分模型的协变量中处理缺失值的最佳方法取决于缺失数据结构和效应修饰的存在。当存在效应修饰时，即使数据是随机缺失的，插补方法的默认设置也可能产生有偏的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/26f1/6325992/df5fd8f618aa/10654_2018_447_Fig1_HTML.jpg

相似文献

A comparison of different methods to handle missing data in the context of propensity score analysis.

Eur J Epidemiol. 2019 Jan;34(1):23-36. doi: 10.1007/s10654-018-0447-z. Epub 2018 Oct 19.

Propensity score analysis with partially observed covariates: How should multiple imputation be used?

Stat Methods Med Res. 2019 Jan;28(1):3-19. doi: 10.1177/0962280217713032. Epub 2017 Jun 2.

Multiple imputation with missing indicators as proxies for unmeasured variables: simulation study.

BMC Med Res Methodol. 2020 Jul 8;20(1):185. doi: 10.1186/s12874-020-01068-x.

Propensity score matching after multiple imputation when a confounder has missing data.

Stat Med. 2023 Mar 30;42(7):1082-1095. doi: 10.1002/sim.9658. Epub 2023 Jan 25.

Comparison of methods for handling covariate missingness in propensity score estimation with a binary exposure.

BMC Med Res Methodol. 2020 Jun 26;20(1):168. doi: 10.1186/s12874-020-01053-4.

Propensity score estimation with missing values using a multiple imputation missingness pattern (MIMP) approach.

Stat Med. 2009 Apr 30;28(9):1402-14. doi: 10.1002/sim.3549.

Multiple imputation for propensity score analysis with covariates missing at random: some clarity on "within" and "across" methods.

Am J Epidemiol. 2024 Oct 7;193(10):1470-1476. doi: 10.1093/aje/kwae105.

Propensity Score Weighting with Missing Data on Covariates and Clustered Data Structure.

Multivariate Behav Res. 2024 May-Jun;59(3):411-433. doi: 10.1080/00273171.2024.2307529. Epub 2024 Feb 20.

Dealing with missing outcome data in randomized trials and observational studies.

Am J Epidemiol. 2012 Feb 1;175(3):210-7. doi: 10.1093/aje/kwr302. Epub 2011 Dec 23.

Outcome-sensitive multiple imputation: a simulation study.

BMC Med Res Methodol. 2017 Jan 9;17(1):2. doi: 10.1186/s12874-016-0281-5.

引用本文的文献

Continuation Versus Discontinuation of Sodium-Glucose Cotransporter-2 Inhibitors and Cardiorenal Outcomes Among Patients With Type 2 Diabetes and Chronic Kidney Disease: A Nationwide Cohort Study With a Target Trial Emulation Framework.

Clin Transl Sci. 2025 Aug;18(8):e70319. doi: 10.1111/cts.70319.

Comparison of the Risk of Pneumonia Between Fluticasone Furoate/Umeclidinium/Vilanterol and Multiple-Inhaler Triple Therapy in Patients with COPD Using Health Insurance Claims Data: Final Analysis of Post-Marketing Database Surveillance in Japan.

J Clin Med. 2025 Jul 2;14(13):4697. doi: 10.3390/jcm14134697.

Incorporation of missing indicator with multiple imputation in propensity score analysis with partially observed covariates: A simulation study.

Stat Methods Med Res. 2025 Jul;34(7):1293-1302. doi: 10.1177/09622802251338365. Epub 2025 Jun 19.

Type 2 diabetes, metabolic health, and the development of frozen shoulder: a cohort study in UK electronic health records.

BMC Musculoskelet Disord. 2025 May 14;26(1):471. doi: 10.1186/s12891-025-08672-2.

A comprehensive analysis of the prognostic value, expression characteristics and immune correlation of MKI67 in cancers.

Front Immunol. 2025 Feb 24;16:1531708. doi: 10.3389/fimmu.2025.1531708. eCollection 2025.

Outcomes for people experiencing homelessness with COVID-19 presenting to emergency departments in Canada, compared with housed patients.

CMAJ. 2025 Mar 10;197(9):E236-E243. doi: 10.1503/cmaj.241282.

The relationship between estimated glucose disposal rate and cognitive function in older individuals.

Sci Rep. 2025 Feb 18;15(1):5874. doi: 10.1038/s41598-025-89623-8.

Prognostic Nutritional Index as a Potential Biomarker for the Risk of Lower Extremity Deep Venous Thrombosis: A Large Retrospective Study.

Clin Appl Thromb Hemost. 2025 Jan-Dec;31:10760296251317520. doi: 10.1177/10760296251317520.

Estimating the effect of pre-exposure prophylaxis in Black men who have sex with men.

Int J Epidemiol. 2024 Dec 16;54(1). doi: 10.1093/ije/dyae170.

Real-World Persistence and Effectiveness of Upadacitinib versus Other Janus Kinase Inhibitors and Tumor Necrosis Factor Inhibitors in Australian Patients with Rheumatoid Arthritis.

Rheumatol Ther. 2025 Feb;12(1):173-202. doi: 10.1007/s40744-024-00736-4. Epub 2025 Jan 6.

本文引用的文献

Handling missing data in propensity score estimation in comparative effectiveness evaluations: a systematic review.

J Comp Eff Res. 2018 Mar;7(3):271-279. doi: 10.2217/cer-2017-0071. Epub 2017 Oct 5.

Propensity score analysis with partially observed covariates: How should multiple imputation be used?

Stat Methods Med Res. 2019 Jan;28(1):3-19. doi: 10.1177/0962280217713032. Epub 2017 Jun 2.

Comments on propensity score matching following multiple imputation.

Stat Methods Med Res. 2016 Dec;25(6):3066-3068. doi: 10.1177/0962280216674296.

Appropriate inclusion of interactions was needed to avoid bias in multiple imputation.

J Clin Epidemiol. 2016 Dec;80:107-115. doi: 10.1016/j.jclinepi.2016.07.004. Epub 2016 Jul 19.

Multiple imputation for IPD meta-analysis: allowing for heterogeneity and studies with missing covariates.

Stat Med. 2016 Jul 30;35(17):2938-54. doi: 10.1002/sim.6837. Epub 2015 Dec 17.

The REporting of studies Conducted using Observational Routinely-collected health Data (RECORD) statement.

PLoS Med. 2015 Oct 6;12(10):e1001885. doi: 10.1371/journal.pmed.1001885. eCollection 2015 Oct.

The performance of different propensity score methods for estimating absolute effects of treatments on survival outcomes: A simulation study.

Stat Methods Med Res. 2016 Oct;25(5):2214-2237. doi: 10.1177/0962280213519716. Epub 2014 Jan 23.

Selecting an appropriate caliper can be essential for achieving good balance with propensity score matching.

Am J Epidemiol. 2014 Jan 15;179(2):226-35. doi: 10.1093/aje/kwt212. Epub 2013 Oct 10.

A comparison of two methods of estimating propensity scores after multiple imputation.

Stat Methods Med Res. 2016 Feb;25(1):188-204. doi: 10.1177/0962280212445945. Epub 2012 Jun 11.

Missing covariate data in clinical research: when and when not to use the missing-indicator method for analysis.

CMAJ. 2012 Aug 7;184(11):1265-9. doi: 10.1503/cmaj.110977. Epub 2012 Feb 27.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

不同方法在倾向评分分析中处理缺失数据的比较。

A comparison of different methods to handle missing data in the context of propensity score analysis.

机构信息

Department of Clinical Epidemiology, Leiden University Medical Center, Albinusdreef 2, C7-P, 2333 ZA, Leiden, The Netherlands.

Department of Endocrinology and Metabolism, Leiden University Medical Center, Albinusdreef 2, C7-P, 2333 ZA, Leiden, The Netherlands.

出版信息

Eur J Epidemiol. 2019 Jan;34(1):23-36. doi: 10.1007/s10654-018-0447-z. Epub 2018 Oct 19.

DOI:10.1007/s10654-018-0447-z

PMID:30341708

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6325992/

Abstract

摘要

不同方法在倾向评分分析中处理缺失数据的比较。

A comparison of different methods to handle missing data in the context of propensity score analysis.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

不同方法在倾向评分分析中处理缺失数据的比较。

A comparison of different methods to handle missing data in the context of propensity score analysis.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献