Comparison of methods for handling covariate missingness in propensity score estimation with a binary exposure.

Affiliations

Temple University, 1301 Cecil B. Moore Ave. Ritter Annex, 9th floor, Philadelphia, PA, 19122, USA.

GlaxoSmithKline, Philadelphia, USA.

Publication information

BMC Med Res Methodol. 2020 Jun 26;20(1):168. doi: 10.1186/s12874-020-01053-4.

DOI:10.1186/s12874-020-01053-4

PMID:32586271

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7318364/

Abstract

BACKGROUND

Causal effect estimation with observational data is subject to bias due to confounding, which is often controlled for using propensity scores. One unresolved issue in propensity score estimation is how to handle missing values in covariates.

METHOD

Several approaches have been proposed for handling covariate missingness, including multiple imputation (MI), multiple imputation with missingness pattern (MIMP), and treatment mean imputation. Other potentially useful approaches have not yet been evaluated: single imputation (SI) + prediction error (PE); SI + PE + parameter uncertainty (PU); and Generalized Boosted Modeling (GBM), a nonparametric approach for estimating propensity scores that handles missing values automatically during estimation using a surrogate split method. A simulation study was conducted to evaluate the performance of these approaches.
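To make two of the simpler approaches concrete, the sketch below contrasts treatment mean imputation with a regression-based single imputation that adds a random residual (one plausible reading of SI + PE). The abstract does not specify the imputation models, so the linear-normal model, the simulated data, and the function names `treatment_mean_impute` and `regression_impute_with_error` are all illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def treatment_mean_impute(x, treatment):
    """Fill NaNs in x with the observed mean of x in the same treatment group."""
    x = np.asarray(x, dtype=float)
    treatment = np.asarray(treatment)
    filled = x.copy()
    for group in np.unique(treatment):
        in_group = treatment == group
        observed = in_group & ~np.isnan(x)
        filled[in_group & np.isnan(x)] = x[observed].mean()
    return filled

def regression_impute_with_error(x, predictors, rng):
    """Single imputation: linear prediction plus a random residual draw.

    Assumes a linear model with normal errors (hypothetical choice).
    """
    x = np.asarray(x, dtype=float)
    design = np.column_stack([np.ones(len(x)), predictors])
    obs = ~np.isnan(x)
    # Fit ordinary least squares on the rows where x is observed.
    beta, *_ = np.linalg.lstsq(design[obs], x[obs], rcond=None)
    resid = x[obs] - design[obs] @ beta
    sigma = np.sqrt(resid @ resid / (obs.sum() - design.shape[1]))
    filled = x.copy()
    # The added noise is the "prediction error" part of the imputation.
    noise = rng.normal(0.0, sigma, size=(~obs).sum())
    filled[~obs] = design[~obs] @ beta + noise
    return filled

# Simulated example data (assumed, not from the study).
rng = np.random.default_rng(42)
n = 200
z = rng.normal(size=n)                      # fully observed covariate
treatment = rng.binomial(1, 0.5, size=n)    # binary exposure
x = 2.0 + 1.5 * z + rng.normal(size=n)      # covariate with missingness
x_missing = x.copy()
x_missing[rng.random(n) < 0.25] = np.nan

x_mean = treatment_mean_impute(x_missing, treatment)
x_si_pe = regression_impute_with_error(
    x_missing, np.column_stack([z, treatment]), rng
)
```

Adding parameter uncertainty (the PU step) would additionally draw the regression coefficients from their estimated sampling distribution before predicting; that step is omitted here.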

RESULTS

In terms of bias, SI + PE, SI + PE + PU, MI, and MIMP performed almost equally well, and all four performed better than treatment mean imputation and GBM; however, MI and MIMP additionally account for the uncertainty introduced by imputing the missing values.
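The abstract does not say how the MI-based approaches carry imputation uncertainty into the final estimate. A common device for this is Rubin's rules, which combine the average within-imputation variance with the between-imputation variance. The sketch below assumes that pooling scheme; the function name `pool_rubin` and the example numbers are hypothetical.

```python
import numpy as np

def pool_rubin(estimates, variances):
    """Pool m per-imputation estimates and variances with Rubin's rules."""
    q = np.asarray(estimates, dtype=float)
    u = np.asarray(variances, dtype=float)
    m = len(q)
    pooled = q.mean()            # pooled point estimate
    within = u.mean()            # average within-imputation variance
    between = q.var(ddof=1)      # between-imputation variance
    total = within + (1 + 1 / m) * between
    return pooled, total

# Made-up effect estimates and variances from m = 3 imputed datasets.
pooled, total = pool_rubin([1.0, 1.2, 0.8], [0.04, 0.05, 0.03])
```

A single imputation provides no between-imputation term, so its variance estimate reflects only the within-imputation part.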

CONCLUSIONS

Applying GBM to the incomplete data and relying on the surrogate split approach resulted in substantial bias. Imputation prior to implementing GBM is recommended.
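A minimal sketch of the recommended order of operations, imputing first and fitting the boosted model afterwards, is given below. It uses scikit-learn's `IterativeImputer` and `GradientBoostingClassifier` as stand-ins: these are not the software used in the study, and this classifier does not implement surrogate splits. The simulated data, the number of imputations, and the decision to include the exposure in the imputation model are all assumptions.

```python
import numpy as np
from sklearn.experimental import enable_iterative_imputer  # noqa: F401
from sklearn.impute import IterativeImputer
from sklearn.ensemble import GradientBoostingClassifier

# Simulated data (assumed): three covariates, binary exposure.
rng = np.random.default_rng(0)
n = 300
X = rng.normal(size=(n, 3))
p_true = 1 / (1 + np.exp(-(X[:, 0] - 0.5 * X[:, 1])))
treatment = rng.binomial(1, p_true)

# Make about 30% of the second covariate missing.
X_incomplete = X.copy()
X_incomplete[rng.random(n) < 0.3, 1] = np.nan

m = 5  # number of imputed datasets (illustrative choice)
propensity_by_imputation = []
for seed in range(m):
    # sample_posterior=True yields a different stochastic imputation per seed.
    imputer = IterativeImputer(sample_posterior=True, max_iter=10,
                               random_state=seed)
    # The exposure is included as a predictor in the imputation model.
    completed = imputer.fit_transform(np.column_stack([X_incomplete, treatment]))
    covariates = completed[:, :3]  # drop the exposure column again
    gbm = GradientBoostingClassifier(n_estimators=50, max_depth=2,
                                     random_state=0)
    gbm.fit(covariates, treatment)
    # Column 1 holds the estimated probability of exposure.
    propensity_by_imputation.append(gbm.predict_proba(covariates)[:, 1])
```

Each element of `propensity_by_imputation` holds the estimated scores for one completed dataset. How to combine results across imputations is a separate choice that the abstract does not address.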
