Suppr超能文献

比较处理二分类暴露因素倾向性评分估计中协变量缺失的方法。

Comparison of methods for handling covariate missingness in propensity score estimation with a binary exposure.

机构信息

Temple University, 1301 Cecil B. Moore Ave. Ritter Annex, 9th floor, Philadelphia, PA, 19122, USA.

GlaxoSmithKline, Philadelphia, USA.

出版信息

BMC Med Res Methodol. 2020 Jun 26;20(1):168. doi: 10.1186/s12874-020-01053-4.

Abstract

BACKGROUND

Causal effect estimation with observational data is subject to bias due to confounding, which is often controlled for using propensity scores. One unresolved issue in propensity score estimation is how to handle missing values in covariates.

METHOD

Several approaches have been proposed for handling covariate missingness, including multiple imputation (MI), multiple imputation with missingness pattern (MIMP), and treatment mean imputation. However, there are other potentially useful approaches that have not been evaluated, including single imputation (SI) + prediction error (PE), SI + PE + parameter uncertainty (PU), and Generalized Boosted Modeling (GBM), which is a nonparametric approach for estimating propensity scores in which missing values are automatically handled in the estimation using a surrogate split method. To evaluate the performance of these approaches, a simulation study was conducted.

RESULTS

Results suggested that SI + PE, SI + PE + PU, MI, and MIMP perform almost equally well and better than treatment mean imputation and GBM in terms of bias; however, MI and MIMP account for the additional uncertainty of imputing the missingness.

CONCLUSIONS

Applying GBM to the incomplete data and relying on the surrogate split approach resulted in substantial bias. Imputation prior to implementing GBM is recommended.

摘要

背景

由于混杂因素的影响,观察性数据的因果效应估计会存在偏差,而倾向评分通常可用于控制混杂因素。在倾向评分估计中,一个未解决的问题是如何处理协变量中的缺失值。

方法

已经提出了几种处理协变量缺失值的方法,包括多重插补(MI)、带有缺失模式的多重插补(MIMP)和处理均值插补。然而,还有其他一些可能有用的方法尚未得到评估,包括单一插补(SI)+预测误差(PE)、SI+PE+参数不确定性(PU)和广义提升模型(GBM),这是一种非参数方法,用于估计倾向评分,其中缺失值在估计中使用替代分裂方法自动处理。为了评估这些方法的性能,进行了一项模拟研究。

结果

结果表明,在偏差方面,SI+PE、SI+PE+PU、MI 和 MIMP 的表现几乎相同,并且优于处理均值插补和 GBM;然而,MI 和 MIMP 考虑了插补缺失值的额外不确定性。

结论

将 GBM 应用于不完整数据并依赖于替代分裂方法会导致严重的偏差。建议在实施 GBM 之前进行插补。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7b70/7318364/011b2ae1f5c5/12874_2020_1053_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验