从易出错的电子健康记录衍生协变量估计倾向得分的偏差减少方法。

Bias Reduction Methods for Propensity Scores Estimated from Error-Prone EHR-Derived Covariates.

作者信息

Harton Joanna, Mamtani Ronac, Mitra Nandita, Hubbard Rebecca A

机构信息

Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, PA, USA.

Department of Medicine, University of Pennsylvania, Philadelphia, PA, USA.

出版信息

Health Serv Outcomes Res Methodol. 2021 Jun;21:169-187. doi: 10.1007/s10742-020-00219-3. Epub 2020 Sep 10.

DOI:10.1007/s10742-020-00219-3

PMID:34149306

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8210692/

Abstract

As the use of electronic health records (EHR) to estimate treatment effects has become widespread, concern about bias introduced by error in EHR-derived covariates has also grown. While methods exist to address measurement error in individual covariates, little prior research has investigated the implications of using propensity scores for confounder control when the propensity scores are constructed from a combination of accurate and error-prone covariates. We reviewed approaches to account for error in propensity scores and used simulation studies to compare their performance. These comparisons were conducted across a range of scenarios featuring variation in outcome type, validation sample size, main sample size, strength of confounding, and structure of the error in the mismeasured covariate. We then applied these approaches to a real-world EHR-based comparative effectiveness study of alternative treatments for metastatic bladder cancer. This head-to-head comparison of measurement error correction methods in the context of a propensity score-adjusted analysis demonstrated that multiple imputation for propensity scores performs best when the outcome is continuous and regression calibration-based methods perform best when the outcome is binary.

摘要

随着利用电子健康记录（EHR）来估计治疗效果的做法变得普遍，对EHR衍生协变量中的误差所引入的偏差的担忧也与日俱增。虽然存在处理单个协变量测量误差的方法，但此前很少有研究探讨当倾向得分由准确和易出错的协变量组合构建时，使用倾向得分进行混杂因素控制的影响。我们回顾了处理倾向得分误差的方法，并通过模拟研究比较它们的性能。这些比较是在一系列场景中进行的，这些场景的特征包括结局类型差异、验证样本量、主要样本量、混杂强度以及测量错误协变量中的误差结构。然后，我们将这些方法应用于一项基于EHR的转移性膀胱癌替代治疗的真实世界比较有效性研究。在倾向得分调整分析的背景下对测量误差校正方法进行的这种直接比较表明，当结局为连续型时，倾向得分多重插补表现最佳，而当结局为二分类时，基于回归校准的方法表现最佳。

相似文献

Bias Reduction Methods for Propensity Scores Estimated from Error-Prone EHR-Derived Covariates.从易出错的电子健康记录衍生协变量估计倾向得分的偏差减少方法。

Health Serv Outcomes Res Methodol. 2021 Jun;21:169-187. doi: 10.1007/s10742-020-00219-3. Epub 2020 Sep 10.

An imputation-based solution to using mismeasured covariates in propensity score analysis.一种在倾向得分分析中使用测量错误协变量的基于插补的解决方案。

Stat Methods Med Res. 2017 Aug;26(4):1824-1837. doi: 10.1177/0962280215588771. Epub 2015 Jun 2.

Propensity Score-Based Estimators With Multiple Error-Prone Covariates.基于倾向得分的多易错协变量估计量。

Am J Epidemiol. 2019 Jan 1;188(1):222-230. doi: 10.1093/aje/kwy210.

Inverse Probability of Treatment Weighting and Confounder Missingness in Electronic Health Record-based Analyses: A Comparison of Approaches Using Plasmode Simulation.基于电子病历的分析中治疗反概率加权和混杂因素缺失：使用 Plasmode 模拟比较方法。

Epidemiology. 2023 Jul 1;34(4):520-530. doi: 10.1097/EDE.0000000000001618. Epub 2023 Apr 26.

Propensity score analysis with partially observed covariates: How should multiple imputation be used?倾向评分分析与部分观测协变量：应如何使用多重插补？

Stat Methods Med Res. 2019 Jan;28(1):3-19. doi: 10.1177/0962280217713032. Epub 2017 Jun 2.

Using Sensitivity Analyses for Unobserved Confounding to Address Covariate Measurement Error in Propensity Score Methods.利用敏感性分析解决未观察到的混杂因素对倾向评分法中协变量测量误差的影响。

Am J Epidemiol. 2018 Mar 1;187(3):604-613. doi: 10.1093/aje/kwx248.

The impact of moderator by confounder interactions in the assessment of treatment effect modification: a simulation study.调节变量与混杂因素交互作用对治疗效果修饰评估的影响：一项模拟研究。

BMC Med Res Methodol. 2022 Apr 3;22(1):88. doi: 10.1186/s12874-022-01519-7.

Comparison of the ability of double-robust estimators to correct bias in propensity score matching analysis. A Monte Carlo simulation study.双重稳健估计在倾向评分匹配分析中校正偏差的能力比较。一项蒙特卡罗模拟研究。

Pharmacoepidemiol Drug Saf. 2017 Dec;26(12):1513-1519. doi: 10.1002/pds.4325. Epub 2017 Oct 6.

Applied comparison of large-scale propensity score matching and cardinality matching for causal inference in observational research.应用大规模倾向评分匹配和基数匹配在观察性研究中的因果推断的比较。

BMC Med Res Methodol. 2021 May 24;21(1):109. doi: 10.1186/s12874-021-01282-1.

The impact of covariate measurement error on risk prediction.协变量测量误差对风险预测的影响。

Stat Med. 2015 Jul 10;34(15):2353-67. doi: 10.1002/sim.6498. Epub 2015 Apr 10.

引用本文的文献

A future of data-rich pharmacoepidemiology studies: transitioning to large-scale linked electronic health record + claims data.数据丰富的药物流行病学研究的未来：向大规模关联电子健康记录+索赔数据的转变。

Am J Epidemiol. 2025 Feb 5;194(2):315-321. doi: 10.1093/aje/kwae226.

Association Between Partial Pressure of Arterial Carbon Dioxide and Survival to Hospital Discharge Among Patients Diagnosed With Sepsis in the Emergency Department.在急诊科诊断为脓毒症的患者中，动脉血二氧化碳分压与住院存活率的关系。

Crit Care Med. 2018 Mar;46(3):e213-e220. doi: 10.1097/CCM.0000000000002918.

本文引用的文献

Propensity Score-Based Estimators With Multiple Error-Prone Covariates.基于倾向得分的多易错协变量估计量。

Am J Epidemiol. 2019 Jan 1;188(1):222-230. doi: 10.1093/aje/kwy210.

Association of Broad-Based Genomic Sequencing With Survival Among Patients With Advanced Non-Small Cell Lung Cancer in the Community Oncology Setting.社区肿瘤学环境中广泛基于基因组测序与晚期非小细胞肺癌患者生存的关联。

JAMA. 2018 Aug 7;320(5):469-477. doi: 10.1001/jama.2018.9824.

Development and Validation of a High-Quality Composite Real-World Mortality Endpoint.高质量综合真实世界死亡率终点的开发与验证

Health Serv Res. 2018 Dec;53(6):4460-4476. doi: 10.1111/1475-6773.12872. Epub 2018 May 14.

Out-of-system Care and Recording of Patient Characteristics Critical for Comparative Effectiveness Research.系统外医疗护理和患者特征记录对比较疗效研究至关重要。

Epidemiology. 2018 May;29(3):356-363. doi: 10.1097/EDE.0000000000000794.

Harnessing the Power of Real-World Evidence (RWE): A Checklist to Ensure Regulatory-Grade Data Quality.利用真实世界证据（RWE）的力量：确保监管级数据质量的检查表。

Clin Pharmacol Ther. 2018 Feb;103(2):202-205. doi: 10.1002/cpt.946. Epub 2017 Dec 6.

Identifying Patients With High Data Completeness to Improve Validity of Comparative Effectiveness Research in Electronic Health Records Data.确定数据完整性高的患者，以提高电子健康记录数据中比较有效性研究的有效性。

Clin Pharmacol Ther. 2018 May;103(5):899-905. doi: 10.1002/cpt.861. Epub 2017 Oct 10.

Use of Electronic Health Record Data for Quality Reporting.使用电子健康记录数据进行质量报告。

J Oncol Pract. 2017 Aug;13(8):530-534. doi: 10.1200/JOP.2017.024224. Epub 2017 Jul 26.

Opportunities and challenges in leveraging electronic health record data in oncology.利用电子健康记录数据在肿瘤学领域面临的机遇与挑战。

Future Oncol. 2016 May;12(10):1261-74. doi: 10.2217/fon-2015-0043. Epub 2016 Mar 8.

An imputation-based solution to using mismeasured covariates in propensity score analysis.一种在倾向得分分析中使用测量错误协变量的基于插补的解决方案。

Stat Methods Med Res. 2017 Aug;26(4):1824-1837. doi: 10.1177/0962280215588771. Epub 2015 Jun 2.

Adjustment for missing confounders in studies based on observational databases: 2-stage calibration combining propensity scores from primary and validation data.基于观察性数据库的缺失混杂因素调整：结合主要数据和验证数据的倾向评分的两阶段校准。

Am J Epidemiol. 2014 Aug 1;180(3):308-17. doi: 10.1093/aje/kwu130. Epub 2014 Jun 24.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验