使用倾向评分法进行医学索赔比较效果研究的真实因果推断

Veridical Causal Inference using Propensity Score Methods for Comparative Effectiveness Research with Medical Claims.

作者信息

Ross Ryan D, Shi Xu, Caram Megan E V, Tsao Pheobe A, Lin Paul, Bohnert Amy, Zhang Min, Mukherjee Bhramar

机构信息

Department of Biostatistics, School of Public Health, University of Michigan.

Department of Internal Medicine, Division of Hematology/Oncology, University of Michigan Medical School.

出版信息

Health Serv Outcomes Res Methodol. 2021 Jun;21(2):206-228. doi: 10.1007/s10742-020-00222-8. Epub 2020 Oct 20.

DOI:10.1007/s10742-020-00222-8

PMID:34040495

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8142944/

Abstract

Medical insurance claims are becoming increasingly common data sources to answer a variety of questions in biomedical research. Although comprehensive in terms of longitudinal characterization of disease development and progression for a potentially large number of patients, population-based inference using these datasets require thoughtful modifications to sample selection and analytic strategies relative to other types of studies. Along with complex selection bias and missing data issues, claims-based studies are purely observational, which limits effective understanding and characterization of the treatment differences between groups being compared. All these issues contribute to a crisis in reproducibility and replication of comparative findings using medical claims. This paper offers practical guidance to the analytical process, demonstrates methods for estimating causal treatment effects with propensity score methods for several types of outcomes common to such studies, such as binary, count, time to event and longitudinally-varying measures, and also aims to increase transparency and reproducibility of reporting of results from these investigations. We provide an online version of the paper with readily implementable code for the entire analysis pipeline to serve as a guided tutorial for practitioners. The online version can be accessed at https://rydaro.github.io/. The analytic pipeline is illustrated using a sub-cohort of patients with advanced prostate cancer from the large Clinformatics TM Data Mart Database (OptumInsight, Eden Prairie, Minnesota), consisting of 73 million distinct private payer insurees from 2001-2016.

摘要

医疗保险理赔数据正日益成为生物医学研究中回答各种问题的常见数据源。尽管就大量患者疾病发展和进展的纵向特征而言，这些数据集具有全面性，但与其他类型的研究相比，使用这些数据集进行基于人群的推断需要对样本选择和分析策略进行深思熟虑的调整。除了复杂的选择偏倚和数据缺失问题外，基于理赔数据的研究纯粹是观察性的，这限制了对所比较组之间治疗差异的有效理解和特征描述。所有这些问题都导致了使用医疗理赔数据进行比较研究结果的可重复性和再现性危机。本文为分析过程提供了实用指导，展示了使用倾向评分方法估计几种此类研究常见结果类型（如二元结果、计数结果、事件发生时间和纵向变化测量结果）的因果治疗效果的方法，并且旨在提高这些调查结果报告的透明度和可重复性。我们提供了本文的在线版本，其中包含整个分析流程易于实现的代码，作为从业者的指导教程。可通过https://rydaro.github.io/访问在线版本。使用来自大型临床信息学TM数据集市数据库（OptumInsight，明尼苏达州伊甸草原）的晚期前列腺癌患者子队列说明了分析流程，该数据库包含2001年至2016年期间7300万不同的私人支付者被保险人。

相似文献

Veridical Causal Inference using Propensity Score Methods for Comparative Effectiveness Research with Medical Claims.使用倾向评分法进行医学索赔比较效果研究的真实因果推断

Health Serv Outcomes Res Methodol. 2021 Jun;21(2):206-228. doi: 10.1007/s10742-020-00222-8. Epub 2020 Oct 20.

Applied comparison of large-scale propensity score matching and cardinality matching for causal inference in observational research.应用大规模倾向评分匹配和基数匹配在观察性研究中的因果推断的比较。

BMC Med Res Methodol. 2021 May 24;21(1):109. doi: 10.1186/s12874-021-01282-1.

A comparison of parametric propensity score-based methods for causal inference with multiple treatments and a binary outcome.多处理因素和二分类结局下基于参数倾向评分的因果推断方法比较。

Stat Med. 2021 Mar 30;40(7):1653-1677. doi: 10.1002/sim.8862. Epub 2021 Jan 18.

The project data sphere initiative: accelerating cancer research by sharing data.项目数据领域计划：通过数据共享加速癌症研究

Oncologist. 2015 May;20(5):464-e20. doi: 10.1634/theoncologist.2014-0431. Epub 2015 Apr 15.

Balancing Confounding and Generalizability Using Observational, Real-world Data: 17-gene Genomic Prostate Score Assay Effect on Active Surveillance.利用观察性真实世界数据平衡混杂因素与可推广性：17基因基因组前列腺评分检测对主动监测的影响

Rev Urol. 2018;20(2):69-76. doi: 10.3909/riu0799.

Causal Inference Methods for Estimating Long-Term Health Effects of Air Quality Regulations.用于评估空气质量法规长期健康影响的因果推断方法。

Res Rep Health Eff Inst. 2016 May(187):5-49.

Retrospective comparative effectiveness research: Will changing the analytical methods change the results?回顾性比较效果研究：改变分析方法会改变结果吗？

Int J Cancer. 2022 Jun 15;150(12):1933-1940. doi: 10.1002/ijc.33946. Epub 2022 Feb 16.

Translational Metabolomics of Head Injury: Exploring Dysfunctional Cerebral Metabolism with Ex Vivo NMR Spectroscopy-Based Metabolite Quantification头部损伤的转化代谢组学：基于体外核磁共振波谱的代谢物定量分析探索脑代谢功能障碍

The future of Cochrane Neonatal.考克兰新生儿协作网的未来。

Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.

引用本文的文献

Long-term outcomes after unilateral salpingo-oophorectomy: A registry-based retrospective cohort study.单侧输卵管卵巢切除术的长期结局：一项基于登记处的回顾性队列研究。

PLoS Med. 2025 Jul 7;22(7):e1004639. doi: 10.1371/journal.pmed.1004639. eCollection 2025 Jul.

Accurate treatment effect estimation using inverse probability of treatment weighting with deep learning.使用深度学习的治疗权重逆概率进行准确的治疗效果估计。

JAMIA Open. 2025 Apr 26;8(2):ooaf032. doi: 10.1093/jamiaopen/ooaf032. eCollection 2025 Apr.

Lymph node yield does not affect the cancer-specific survival of patients with T1 colorectal cancer: a population-based retrospective study of the U.S. database and a Chinese registry.淋巴结获取数量不影响T1期结直肠癌患者的癌症特异性生存率：一项基于美国数据库和中国登记处的人群回顾性研究。

Int J Colorectal Dis. 2025 Feb 5;40(1):31. doi: 10.1007/s00384-025-04816-x.

The Impact of Routine Vaccinations on Alzheimer's Disease Risk in Persons 65 Years and Older: A Claims-Based Cohort Study using Propensity Score Matching.基于倾向评分匹配的基于理赔的队列研究：常规疫苗接种对 65 岁及以上人群阿尔茨海默病风险的影响。

J Alzheimers Dis. 2023;95(2):703-718. doi: 10.3233/JAD-221231.

Risk of Alzheimer's Disease Following Influenza Vaccination: A Claims-Based Cohort Study Using Propensity Score Matching.流感疫苗接种后患阿尔茨海默病的风险：基于倾向评分匹配的基于索赔的队列研究。

J Alzheimers Dis. 2022;88(3):1061-1074. doi: 10.3233/JAD-220361.

本文引用的文献

Association of Mood and Anxiety Disorders and Opioid Prescription Patterns Among Postpartum Women.产后妇女的情绪和焦虑障碍与阿片类药物处方模式的关联。

Am J Addict. 2020 Nov;29(6):463-470. doi: 10.1111/ajad.13028. Epub 2020 Apr 6.

Veridical data science.真实数据科学。

Proc Natl Acad Sci U S A. 2020 Feb 25;117(8):3920-3929. doi: 10.1073/pnas.1901326117. Epub 2020 Feb 13.

Safety surveillance and the estimation of risk in select populations: Flexible methods to control for confounding while targeting marginal comparisons via standardization.特定人群的安全性监测和风险评估：通过标准化针对边缘比较进行控制混杂的灵活方法。

Stat Med. 2020 Feb 20;39(4):369-386. doi: 10.1002/sim.8410. Epub 2019 Dec 10.

Prescribing Patterns Associated With Biologic Therapies for Psoriasis from a United States Medical Records Database.来自美国医疗记录数据库的银屑病生物疗法相关处方模式。

J Drugs Dermatol. 2019 Aug 1;18(8):745-750.

Patient and Provider Variables Associated with Variation in the Systemic Treatment of Advanced Prostate Cancer.与晚期前列腺癌系统治疗差异相关的患者和医疗服务提供者变量

Urol Pract. 2019 Jul 1;6(4):234-242. doi: 10.1097/UPJ.0000000000000020.

Zostavax vaccine effectiveness among US elderly using real-world evidence: Addressing unmeasured confounders by using multiple imputation after linking beneficiary surveys with Medicare claims.使用真实世界证据评估美国老年人中Zostavax疫苗的有效性：通过将受益人的调查与医疗保险理赔数据相链接后采用多重填补法来处理未测量的混杂因素。

Pharmacoepidemiol Drug Saf. 2019 Jul;28(7):993-1001. doi: 10.1002/pds.4801. Epub 2019 Jun 5.

Adjusted restricted mean survival times in observational studies.观察性研究中的调整受限平均生存时间。

Stat Med. 2019 Sep 10;38(20):3832-3860. doi: 10.1002/sim.8206. Epub 2019 May 22.

Factors Associated With Use of Sipuleucel-T to Treat Patients With Advanced Prostate Cancer.与使用 sipuleucel-T 治疗晚期前列腺癌患者相关的因素。

JAMA Netw Open. 2019 Apr 5;2(4):e192589. doi: 10.1001/jamanetworkopen.2019.2589.

Comparative effectiveness of generic and brand-name medication use: A database study of US health insurance claims.比较仿制药和品牌药使用的效果：一项基于美国健康保险索赔数据库的研究。

PLoS Med. 2019 Mar 13;16(3):e1002763. doi: 10.1371/journal.pmed.1002763. eCollection 2019 Mar.

Principles of confounder selection.混杂因素选择原则。

Eur J Epidemiol. 2019 Mar;34(3):211-219. doi: 10.1007/s10654-019-00494-6. Epub 2019 Mar 6.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验