• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用医疗保健数据库中的观察数据比较估计异质治疗效果的方法。

Comparing methods for estimation of heterogeneous treatment effects using observational data from health care databases.

机构信息

Centre for Health Informatics, Australian Institute of Health Innovation, Macquarie University, Sydney, Australia.

Stanford Center for Biomedical Informatics Research, Stanford University, Stanford, USA.

出版信息

Stat Med. 2018 Oct 15;37(23):3309-3324. doi: 10.1002/sim.7820. Epub 2018 Jun 3.

DOI:10.1002/sim.7820
PMID:29862536
Abstract

There is growing interest in using routinely collected data from health care databases to study the safety and effectiveness of therapies in "real-world" conditions, as it can provide complementary evidence to that of randomized controlled trials. Causal inference from health care databases is challenging because the data are typically noisy, high dimensional, and most importantly, observational. It requires methods that can estimate heterogeneous treatment effects while controlling for confounding in high dimensions. Bayesian additive regression trees, causal forests, causal boosting, and causal multivariate adaptive regression splines are off-the-shelf methods that have shown good performance for estimation of heterogeneous treatment effects in observational studies of continuous outcomes. However, it is not clear how these methods would perform in health care database studies where outcomes are often binary and rare and data structures are complex. In this study, we evaluate these methods in simulation studies that recapitulate key characteristics of comparative effectiveness studies. We focus on the conditional average effect of a binary treatment on a binary outcome using the conditional risk difference as an estimand. To emulate health care database studies, we propose a simulation design where real covariate and treatment assignment data are used and only outcomes are simulated based on nonparametric models of the real outcomes. We apply this design to 4 published observational studies that used records from 2 major health care databases in the United States. Our results suggest that Bayesian additive regression trees and causal boosting consistently provide low bias in conditional risk difference estimates in the context of health care database studies.

摘要

人们越来越感兴趣的是利用医疗保健数据库中常规收集的数据,在“真实世界”条件下研究治疗方法的安全性和有效性,因为它可以为随机对照试验的证据提供补充。从医疗保健数据库中进行因果推断具有挑战性,因为这些数据通常是嘈杂的、高维的,最重要的是,是观察性的。这需要能够在高维环境中控制混杂因素的同时估计异质治疗效果的方法。贝叶斯加法回归树、因果森林、因果提升和因果多元自适应回归样条是现成的方法,它们在连续结果的观察性研究中对异质治疗效果的估计表现出良好的性能。然而,尚不清楚这些方法在医疗保健数据库研究中的表现如何,因为这些研究中的结果通常是二分类的、罕见的,并且数据结构复杂。在这项研究中,我们在模拟研究中评估了这些方法,这些模拟研究再现了比较疗效研究的关键特征。我们关注的是二分类治疗对二分类结果的条件平均效应,使用条件风险差作为估计量。为了模拟医疗保健数据库研究,我们提出了一种模拟设计,其中使用真实的协变量和治疗分配数据,仅根据真实结果的非参数模型模拟结果。我们将此设计应用于 4 项已发表的观察性研究,这些研究使用了来自美国 2 个主要医疗保健数据库的记录。我们的结果表明,在医疗保健数据库研究的背景下,贝叶斯加法回归树和因果提升始终能提供条件风险差估计的低偏差。

相似文献

1
Comparing methods for estimation of heterogeneous treatment effects using observational data from health care databases.利用医疗保健数据库中的观察数据比较估计异质治疗效果的方法。
Stat Med. 2018 Oct 15;37(23):3309-3324. doi: 10.1002/sim.7820. Epub 2018 Jun 3.
2
Some methods for heterogeneous treatment effect estimation in high dimensions.一些在高维中进行异质处理效应估计的方法。
Stat Med. 2018 May 20;37(11):1767-1787. doi: 10.1002/sim.7623. Epub 2018 Mar 6.
3
Targeted Maximum Likelihood Estimation for Causal Inference in Observational Studies.观察性研究中因果推断的靶向最大似然估计
Am J Epidemiol. 2017 Jan 1;185(1):65-73. doi: 10.1093/aje/kww165. Epub 2016 Dec 9.
4
High-dimensional propensity score algorithm in comparative effectiveness research with time-varying interventions.高维倾向评分算法在具有时变干预措施的比较效果研究中的应用
Stat Med. 2015 Feb 28;34(5):753-81. doi: 10.1002/sim.6377. Epub 2014 Dec 8.
5
A Bayesian nonparametric approach to causal inference on quantiles.一种用于分位数因果推断的贝叶斯非参数方法。
Biometrics. 2018 Sep;74(3):986-996. doi: 10.1111/biom.12863. Epub 2018 Feb 25.
6
Estimating heterogeneous survival treatment effect in observational data using machine learning.利用机器学习估计观察性数据中异质生存治疗效果。
Stat Med. 2021 Sep 20;40(21):4691-4713. doi: 10.1002/sim.9090. Epub 2021 Jun 10.
7
Comparing the performance of propensity score methods in healthcare database studies with rare outcomes.比较倾向评分方法在具有罕见结局的医疗保健数据库研究中的性能。
Stat Med. 2017 May 30;36(12):1946-1963. doi: 10.1002/sim.7250. Epub 2017 Feb 16.
8
A comparison of Bayesian and Monte Carlo sensitivity analysis for unmeasured confounding.贝叶斯分析与蒙特卡洛分析在未测量混杂因素敏感性分析中的比较
Stat Med. 2017 Aug 15;36(18):2887-2901. doi: 10.1002/sim.7298. Epub 2017 Apr 6.
9
Simultaneous record linkage and causal inference with propensity score subclassification.同时进行倾向评分亚组分类的记录链接和因果推断。
Stat Med. 2018 Oct 30;37(24):3533-3546. doi: 10.1002/sim.7911. Epub 2018 Aug 1.
10
Estimation of causal effects of multiple treatments in observational studies with a binary outcome.二元结局观察性研究中多种治疗因果效应的估计。
Stat Methods Med Res. 2020 Nov;29(11):3218-3234. doi: 10.1177/0962280220921909. Epub 2020 May 25.

引用本文的文献

1
Methodologies for the Emulation of Biomarker-Guided Trials Using Observational Data: A Systematic Review.使用观察性数据模拟生物标志物引导试验的方法:一项系统综述。
J Pers Med. 2025 May 10;15(5):195. doi: 10.3390/jpm15050195.
2
Integrative analysis of high-dimensional RCT and RWD subject to censoring and hidden confounding.对受删失和隐藏混杂因素影响的高维随机对照试验和真实世界数据进行综合分析。
Lifetime Data Anal. 2025 Jul;31(3):473-497. doi: 10.1007/s10985-025-09654-1. Epub 2025 Apr 29.
3
How to select predictive models for decision-making or causal inference.
如何选择用于决策或因果推断的预测模型。
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf016.
4
An overview of modern machine learning methods for effect measure modification analyses in high-dimensional settings.高维环境下效应量修正分析的现代机器学习方法综述。
SSM Popul Health. 2025 Feb 13;29:101764. doi: 10.1016/j.ssmph.2025.101764. eCollection 2025 Mar.
5
Step-by-step causal analysis of EHRs to ground decision-making.对电子健康记录进行逐步因果分析以支持决策制定。
PLOS Digit Health. 2025 Feb 3;4(2):e0000721. doi: 10.1371/journal.pdig.0000721. eCollection 2025 Feb.
6
Human-centered design of a health recommender system for orthopaedic shoulder treatment.用于骨科肩部治疗的健康推荐系统的以人为本设计。
BMC Med Inform Decis Mak. 2025 Jan 10;25(1):17. doi: 10.1186/s12911-025-02850-x.
7
Human-centered Design of a Health Recommender System for Orthopaedic Shoulder Treatment.用于骨科肩部治疗的健康推荐系统的以人为本设计
Res Sq. 2024 May 21:rs.3.rs-4359437. doi: 10.21203/rs.3.rs-4359437/v1.
8
Assessing the properties of patient-specific treatment effect estimates from causal forest algorithms under essential heterogeneity.评估因果森林算法在本质异质性下对特定于患者的治疗效果估计的特性。
BMC Med Res Methodol. 2024 Mar 13;24(1):66. doi: 10.1186/s12874-024-02187-5.
9
A BAYESIAN MACHINE LEARNING APPROACH FOR ESTIMATING HETEROGENEOUS SURVIVOR CAUSAL EFFECTS: APPLICATIONS TO A CRITICAL CARE TRIAL.一种用于估计异质幸存者因果效应的贝叶斯机器学习方法:在重症监护试验中的应用
Ann Appl Stat. 2024 Mar;18(1):350-374. doi: 10.1214/23-aoas1792. Epub 2024 Jan 31.
10
Comparison of methods that combine multiple randomized trials to estimate heterogeneous treatment effects.比较合并多个随机试验以估计异质治疗效果的方法。
Stat Med. 2024 Mar 30;43(7):1291-1314. doi: 10.1002/sim.9955. Epub 2024 Jan 25.