使用观测数据对基于比率的条件平均治疗效果进行估计与验证

Estimation and Validation of Ratio-based Conditional Average Treatment Effects Using Observational Data.

作者信息

Yadlowsky Steve, Pellegrini Fabio, Lionetto Federica, Braune Stefan, Tian Lu

机构信息

Stanford University, Electrical Engineering, 1265 Welch Rd, Stanford, 94305-6104 United States.

Biogen International GmbH, Baar, Switzerland.

出版信息

J Am Stat Assoc. 2021;116(533):335-352. doi: 10.1080/01621459.2020.1772080. Epub 2020 Jul 7.

DOI:10.1080/01621459.2020.1772080

PMID:33767517

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7985957/

Abstract

While sample sizes in randomized clinical trials are large enough to estimate the average treatment effect well, they are often insufficient for estimation of treatment-covariate interactions critical to studying data-driven precision medicine. Observational data from real world practice may play an important role in alleviating this problem. One common approach in trials is to predict the outcome of interest with separate regression models in each treatment arm, and estimate the treatment effect based on the contrast of the predictions. Unfortunately, this simple approach may induce spurious treatment-covariate interaction in observational studies when the regression model is misspecified. Motivated by the need of modeling the number of relapses in multiple sclerosis patients, where the ratio of relapse rates is a natural choice of the treatment effect, we propose to estimate the conditional average treatment effect (CATE) as the ratio of expected potential outcomes, and derive a doubly robust estimator of this CATE in a semiparametric model of treatment-covariate interactions. We also provide a validation procedure to check the quality of the estimator on an independent sample. We conduct simulations to demonstrate the finite sample performance of the proposed methods, and illustrate their advantages on real data by examining the treatment effect of dimethyl fumarate compared to teriflunomide in multiple sclerosis patients.

摘要

虽然随机临床试验中的样本量足够大，可以很好地估计平均治疗效果，但对于估计对研究数据驱动的精准医学至关重要的治疗-协变量相互作用来说，往往还不够。来自实际临床实践的观察性数据可能在缓解这一问题方面发挥重要作用。试验中的一种常见方法是在每个治疗组中使用单独的回归模型预测感兴趣的结果，并根据预测的对比来估计治疗效果。不幸的是，当回归模型设定错误时，这种简单方法可能在观察性研究中导致虚假的治疗-协变量相互作用。受对多发性硬化症患者复发次数进行建模需求的推动，其中复发率之比是治疗效果的自然选择，我们建议将条件平均治疗效果（CATE）估计为预期潜在结果的比值，并在治疗-协变量相互作用的半参数模型中推导该CATE的双重稳健估计量。我们还提供了一个验证程序，以在独立样本上检查估计量的质量。我们进行模拟以证明所提出方法的有限样本性能，并通过研究富马酸二甲酯与特立氟胺在多发性硬化症患者中的治疗效果，在真实数据上说明它们的优势。

相似文献

Estimation and Validation of Ratio-based Conditional Average Treatment Effects Using Observational Data.使用观测数据对基于比率的条件平均治疗效果进行估计与验证

J Am Stat Assoc. 2021;116(533):335-352. doi: 10.1080/01621459.2020.1772080. Epub 2020 Jul 7.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

BOUNDS ON THE CONDITIONAL AND AVERAGE TREATMENT EFFECT WITH UNOBSERVED CONFOUNDING FACTORS.存在未观测混杂因素时条件处理效应和平均处理效应的界值

Ann Stat. 2022 Oct;50(5):2587-2615. doi: 10.1214/22-aos2195. Epub 2022 Oct 27.

Doubly robust estimation and causal inference for recurrent event data.复发事件数据的双重稳健估计与因果推断

Stat Med. 2020 Jul 30;39(17):2324-2338. doi: 10.1002/sim.8541. Epub 2020 Apr 28.

Model misspecification and bias for inverse probability weighting estimators of average causal effects.模型误设定和平均因果效应逆概率加权估计的偏差。

Biom J. 2023 Feb;65(2):e2100118. doi: 10.1002/bimj.202100118. Epub 2022 Aug 31.

Improved inference for doubly robust estimators of heterogeneous treatment effects.改善异质处理效应双重稳健估计量的推断。

Biometrics. 2023 Dec;79(4):3140-3152. doi: 10.1111/biom.13837. Epub 2023 Feb 15.

Covariate adjustment and estimation of difference in proportions in randomized clinical trials.随机临床试验中协变量调整及比例差异估计

Pharm Stat. 2024 Nov-Dec;23(6):884-905. doi: 10.1002/pst.2397. Epub 2024 May 19.

Model misspecification and robustness in causal inference: comparing matching with doubly robust estimation.因果推断中的模型误设定与稳健性：比较匹配法和双重稳健估计。

Stat Med. 2012 Jul 10;31(15):1572-81. doi: 10.1002/sim.4496. Epub 2012 Feb 23.

Framework for personalized prediction of treatment response in relapsing remitting multiple sclerosis.复发缓解型多发性硬化症个体化治疗反应预测的框架。

BMC Med Res Methodol. 2020 Feb 7;20(1):24. doi: 10.1186/s12874-020-0906-6.

Estimation of average treatment effect based on a multi-index propensity score.基于多指标倾向评分的平均处理效应估计。

BMC Med Res Methodol. 2022 Dec 28;22(1):337. doi: 10.1186/s12874-022-01822-3.

引用本文的文献

Toward a causal model of chronic back pain: Challenges and opportunities.迈向慢性背痛的因果模型：挑战与机遇。

Front Comput Neurosci. 2023 Jan 11;16:1017412. doi: 10.3389/fncom.2022.1017412. eCollection 2022.

Overall and patient-level comparative effectiveness of dimethyl fumarate and fingolimod: A precision medicine application to the Observatoire Français de la Sclérose en Plaques registry.富马酸二甲酯与芬戈莫德的总体及患者水平的比较疗效：在法国多发性硬化症观察登记处的精准医学应用

Mult Scler J Exp Transl Clin. 2022 Aug 4;8(3):20552173221116591. doi: 10.1177/20552173221116591. eCollection 2022 Jul-Sep.

Implementation of a data control framework to ensure confidentiality, integrity, and availability of high-quality real-world data (RWD) in the NeuroTransData (NTD) registry.在神经传输数据（NTD）注册中心实施数据控制框架，以确保高质量真实世界数据（RWD）的保密性、完整性和可用性。

JAMIA Open. 2022 Mar 9;5(1):ooac017. doi: 10.1093/jamiaopen/ooac017. eCollection 2022 Apr.

本文引用的文献

Multi-Armed Angle-Based Direct Learning for Estimating Optimal Individualized Treatment Rules With Various Outcomes.基于多臂角度的直接学习法用于估计具有多种结局的最优个体化治疗规则

J Am Stat Assoc. 2020;115(530):678-691. doi: 10.1080/01621459.2018.1529597. Epub 2019 Apr 11.

Metalearners for estimating heterogeneous treatment effects using machine learning.使用机器学习估计异质处理效应的元学习器。

Proc Natl Acad Sci U S A. 2019 Mar 5;116(10):4156-4165. doi: 10.1073/pnas.1804597116. Epub 2019 Feb 15.

Comparing methods for estimation of heterogeneous treatment effects using observational data from health care databases.利用医疗保健数据库中的观察数据比较估计异质治疗效果的方法。

Stat Med. 2018 Oct 15;37(23):3309-3324. doi: 10.1002/sim.7820. Epub 2018 Jun 3.

Estimating Individual Treatment Effect in Observational Data Using Random Forest Methods.使用随机森林方法估计观察性数据中的个体治疗效果。

J Comput Graph Stat. 2018;27(1):209-219. doi: 10.1080/10618600.2017.1356325. Epub 2018 Feb 1.

A Note on G-Estimation of Causal Risk Ratios.关于因果风险比的 G 估计的注释。

Am J Epidemiol. 2018 May 1;187(5):1079-1084. doi: 10.1093/aje/kwx347.

Some methods for heterogeneous treatment effect estimation in high dimensions.一些在高维中进行异质处理效应估计的方法。

Stat Med. 2018 May 20;37(11):1767-1787. doi: 10.1002/sim.7623. Epub 2018 Mar 6.

Benefit and harm of intensive blood pressure treatment: Derivation and validation of risk models using data from the SPRINT and ACCORD trials.强化血压治疗的益处与危害：利用收缩压干预试验（SPRINT）和控制糖尿病患者心血管风险行动（ACCORD）试验数据推导和验证风险模型

PLoS Med. 2017 Oct 17;14(10):e1002410. doi: 10.1371/journal.pmed.1002410. eCollection 2017 Oct.

Residual Weighted Learning for Estimating Individualized Treatment Rules.用于估计个体化治疗规则的残差加权学习

J Am Stat Assoc. 2017;112(517):169-187. doi: 10.1080/01621459.2015.1093947. Epub 2017 May 3.

A general statistical framework for subgroup identification and comparative treatment scoring.用于亚组识别和比较治疗评分的通用统计框架。

Biometrics. 2017 Dec;73(4):1199-1209. doi: 10.1111/biom.12676. Epub 2017 Feb 17.

Recursive partitioning for heterogeneous causal effects.异质因果效应的递归划分

Proc Natl Acad Sci U S A. 2016 Jul 5;113(27):7353-60. doi: 10.1073/pnas.1510489113.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验