• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

预防随机试验中异质性治疗效果亚组的错误发现。

Preventing false discovery of heterogeneous treatment effect subgroups in randomized trials.

作者信息

Rigdon Joseph, Baiocchi Michael, Basu Sanjay

机构信息

Quantitative Sciences Unit, Stanford University School of Medicine, 1070 Arastradero Road #3C3104, MC 5559, Palo Alto, California, 94304, USA.

Stanford Prevention Research Center, Stanford University School of Medicine, Medical School Office Building, Room 318,1265 Welch Road, MC 5411, Stanford, CA, 94305, USA.

出版信息

Trials. 2018 Jul 16;19(1):382. doi: 10.1186/s13063-018-2774-5.

DOI:10.1186/s13063-018-2774-5
PMID:30012181
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6048878/
Abstract

BACKGROUND

Heterogeneous treatment effects (HTEs), or systematic differences in treatment effectiveness among participants with different observable features, may be important when applying trial results to clinical practice. Current methods suffer from a potential for false detection of HTEs due to imbalances in covariates between candidate subgroups.

METHODS

We introduce a new method, matching plus classification and regression trees (mCART), that yields balance in covariates in identified HTE subgroups. We compared mCART to a classical method (logistic regression [LR] with backwards covariate selection using the Akaike information criterion ) and two machine-learning approaches increasingly applied to HTE detection (random forest [RF] and gradient RF) in simulations with a binary outcome with known HTE subgroups. We considered an N = 200 phase II oncology trial where there were either no HTEs (1A) or two HTE subgroups (1B) and an N = 6000 phase III cardiovascular disease trial where there were either no HTEs (2A) or four HTE subgroups (2B). Additionally, we considered an N = 6000 phase III cardiovascular disease trial where there was no average treatment effect but there were four HTE subgroups (2C).

RESULTS

In simulations 1A and 2A (no HTEs), mCART did not identify any HTE subgroups, whereas LR found 2 and 448, RF 5 and 2, and gradient RF 5 and 24, respectively (all false positives). In simulation 1B, mCART failed to identify the two true HTE subgroups whereas LR found 4, RF 6, and gradient RF 10 (half or more of which were false positives). In simulations 2B and 2C, mCART captured the four true HTE subgroups, whereas the other methods found only false positives. All HTE subgroups identified by mCART had acceptable treated vs. control covariate balance with absolute standardized differences less than 0.2, whereas the absolute standardized differences for the other methods typically exceeded 0.2. The imbalance in covariates in identified subgroups for LR, RF, and gradient RF indicates the false HTE detection may have been due to confounding.

CONCLUSIONS

Covariate imbalances may be producing false positives in subgroup analyses. mCART could be a useful tool to help prevent the false discovery of HTE subgroups in secondary analyses of randomized trial data.

摘要

背景

异质性治疗效果(HTEs),即具有不同可观察特征的参与者在治疗效果上的系统性差异,在将试验结果应用于临床实践时可能很重要。由于候选亚组之间协变量的不平衡,当前方法存在误判HTEs的可能性。

方法

我们引入了一种新方法,匹配加分类与回归树(mCART),该方法能使已识别的HTE亚组中的协变量达到平衡。在具有已知HTE亚组的二元结局模拟中,我们将mCART与一种经典方法(使用赤池信息准则进行反向协变量选择的逻辑回归[LR])以及两种越来越多地应用于HTE检测的机器学习方法(随机森林[RF]和梯度随机森林)进行了比较。我们考虑了一项N = 200的II期肿瘤学试验,其中要么不存在HTEs(1A),要么存在两个HTE亚组(1B),以及一项N = 6000的III期心血管疾病试验,其中要么不存在HTEs(2A),要么存在四个HTE亚组(2B)。此外,我们还考虑了一项N = 6000的III期心血管疾病试验,其中不存在平均治疗效果,但存在四个HTE亚组(2C)。

结果

在模拟1A和2A(不存在HTEs)中,mCART未识别出任何HTE亚组,而LR分别发现了2个和448个,RF发现了5个和2个,梯度随机森林发现了5个和24个(均为假阳性)。在模拟1B中,mCART未能识别出两个真正的HTE亚组,而LR发现了4个,RF发现了6个,梯度随机森林发现了10个(其中一半或更多为假阳性)。在模拟2B和2C中,mCART捕捉到了四个真正的HTE亚组,而其他方法只发现了假阳性。mCART识别出的所有HTE亚组在治疗组与对照组协变量平衡方面均可接受,绝对标准化差异小于0.2,而其他方法的绝对标准化差异通常超过0.2。LR、RF和梯度随机森林在已识别亚组中的协变量不平衡表明,误判HTEs可能是由于混杂因素导致的。

结论

协变量不平衡可能在亚组分析中产生假阳性结果。mCART可能是一种有用的工具,有助于在随机试验数据的二次分析中防止误发现HTE亚组。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e987/6048878/254d6d0a4371/13063_2018_2774_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e987/6048878/d1ef3d06e994/13063_2018_2774_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e987/6048878/b590eaf5fdd8/13063_2018_2774_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e987/6048878/506686b02d8f/13063_2018_2774_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e987/6048878/8fd70f9461dc/13063_2018_2774_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e987/6048878/0e23ab3abede/13063_2018_2774_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e987/6048878/8c07d8d1698d/13063_2018_2774_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e987/6048878/254d6d0a4371/13063_2018_2774_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e987/6048878/d1ef3d06e994/13063_2018_2774_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e987/6048878/b590eaf5fdd8/13063_2018_2774_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e987/6048878/506686b02d8f/13063_2018_2774_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e987/6048878/8fd70f9461dc/13063_2018_2774_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e987/6048878/0e23ab3abede/13063_2018_2774_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e987/6048878/8c07d8d1698d/13063_2018_2774_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e987/6048878/254d6d0a4371/13063_2018_2774_Fig7_HTML.jpg

相似文献

1
Preventing false discovery of heterogeneous treatment effect subgroups in randomized trials.预防随机试验中异质性治疗效果亚组的错误发现。
Trials. 2018 Jul 16;19(1):382. doi: 10.1186/s13063-018-2774-5.
2
Outcome risk model development for heterogeneity of treatment effect analyses: a comparison of non-parametric machine learning methods and semi-parametric statistical methods.治疗效果分析异质性的结局风险模型开发:非参数机器学习方法与半参数统计方法的比较
BMC Med Res Methodol. 2024 Jul 23;24(1):158. doi: 10.1186/s12874-024-02265-8.
3
Generalizability of heterogeneous treatment effects based on causal forests applied to two randomized clinical trials of intensive glycemic control.基于因果森林的异质治疗效果的可推广性,应用于两项强化血糖控制的随机临床试验。
Ann Epidemiol. 2022 Jan;65:101-108. doi: 10.1016/j.annepidem.2021.07.003. Epub 2021 Jul 17.
4
Comparison of machine learning clustering algorithms for detecting heterogeneity of treatment effect in acute respiratory distress syndrome: A secondary analysis of three randomised controlled trials.机器学习聚类算法在急性呼吸窘迫综合征治疗效果异质性检测中的比较:三项随机对照试验的二次分析。
EBioMedicine. 2021 Dec;74:103697. doi: 10.1016/j.ebiom.2021.103697. Epub 2021 Dec 1.
5
The Predictive Approaches to Treatment effect Heterogeneity (PATH) Statement.预测治疗效果异质性的方法(PATH)声明。
Ann Intern Med. 2020 Jan 7;172(1):35-45. doi: 10.7326/M18-3667. Epub 2019 Nov 12.
6
Heterogeneous treatment effects in social policy studies: An assessment of contemporary articles in the health and social sciences.社会政策研究中的异质处理效应:对健康和社会科学领域当代文章的评估。
Ann Epidemiol. 2022 Jun;70:79-88. doi: 10.1016/j.annepidem.2022.04.009. Epub 2022 Apr 26.
7
Assessing heterogeneity of treatment effect analyses in health-related cluster randomized trials: A systematic review.评估健康相关整群随机试验中治疗效果分析的异质性:系统评价。
PLoS One. 2019 Aug 12;14(8):e0219894. doi: 10.1371/journal.pone.0219894. eCollection 2019.
8
Heterogeneous treatment effect analysis based on machine-learning methodology.基于机器学习方法的异质处理效应分析。
CPT Pharmacometrics Syst Pharmacol. 2021 Nov;10(11):1433-1443. doi: 10.1002/psp4.12715. Epub 2021 Oct 30.
9
Maximin optimal cluster randomized designs for assessing treatment effect heterogeneity.最大最小最优区组随机设计评估治疗效应异质性。
Stat Med. 2023 Sep 20;42(21):3764-3785. doi: 10.1002/sim.9830. Epub 2023 Jun 20.
10
Assessing Heterogeneity of Treatment Effects: Are Authors Misinterpreting Their Results?评估治疗效果的异质性:作者是否误解了他们的结果?
Health Serv Res. 2010 Feb;45(1):283-301. doi: 10.1111/j.1475-6773.2009.01064.x.

引用本文的文献

1
Generalized framework for identifying meaningful heterogenous treatment effects in observational studies: A parametric data-adaptive G-computation approach.观察性研究中识别有意义的异质性治疗效果的通用框架:一种参数化数据自适应G计算方法。
Stat Methods Med Res. 2025 Apr;34(4):648-662. doi: 10.1177/09622802251316969. Epub 2025 Feb 24.
2
Automated generation of comparator patients in the electronic medical record.电子病历中对照患者的自动生成
Learn Health Syst. 2023 Mar 28;8(1):e10362. doi: 10.1002/lrh2.10362. eCollection 2024 Jan.
3
Identifying treatment heterogeneity in atrial fibrillation using a novel causal machine learning method.

本文引用的文献

1
Benefit and harm of intensive blood pressure treatment: Derivation and validation of risk models using data from the SPRINT and ACCORD trials.强化血压治疗的益处与危害:利用收缩压干预试验(SPRINT)和控制糖尿病患者心血管风险行动(ACCORD)试验数据推导和验证风险模型
PLoS Med. 2017 Oct 17;14(10):e1002410. doi: 10.1371/journal.pmed.1002410. eCollection 2017 Oct.
2
Targeting weight loss interventions to reduce cardiovascular complications of type 2 diabetes: a machine learning-based post-hoc analysis of heterogeneous treatment effects in the Look AHEAD trial.针对降低 2 型糖尿病心血管并发症的减肥干预措施:一项基于 LOOK AHEAD 试验中异质治疗效果的机器学习事后分析。
Lancet Diabetes Endocrinol. 2017 Oct;5(10):808-815. doi: 10.1016/S2213-8587(17)30176-6. Epub 2017 Jul 12.
3
利用一种新的因果机器学习方法识别心房颤动的治疗异质性。
Am Heart J. 2023 Jun;260:124-140. doi: 10.1016/j.ahj.2023.02.015. Epub 2023 Mar 7.
4
Individualising intensive systolic blood pressure reduction in hypertension using computational trial phenomaps and machine learning: a post-hoc analysis of randomised clinical trials.使用计算试验表型图谱和机器学习对高血压患者进行个体化强化收缩压降低治疗:随机临床试验的事后分析。
Lancet Digit Health. 2022 Nov;4(11):e796-e805. doi: 10.1016/S2589-7500(22)00170-4.
5
Downstream funding success of early career researchers for resubmitted versus new applications: A matched cohort.新申请与重新提交申请的早期职业研究人员的下游资金成功情况:匹配队列研究。
PLoS One. 2021 Nov 18;16(11):e0257559. doi: 10.1371/journal.pone.0257559. eCollection 2021.
6
A tutorial on individualized treatment effect prediction from randomized trials with a binary endpoint.关于二分类结局随机试验中个体治疗效果预测的教程。
Stat Med. 2021 Nov 20;40(26):5961-5981. doi: 10.1002/sim.9154. Epub 2021 Aug 16.
7
The role of machine learning in clinical research: transforming the future of evidence generation.机器学习在临床研究中的作用:改变证据生成的未来。
Trials. 2021 Aug 16;22(1):537. doi: 10.1186/s13063-021-05489-x.
8
Generalizability of heterogeneous treatment effects based on causal forests applied to two randomized clinical trials of intensive glycemic control.基于因果森林的异质治疗效果的可推广性,应用于两项强化血糖控制的随机临床试验。
Ann Epidemiol. 2022 Jan;65:101-108. doi: 10.1016/j.annepidem.2021.07.003. Epub 2021 Jul 17.
9
Identifying treatment effects of an informal caregiver education intervention to increase days in the community and decrease caregiver distress: a machine-learning secondary analysis of subgroup effects in the HI-FIVES randomized clinical trial.识别非正规照护者教育干预增加社区居住天数和减少照护者痛苦的治疗效果:HI-FIVES 随机临床试验亚组效应的机器学习二次分析。
Trials. 2020 Feb 14;21(1):189. doi: 10.1186/s13063-020-4113-x.
Personalizing the Intensity of Blood Pressure Control: Modeling the Heterogeneity of Risks and Benefits From SPRINT (Systolic Blood Pressure Intervention Trial).个性化血压控制强度:对收缩压干预试验(SPRINT)风险和获益的异质性进行建模
Circ Cardiovasc Qual Outcomes. 2017 Apr;10(4). doi: 10.1161/CIRCOUTCOMES.117.003624.
4
Recursive partitioning for heterogeneous causal effects.异质因果效应的递归划分
Proc Natl Acad Sci U S A. 2016 Jul 5;113(27):7353-60. doi: 10.1073/pnas.1510489113.
5
A Randomized Trial of Intensive versus Standard Blood-Pressure Control.强化与标准血压控制的随机试验
N Engl J Med. 2015 Nov 26;373(22):2103-16. doi: 10.1056/NEJMoa1511939. Epub 2015 Nov 9.
6
The precision medicine initiative: a new national effort.精准医学计划:一项新的全国性行动。
JAMA. 2015 Jun 2;313(21):2119-20. doi: 10.1001/jama.2015.3595.
7
A Simple Method for Estimating Interactions between a Treatment and a Large Number of Covariates.一种估计治疗与大量协变量之间相互作用的简单方法。
J Am Stat Assoc. 2014 Oct;109(508):1517-1532. doi: 10.1080/01621459.2014.951443.
8
The cyclin-dependent kinase 4/6 inhibitor palbociclib in combination with letrozole versus letrozole alone as first-line treatment of oestrogen receptor-positive, HER2-negative, advanced breast cancer (PALOMA-1/TRIO-18): a randomised phase 2 study.哌柏西利联合来曲唑与来曲唑单药一线治疗雌激素受体阳性、HER2 阴性、晚期乳腺癌(PALOMA-1/TRIO-18)的随机 2 期研究。
Lancet Oncol. 2015 Jan;16(1):25-35. doi: 10.1016/S1470-2045(14)71159-3. Epub 2014 Dec 16.
9
Randomization inference for treatment effects on a binary outcome.二元结局治疗效果的随机化推断
Stat Med. 2015 Mar 15;34(6):924-35. doi: 10.1002/sim.6384. Epub 2014 Dec 4.
10
Using internally developed risk models to assess heterogeneity in treatment effects in clinical trials.使用内部开发的风险模型评估临床试验中治疗效果的异质性。
Circ Cardiovasc Qual Outcomes. 2014 Jan;7(1):163-9. doi: 10.1161/CIRCOUTCOMES.113.000497. Epub 2014 Jan 14.