• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

来自序贯规则自适应试验的个体化治疗规则的非渐近性质。

Non-asymptotic Properties of Individualized Treatment Rules from Sequentially Rule-Adaptive Trials.

作者信息

Gao Daiqi, Liu Yufeng, Zeng Donglin

机构信息

Department of Statistics and Operations Research, The University of North Carolina at Chapel Hill Chapel Hill, NC 27599, USA.

Department of Statistics and Operations Research, Department of Genetics, Department of Biostatistics, The University of North Carolina at Chapel Hill Chapel Hill, NC 27599, USA.

出版信息

J Mach Learn Res. 2022;23(250).

PMID:37576335
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10419117/
Abstract

Learning optimal individualized treatment rules (ITRs) has become increasingly important in the modern era of precision medicine. Many statistical and machine learning methods for learning optimal ITRs have been developed in the literature. However, most existing methods are based on data collected from traditional randomized controlled trials and thus cannot take advantage of the accumulative evidence when patients enter the trials sequentially. It is also ethically important that future patients should have a high probability to be treated optimally based on the updated knowledge so far. In this work, we propose a new design called sequentially rule-adaptive trials to learn optimal ITRs based on the contextual bandit framework, in contrast to the response-adaptive design in traditional adaptive trials. In our design, each entering patient will be allocated with a high probability to the current best treatment for this patient, which is estimated using the past data based on some machine learning algorithm (for example, outcome weighted learning in our implementation). We explore the tradeoff between training and test values of the estimated ITR in single-stage problems by proving theoretically that for a higher probability of following the estimated ITR, the training value converges to the optimal value at a faster rate, while the test value converges at a slower rate. This problem is different from traditional decision problems in the sense that the training data are generated sequentially and are dependent. We also develop a tool that combines martingale with empirical process to tackle the problem that cannot be solved by previous techniques for i.i.d. data. We show by numerical examples that without much loss of the test value, our proposed algorithm can improve the training value significantly as compared to existing methods. Finally, we use a real data study to illustrate the performance of the proposed method.

摘要

在精准医学的现代时代,学习最优个体化治疗规则(ITRs)变得越来越重要。文献中已经开发了许多用于学习最优ITRs的统计和机器学习方法。然而,大多数现有方法基于从传统随机对照试验收集的数据,因此当患者依次进入试验时,无法利用累积证据。同样重要的是,从伦理角度来看,未来的患者应该有很高的概率根据迄今为止更新的知识接受最优治疗。在这项工作中,我们提出了一种名为序贯规则自适应试验的新设计,以基于上下文博弈框架学习最优ITRs,这与传统自适应试验中的响应自适应设计形成对比。在我们的设计中,每个进入的患者将以高概率被分配到针对该患者的当前最佳治疗,这是使用基于某种机器学习算法的过去数据进行估计的(例如,在我们的实现中的结果加权学习)。我们通过理论证明,在单阶段问题中探索估计ITR的训练值和测试值之间的权衡:对于遵循估计ITR的更高概率,训练值以更快的速率收敛到最优值,而测试值收敛速率较慢。这个问题与传统决策问题不同,因为训练数据是顺序生成的并且是相关的。我们还开发了一种将鞅与经验过程相结合的工具,以解决先前用于独立同分布数据的技术无法解决的问题。我们通过数值示例表明,与现有方法相比,在测试值没有太大损失的情况下,我们提出的算法可以显著提高训练值。最后,我们使用实际数据研究来说明所提出方法的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5249/10419117/228bb1ae4ead/nihms-1918139-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5249/10419117/34a61cf7cc7f/nihms-1918139-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5249/10419117/bcba25d9b3e1/nihms-1918139-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5249/10419117/55e8de677d74/nihms-1918139-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5249/10419117/94276ad08bc9/nihms-1918139-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5249/10419117/d77b1e7f6b7b/nihms-1918139-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5249/10419117/bb99fa6653a6/nihms-1918139-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5249/10419117/228bb1ae4ead/nihms-1918139-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5249/10419117/34a61cf7cc7f/nihms-1918139-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5249/10419117/bcba25d9b3e1/nihms-1918139-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5249/10419117/55e8de677d74/nihms-1918139-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5249/10419117/94276ad08bc9/nihms-1918139-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5249/10419117/d77b1e7f6b7b/nihms-1918139-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5249/10419117/bb99fa6653a6/nihms-1918139-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5249/10419117/228bb1ae4ead/nihms-1918139-f0008.jpg

相似文献

1
Non-asymptotic Properties of Individualized Treatment Rules from Sequentially Rule-Adaptive Trials.来自序贯规则自适应试验的个体化治疗规则的非渐近性质。
J Mach Learn Res. 2022;23(250).
2
Fusing Individualized Treatment Rules Using Secondary Outcomes.使用次要结果融合个体化治疗规则
Proc Mach Learn Res. 2024 May;238:712-720.
3
Robust outcome weighted learning for optimal individualized treatment rules.用于优化个体化治疗规则的稳健结果加权学习
J Biopharm Stat. 2019;29(4):606-624. doi: 10.1080/10543406.2019.1633657. Epub 2019 Jul 16.
4
Learning Optimal Group-structured Individualized Treatment Rules with Many Treatments.学习具有多种治疗方法的最优组结构化个体化治疗规则。
J Mach Learn Res. 2023;24.
5
Residual Weighted Learning for Estimating Individualized Treatment Rules.用于估计个体化治疗规则的残差加权学习
J Am Stat Assoc. 2017;112(517):169-187. doi: 10.1080/01621459.2015.1093947. Epub 2017 May 3.
6
Do machine learning methods lead to similar individualized treatment rules? A comparison study on real data.机器学习方法是否会产生类似的个体化治疗规则?基于真实数据的比较研究。
Stat Med. 2024 May 20;43(11):2043-2061. doi: 10.1002/sim.10059. Epub 2024 Mar 12.
7
On using electronic health records to improve optimal treatment rules in randomized trials.利用电子健康记录改善随机试验中的最佳治疗规则。
Biometrics. 2020 Dec;76(4):1075-1086. doi: 10.1111/biom.13288. Epub 2020 May 14.
8
Tree-based Ensemble Methods For Individualized Treatment Rules.用于个性化治疗规则的基于树的集成方法。
Biostat Epidemiol. 2017;2(1):61-83. doi: 10.1080/24709360.2018.1435608. Epub 2018 Mar 28.
9
Model selection for survival individualized treatment rules using the jackknife estimator.利用刀切估计量进行生存个体化治疗规则的模型选择。
BMC Med Res Methodol. 2022 Dec 22;22(1):328. doi: 10.1186/s12874-022-01811-6.
10
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.

引用本文的文献

1
Fusing Individualized Treatment Rules Using Secondary Outcomes.使用次要结果融合个体化治疗规则
Proc Mach Learn Res. 2024 May;238:712-720.

本文引用的文献

1
Personalized HeartSteps: A Reinforcement Learning Algorithm for Optimizing Physical Activity.个性化心脏运动计划:一种用于优化身体活动的强化学习算法
Proc ACM Interact Mob Wearable Ubiquitous Technol. 2020 Mar;4(1). doi: 10.1145/3381007.
2
Statistical Inference for Online Decision-Making: In a Contextual Bandit Setting.在线决策的统计推断:上下文博弈设置
J Am Stat Assoc. 2021;116(533):240-255. doi: 10.1080/01621459.2020.1770098. Epub 2020 Jul 7.
3
Estimating Dynamic Treatment Regimes in Mobile Health Using V-learning.使用V学习法估计移动健康中的动态治疗方案。
J Am Stat Assoc. 2020;115(530):692-706. doi: 10.1080/01621459.2018.1537919. Epub 2019 Apr 17.
4
A hybrid phase I-II/III clinical trial design allowing dose re-optimization in phase III.一种允许在III期进行剂量重新优化的I-II/III期混合临床试验设计。
Biometrics. 2019 Jun;75(2):371-381. doi: 10.1111/biom.12994. Epub 2019 Apr 3.
5
Augmented outcome-weighted learning for estimating optimal dynamic treatment regimens.增强型结果加权学习估计最优动态治疗方案。
Stat Med. 2018 Nov 20;37(26):3776-3788. doi: 10.1002/sim.7844. Epub 2018 Jun 5.
6
Estimating individualized treatment rules for ordinal treatments.估计有序治疗的个体化治疗规则。
Biometrics. 2018 Sep;74(3):924-933. doi: 10.1111/biom.12865. Epub 2018 Mar 13.
7
TARGETED SEQUENTIAL DESIGN FOR TARGETED LEARNING INFERENCE OF THE OPTIMAL TREATMENT RULE AND ITS MEAN REWARD.用于最优治疗规则及其平均奖励的靶向学习推断的靶向序贯设计
Ann Stat. 2017;45(6):2537-2564. doi: 10.1214/16-AOS1534. Epub 2017 Dec 15.
8
Residual Weighted Learning for Estimating Individualized Treatment Rules.用于估计个体化治疗规则的残差加权学习
J Am Stat Assoc. 2017;112(517):169-187. doi: 10.1080/01621459.2015.1093947. Epub 2017 May 3.
9
Active Clinical Trials for Personalized Medicine.个性化医疗的活跃临床试验
J Am Stat Assoc. 2016;111(514):875-887. doi: 10.1080/01621459.2015.1066682. Epub 2016 Aug 18.
10
Phase I/II dose-finding design for molecularly targeted agent: Plateau determination using adaptive randomization.分子靶向药物的I/II期剂量探索设计:使用适应性随机化确定平台期
Stat Methods Med Res. 2018 Feb;27(2):466-479. doi: 10.1177/0962280216631763. Epub 2016 Mar 17.