• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

强化学习衍生的化疗方案,用于稳健的患者特异性治疗。

Reinforcement learning derived chemotherapeutic schedules for robust patient-specific therapy.

机构信息

Department of Applied Mathematics, University of Waterloo, Waterloo, N2L 3G1, Canada.

出版信息

Sci Rep. 2021 Sep 9;11(1):17882. doi: 10.1038/s41598-021-97028-6.

DOI:10.1038/s41598-021-97028-6
PMID:34504141
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8429726/
Abstract

The in-silico development of a chemotherapeutic dosing schedule for treating cancer relies upon a parameterization of a particular tumour growth model to describe the dynamics of the cancer in response to the dose of the drug. In practice, it is often prohibitively difficult to ensure the validity of patient-specific parameterizations of these models for any particular patient. As a result, sensitivities to these particular parameters can result in therapeutic dosing schedules that are optimal in principle not performing well on particular patients. In this study, we demonstrate that chemotherapeutic dosing strategies learned via reinforcement learning methods are more robust to perturbations in patient-specific parameter values than those learned via classical optimal control methods. By training a reinforcement learning agent on mean-value parameters and allowing the agent periodic access to a more easily measurable metric, relative bone marrow density, for the purpose of optimizing dose schedule while reducing drug toxicity, we are able to develop drug dosing schedules that outperform schedules learned via classical optimal control methods, even when such methods are allowed to leverage the same bone marrow measurements.

摘要

治疗癌症的化疗剂量方案的计算机开发依赖于对特定肿瘤生长模型的参数化,以描述癌症对药物剂量的反应动力学。在实践中,通常很难确保对任何特定患者的这些模型的特定于患者的参数化的有效性。结果,对这些特定参数的敏感性可能导致在原则上最佳的治疗剂量方案在特定患者上表现不佳。在这项研究中,我们证明通过强化学习方法学习的化疗剂量策略比通过经典最优控制方法学习的策略对患者特定参数值的扰动更具有鲁棒性。通过在均值参数上训练强化学习代理,并允许代理定期访问更易于测量的指标(相对骨髓密度),以便在降低药物毒性的同时优化剂量方案,我们能够开发出比通过经典最优控制方法学习的药物剂量方案更好的药物剂量方案,即使这些方法被允许利用相同的骨髓测量值。

相似文献

1
Reinforcement learning derived chemotherapeutic schedules for robust patient-specific therapy.强化学习衍生的化疗方案,用于稳健的患者特异性治疗。
Sci Rep. 2021 Sep 9;11(1):17882. doi: 10.1038/s41598-021-97028-6.
2
Reinforcement learning strategies in cancer chemotherapy treatments: A review.强化学习策略在癌症化疗治疗中的应用:综述。
Comput Methods Programs Biomed. 2023 Feb;229:107280. doi: 10.1016/j.cmpb.2022.107280. Epub 2022 Nov 26.
3
Reinforcement learning-based control of drug dosing for cancer chemotherapy treatment.基于强化学习的癌症化疗药物剂量控制
Math Biosci. 2017 Nov;293:11-20. doi: 10.1016/j.mbs.2017.08.004. Epub 2017 Aug 16.
4
The rationale of dose-response curves in selecting cancer drug dosing.剂量-反应曲线在选择癌症药物剂量中的原理。
Br J Clin Pharmacol. 2019 Oct;85(10):2198-2204. doi: 10.1111/bcp.13979. Epub 2019 Jun 20.
5
Human free-operant performance varies with a concurrent task: Probability learning without a task, and schedule-consistent with a task.人类的自由操作行为会因一项并发任务而有所不同:无任务时的概率学习,以及与任务一致的时间表。
Learn Behav. 2020 Jun;48(2):254-273. doi: 10.3758/s13420-019-00398-1.
6
Optimal medication dosing from suboptimal clinical examples: a deep reinforcement learning approach.从次优临床实例中得出最佳药物剂量:一种深度强化学习方法。
Annu Int Conf IEEE Eng Med Biol Soc. 2016 Aug;2016:2978-2981. doi: 10.1109/EMBC.2016.7591355.
7
Continuous action deep reinforcement learning for propofol dosing during general anesthesia.全身麻醉期间丙泊酚给药的连续动作深度强化学习
Artif Intell Med. 2022 Jan;123:102227. doi: 10.1016/j.artmed.2021.102227. Epub 2021 Dec 2.
8
Concurrent schedules of differential reinforcement of alternative behavior in the treatment of problem behavior without extinction.同时采用差别强化替代行为的时间表对问题行为进行治疗,而不采用消退法。
J Appl Behav Anal. 2023 Jun;56(3):623-637. doi: 10.1002/jaba.987. Epub 2023 Apr 23.
9
Reinforcement learning for optimal scheduling of Glioblastoma treatment with Temozolomide.用于替莫唑胺治疗胶质母细胞瘤最佳治疗计划的强化学习
Comput Methods Programs Biomed. 2020 Sep;193:105443. doi: 10.1016/j.cmpb.2020.105443. Epub 2020 Mar 19.
10
Broad spectrum preclinical antitumor activity of eribulin (Halaven(R)): optimal effectiveness under intermittent dosing conditions.艾立布林(海乐卫(R))广谱抗肿瘤临床前活性:间歇性给药条件下的最佳疗效。
Anticancer Res. 2012 May;32(5):1611-9.

引用本文的文献

1
Model-informed Deep Q-Networks to Guide Infliximab Dosing in Pediatric Crohn's Disease.模型指导的深度Q网络用于指导儿童克罗恩病的英夫利昔单抗给药剂量
bioRxiv. 2025 Jul 18:2025.07.14.664804. doi: 10.1101/2025.07.14.664804.
2
Reinforcement Learning and Its Clinical Applications Within Healthcare: A Systematic Review of Precision Medicine and Dynamic Treatment Regimes.强化学习及其在医疗保健领域的临床应用:精准医学与动态治疗方案的系统综述
Healthcare (Basel). 2025 Jul 19;13(14):1752. doi: 10.3390/healthcare13141752.
3
Learning Chemotherapy Drug Action via Universal Physics-Informed Neural Networks.

本文引用的文献

1
Optimal Control Theory for Personalized Therapeutic Regimens in Oncology: Background, History, Challenges, and Opportunities.肿瘤个性化治疗方案的最优控制理论:背景、历史、挑战与机遇
J Clin Med. 2020 May 2;9(5):1314. doi: 10.3390/jcm9051314.
2
A mathematical model of breast and ovarian cancer treated with paclitaxel.
Math Biosci. 1997 Dec;146(2):89-113. doi: 10.1016/s0025-5564(97)00077-1.
通过通用物理信息神经网络学习化疗药物作用
Pharm Res. 2025 Apr;42(4):593-612. doi: 10.1007/s11095-025-03858-8. Epub 2025 Apr 17.
4
Optimized patient-specific immune checkpoint inhibitor therapies for cancer treatment based on tumor immune microenvironment modeling.基于肿瘤免疫微环境建模的癌症治疗优化的个体化免疫检查点抑制剂治疗。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae547.
5
Machine Learning Methods for Precision Dosing in Anticancer Drug Therapy: A Scoping Review.机器学习方法在抗癌药物治疗中的精准剂量学:范围综述。
Clin Pharmacokinet. 2024 Sep;63(9):1221-1237. doi: 10.1007/s40262-024-01409-9. Epub 2024 Aug 17.
6
Model enhanced reinforcement learning to enable precision dosing: A theoretical case study with dosing of propofol.模型增强强化学习以实现精准剂量给药:以丙泊酚给药为例的理论案例研究。
CPT Pharmacometrics Syst Pharmacol. 2022 Nov;11(11):1497-1510. doi: 10.1002/psp4.12858. Epub 2022 Sep 30.