• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深度强化学习在个性化治疗推荐中的应用。

Deep reinforcement learning for personalized treatment recommendation.

机构信息

School of Statistics, University of Minnesota, Minneapolis, Minnesota, USA.

Division of Biostatistics, University of Minnesota, Minneapolis, Minnesota, USA.

出版信息

Stat Med. 2022 Sep 10;41(20):4034-4056. doi: 10.1002/sim.9491. Epub 2022 Jun 18.

DOI:10.1002/sim.9491
PMID:35716038
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9427729/
Abstract

In precision medicine, the ultimate goal is to recommend the most effective treatment to an individual patient based on patient-specific molecular and clinical profiles, possibly high-dimensional. To advance cancer treatment, large-scale screenings of cancer cell lines against chemical compounds have been performed to help better understand the relationship between genomic features and drug response; existing machine learning approaches use exclusively supervised learning, including penalized regression and recommender systems. However, it would be more efficient to apply reinforcement learning to sequentially learn as data accrue, including selecting the most promising therapy for a patient given individual molecular and clinical features and then collecting and learning from the corresponding data. In this article, we propose a novel personalized ranking system called Proximal Policy Optimization Ranking (PPORank), which ranks the drugs based on their predicted effects per cell line (or patient) in the framework of deep reinforcement learning (DRL). Modeled as a Markov decision process, the proposed method learns to recommend the most suitable drugs sequentially and continuously over time. As a proof-of-concept, we conduct experiments on two large-scale cancer cell line data sets in addition to simulated data. The results demonstrate that the proposed DRL-based PPORank outperforms the state-of-the-art competitors based on supervised learning. Taken together, we conclude that novel methods in the framework of DRL have great potential for precision medicine and should be further studied.

摘要

在精准医学中,最终目标是以患者特定的分子和临床特征(可能是高维的)为基础,为个体患者推荐最有效的治疗方法。为了推进癌症治疗,已经对癌细胞系进行了大规模的化合物筛选,以帮助更好地理解基因组特征与药物反应之间的关系;现有的机器学习方法仅使用有监督学习,包括惩罚回归和推荐系统。然而,随着数据的积累,应用强化学习来顺序学习将更加高效,包括根据个体分子和临床特征为患者选择最有前途的疗法,然后收集和学习相应的数据。在本文中,我们提出了一种名为近策略优化排序(PPORank)的新的个性化排序系统,该系统在深度强化学习(DRL)框架内根据每个细胞系(或患者)的预测效果对药物进行排序。该模型被建模为一个马尔可夫决策过程,所提出的方法学会了随着时间的推移顺序和连续地推荐最合适的药物。作为概念验证,我们在两个大型癌症细胞系数据集以及模拟数据上进行了实验。结果表明,基于强化学习的 DRL 提出的方法优于基于监督学习的最先进的竞争对手。综上所述,我们得出结论,DRL 框架中的新方法在精准医学方面具有巨大的潜力,应该进一步研究。

相似文献

1
Deep reinforcement learning for personalized treatment recommendation.深度强化学习在个性化治疗推荐中的应用。
Stat Med. 2022 Sep 10;41(20):4034-4056. doi: 10.1002/sim.9491. Epub 2022 Jun 18.
2
Kernelized rank learning for personalized drug recommendation.核化秩学习在个性化药物推荐中的应用。
Bioinformatics. 2018 Aug 15;34(16):2808-2816. doi: 10.1093/bioinformatics/bty132.
3
An adaptable and personalized framework for top-N course recommendations in online learning.一种适用于在线学习的可自适应和个性化的 top-N 课程推荐框架。
Sci Rep. 2024 May 6;14(1):10382. doi: 10.1038/s41598-024-56497-1.
4
Deep Reinforcement Learning on Autonomous Driving Policy With Auxiliary Critic Network.基于辅助评论家网络的自动驾驶策略深度强化学习
IEEE Trans Neural Netw Learn Syst. 2023 Jul;34(7):3680-3690. doi: 10.1109/TNNLS.2021.3116063. Epub 2023 Jul 6.
5
Deep Reinforcement Learning and Simulation as a Path Toward Precision Medicine.深度强化学习与模拟:通往精准医学之路
J Comput Biol. 2019 Jun;26(6):597-604. doi: 10.1089/cmb.2018.0168. Epub 2019 Jan 25.
6
Deep reinforcement learning-based control of chemo-drug dose in cancer treatment.基于深度强化学习的癌症治疗化疗药物剂量控制。
Comput Methods Programs Biomed. 2024 Jan;243:107884. doi: 10.1016/j.cmpb.2023.107884. Epub 2023 Oct 24.
7
Deep reinforcement learning for automated radiation adaptation in lung cancer.深度强化学习在肺癌放射自适应中的应用。
Med Phys. 2017 Dec;44(12):6690-6705. doi: 10.1002/mp.12625. Epub 2017 Nov 14.
8
Intelligent control of self-driving vehicles based on adaptive sampling supervised actor-critic and human driving experience.基于自适应采样监督式智能体-评论家算法和人类驾驶经验的自动驾驶车辆智能控制
Math Biosci Eng. 2024 May 24;21(5):6077-6096. doi: 10.3934/mbe.2024267.
9
Predictive hierarchical reinforcement learning for path-efficient mapless navigation with moving target.具有移动目标的无图路径高效导航的预测分层强化学习。
Neural Netw. 2023 Aug;165:677-688. doi: 10.1016/j.neunet.2023.06.007. Epub 2023 Jun 10.
10
Deep Reinforcement Learning Framework for Category-Based Item Recommendation.用于基于类别的项目推荐的深度强化学习框架
IEEE Trans Cybern. 2022 Nov;52(11):12028-12041. doi: 10.1109/TCYB.2021.3089941. Epub 2022 Oct 17.

引用本文的文献

1
Advancing genome-based precision medicine: a review on machine learning applications for rare genetic disorders.推进基于基因组的精准医学:关于机器学习在罕见遗传疾病中的应用综述
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf329.
2
Navigating disruption in the PID landscape: embracing opportunities and anticipating threats in the next ten years.应对原发性免疫缺陷病领域的变革:把握未来十年的机遇并预见威胁
Front Immunol. 2025 Jun 17;16:1596971. doi: 10.3389/fimmu.2025.1596971. eCollection 2025.
3
Role of Artificial Intelligence and Personalized Medicine in Enhancing HIV Management and Treatment Outcomes.

本文引用的文献

1
A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open Problems.离线强化学习综述:分类、回顾与开放问题
IEEE Trans Neural Netw Learn Syst. 2024 Aug;35(8):10237-10257. doi: 10.1109/TNNLS.2023.3250269. Epub 2024 Aug 5.
2
Outcome weighted -learning for individualized treatment rules.基于结果加权学习的个性化治疗规则
Stat. 2020 Dec;10(1). doi: 10.1002/sta4.343. Epub 2020 Dec 7.
3
Reinforcement learning for intelligent healthcare applications: A survey.强化学习在智能医疗保健应用中的研究进展:综述
人工智能与个性化医疗在改善艾滋病病毒管理及治疗效果中的作用
Life (Basel). 2025 May 6;15(5):745. doi: 10.3390/life15050745.
4
Reinforcement Learning in Personalized Medicine: A Comprehensive Review of Treatment Optimization Strategies.个性化医疗中的强化学习:治疗优化策略的全面综述
Cureus. 2025 Apr 21;17(4):e82756. doi: 10.7759/cureus.82756. eCollection 2025 Apr.
5
Role of AI in empowering and redefining the oncology care landscape: perspective from a developing nation.人工智能在赋能和重新定义肿瘤护理格局中的作用:来自一个发展中国家的视角。
Front Digit Health. 2025 Mar 4;7:1550407. doi: 10.3389/fdgth.2025.1550407. eCollection 2025.
6
Beyond the current state of just-in-time adaptive interventions in mental health: a qualitative systematic review.超越心理健康即时适应性干预的现状:一项定性系统综述
Front Digit Health. 2025 Jan 28;7:1460167. doi: 10.3389/fdgth.2025.1460167. eCollection 2025.
7
From Serendipity to Precision: Integrating AI, Multi-Omics, and Human-Specific Models for Personalized Neuropsychiatric Care.从意外发现到精准医疗:整合人工智能、多组学和人类特异性模型以实现个性化神经精神疾病护理。
Biomedicines. 2025 Jan 12;13(1):167. doi: 10.3390/biomedicines13010167.
8
Metabolic modelling as a powerful tool to identify critical components of Pneumocystis growth medium.代谢建模作为一种强大的工具,可用于鉴定卡氏肺孢子虫生长培养基的关键成分。
PLoS Comput Biol. 2024 Oct 28;20(10):e1012545. doi: 10.1371/journal.pcbi.1012545. eCollection 2024 Oct.
9
Artificial Intelligence in Head and Neck Cancer: Innovations, Applications, and Future Directions.人工智能在头颈部肿瘤中的应用:创新、应用和未来方向。
Curr Oncol. 2024 Sep 6;31(9):5255-5290. doi: 10.3390/curroncol31090389.
10
Trust me if you can: a survey on reliability and interpretability of machine learning approaches for drug sensitivity prediction in cancer.相信我:一项关于机器学习方法在癌症药物敏感性预测中的可靠性和可解释性的调查。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae379.
Artif Intell Med. 2020 Sep;109:101964. doi: 10.1016/j.artmed.2020.101964. Epub 2020 Sep 28.
4
Deep reinforcement learning in medical imaging: A literature review.深度强化学习在医学成像中的应用:文献综述。
Med Image Anal. 2021 Oct;73:102193. doi: 10.1016/j.media.2021.102193. Epub 2021 Jul 27.
5
Estimating Dynamic Treatment Regimes in Mobile Health Using V-learning.使用V学习法估计移动健康中的动态治疗方案。
J Am Stat Assoc. 2020;115(530):692-706. doi: 10.1080/01621459.2018.1537919. Epub 2019 Apr 17.
6
Reinforcement Learning for Clinical Decision Support in Critical Care: Comprehensive Review.强化学习在重症监护临床决策支持中的应用:全面综述。
J Med Internet Res. 2020 Jul 20;22(7):e18477. doi: 10.2196/18477.
7
Precision Medicine.精准医学
Annu Rev Stat Appl. 2019 Mar;6:263-286. doi: 10.1146/annurev-statistics-030718-105251.
8
Improving Sepsis Treatment Strategies by Combining Deep and Kernel-Based Reinforcement Learning.通过结合深度强化学习和基于核的强化学习改进脓毒症治疗策略
AMIA Annu Symp Proc. 2018 Dec 5;2018:887-896. eCollection 2018.
9
Interpretable Dynamic Treatment Regimes.可解释的动态治疗方案
J Am Stat Assoc. 2018;113(524):1541-1549. doi: 10.1080/01621459.2017.1345743. Epub 2018 Nov 14.
10
Learning with multiple pairwise kernels for drug bioactivity prediction.使用多种成对核函数进行药物生物活性预测。
Bioinformatics. 2018 Jul 1;34(13):i509-i518. doi: 10.1093/bioinformatics/bty277.