• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

将因果因素纳入 HIV 动态治疗方案的强化学习中。

Incorporating causal factors into reinforcement learning for dynamic treatment regimes in HIV.

机构信息

School of Computer Science and Technology, Dalian University of Technology, No. 2, Linggong Road, Dalian, 116024, China.

Department of Computer Science, Hong Kong Baptist University, Kowloon Tong, Hong Kong, China.

出版信息

BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):60. doi: 10.1186/s12911-019-0755-6.

DOI:10.1186/s12911-019-0755-6
PMID:30961606
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6454675/
Abstract

BACKGROUND

Reinforcement learning (RL) provides a promising technique to solve complex sequential decision making problems in health care domains. However, existing studies simply apply naive RL algorithms in discovering optimal treatment strategies for a targeted problem. This kind of direct applications ignores the abundant causal relationships between treatment options and the associated outcomes that are inherent in medical domains.

METHODS

This paper investigates how to integrate causal factors into an RL process in order to facilitate the final learning performance and increase explanations of learned strategies. A causal policy gradient algorithm is proposed and evaluated in dynamic treatment regimes (DTRs) for HIV based on a simulated computational model.

RESULTS

Simulations prove the effectiveness of the proposed algorithm for designing more efficient treatment protocols in HIV, and different definitions of the causal factors could have significant influence on the final learning performance, indicating the necessity of human prior knowledge on defining a suitable causal relationships for a given problem.

CONCLUSIONS

More efficient and robust DTRs for HIV can be derived through incorporation of causal factors between options of anti-HIV drugs and the associated treatment outcomes.

摘要

背景

强化学习(RL)为解决医疗领域复杂的序贯决策问题提供了一种很有前途的技术。然而,现有的研究只是简单地将原始的 RL 算法应用于发现针对特定问题的最佳治疗策略。这种直接应用忽略了医疗领域中治疗方案与相关结果之间固有的丰富因果关系。

方法

本文研究了如何将因果因素纳入 RL 过程中,以提高最终的学习性能和增强对所学策略的解释能力。基于模拟计算模型,针对 HIV 提出并评估了一种因果策略梯度算法。

结果

模拟证明了所提出的算法在设计更有效的 HIV 治疗方案方面的有效性,并且因果因素的不同定义对最终的学习性能有显著影响,这表明对于给定问题,人为定义合适的因果关系的先验知识是必要的。

结论

通过在抗 HIV 药物的选择与相关治疗结果之间引入因果因素,可以得出更有效和稳健的 HIV 动态治疗方案。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd6a/6454675/995e197b1574/12911_2019_755_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd6a/6454675/cd166844acd0/12911_2019_755_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd6a/6454675/17a1f9a9f015/12911_2019_755_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd6a/6454675/17872719f7d9/12911_2019_755_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd6a/6454675/6a1a55babae4/12911_2019_755_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd6a/6454675/f59658574dc0/12911_2019_755_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd6a/6454675/995e197b1574/12911_2019_755_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd6a/6454675/cd166844acd0/12911_2019_755_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd6a/6454675/17a1f9a9f015/12911_2019_755_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd6a/6454675/17872719f7d9/12911_2019_755_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd6a/6454675/6a1a55babae4/12911_2019_755_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd6a/6454675/f59658574dc0/12911_2019_755_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd6a/6454675/995e197b1574/12911_2019_755_Fig6_HTML.jpg

相似文献

1
Incorporating causal factors into reinforcement learning for dynamic treatment regimes in HIV.将因果因素纳入 HIV 动态治疗方案的强化学习中。
BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):60. doi: 10.1186/s12911-019-0755-6.
2
Inverse reinforcement learning for intelligent mechanical ventilation and sedative dosing in intensive care units.重症监护病房中智能机械通气和镇静药物剂量的逆强化学习。
BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):57. doi: 10.1186/s12911-019-0763-6.
3
A reinforcement learning model to inform optimal decision paths for HIV elimination.一种用于为消除艾滋病病毒提供最佳决策路径的强化学习模型。
Math Biosci Eng. 2021 Sep 6;18(6):7666-7684. doi: 10.3934/mbe.2021380.
4
Reinforcement Learning in Neurocritical and Neurosurgical Care: Principles and Possible Applications.神经危重症与神经外科学中的强化学习:原则与可能的应用。
Comput Math Methods Med. 2021 Feb 22;2021:6657119. doi: 10.1155/2021/6657119. eCollection 2021.
5
Adaptive contrast weighted learning for multi-stage multi-treatment decision-making.用于多阶段多治疗决策的自适应对比度加权学习
Biometrics. 2017 Mar;73(1):145-155. doi: 10.1111/biom.12539. Epub 2016 May 23.
6
Supervised-actor-critic reinforcement learning for intelligent mechanical ventilation and sedative dosing in intensive care units.基于监督演员-评论员的强化学习算法在重症监护病房智能机械通气和镇静药物剂量调节中的应用
BMC Med Inform Decis Mak. 2020 Jul 9;20(Suppl 3):124. doi: 10.1186/s12911-020-1120-5.
7
TREE-BASED REINFORCEMENT LEARNING FOR ESTIMATING OPTIMAL DYNAMIC TREATMENT REGIMES.基于树的强化学习用于估计最优动态治疗方案
Ann Appl Stat. 2018 Sep;12(3):1914-1938. doi: 10.1214/18-AOAS1137. Epub 2018 Sep 11.
8
Optimization of anemia treatment in hemodialysis patients via reinforcement learning.通过强化学习优化血液透析患者的贫血治疗。
Artif Intell Med. 2014 Sep;62(1):47-60. doi: 10.1016/j.artmed.2014.07.004. Epub 2014 Jul 19.
9
An algorithm for direct causal learning of influences on patient outcomes.一种用于直接因果学习对患者预后影响的算法。
Artif Intell Med. 2017 Jan;75:1-15. doi: 10.1016/j.artmed.2016.10.003. Epub 2016 Nov 5.
10
Continuous action deep reinforcement learning for propofol dosing during general anesthesia.全身麻醉期间丙泊酚给药的连续动作深度强化学习
Artif Intell Med. 2022 Jan;123:102227. doi: 10.1016/j.artmed.2021.102227. Epub 2021 Dec 2.

引用本文的文献

1
The application of machine learning in clinical microbiology and infectious diseases.机器学习在临床微生物学和传染病中的应用。
Front Cell Infect Microbiol. 2025 May 1;15:1545646. doi: 10.3389/fcimb.2025.1545646. eCollection 2025.
2
Artificial Intelligence in Infectious Disease Clinical Practice: An Overview of Gaps, Opportunities, and Limitations.传染病临床实践中的人工智能:差距、机遇与局限概述
Trop Med Infect Dis. 2024 Sep 30;9(10):228. doi: 10.3390/tropicalmed9100228.
3
Continuous-Time Decision Transformer for Healthcare Applications.

本文引用的文献

1
Causal Explanation Under Indeterminism: A Sampling Approach.非决定论下的因果解释:一种抽样方法。
Proc AAAI Conf Artif Intell. 2016 Feb;2016:1037-1043.
2
Deep reinforcement learning for automated radiation adaptation in lung cancer.深度强化学习在肺癌放射自适应中的应用。
Med Phys. 2017 Dec;44(12):6690-6705. doi: 10.1002/mp.12625. Epub 2017 Nov 14.
3
Combining Kernel and Model Based Learning for HIV Therapy Selection.结合基于核和模型的学习方法进行HIV治疗方案选择
用于医疗保健应用的连续时间决策变压器
Proc Mach Learn Res. 2023 Apr;206:6245-6262.
4
The Health Gym: synthetic health-related datasets for the development of reinforcement learning algorithms.健康健身房:用于开发强化学习算法的综合健康相关数据集。
Sci Data. 2022 Nov 11;9(1):693. doi: 10.1038/s41597-022-01784-7.
5
A reinforcement learning model to inform optimal decision paths for HIV elimination.一种用于为消除艾滋病病毒提供最佳决策路径的强化学习模型。
Math Biosci Eng. 2021 Sep 6;18(6):7666-7684. doi: 10.3934/mbe.2021380.
6
Supervised-actor-critic reinforcement learning for intelligent mechanical ventilation and sedative dosing in intensive care units.基于监督演员-评论员的强化学习算法在重症监护病房智能机械通气和镇静药物剂量调节中的应用
BMC Med Inform Decis Mak. 2020 Jul 9;20(Suppl 3):124. doi: 10.1186/s12911-020-1120-5.
AMIA Jt Summits Transl Sci Proc. 2017 Jul 26;2017:239-248. eCollection 2017.
4
Model-Free Machine Learning in Biomedicine: Feasibility Study in Type 1 Diabetes.生物医学中的无模型机器学习:1型糖尿病的可行性研究
PLoS One. 2016 Jul 21;11(7):e0158722. doi: 10.1371/journal.pone.0158722. eCollection 2016.
5
Informing sequential clinical decision-making through reinforcement learning: an empirical study.通过强化学习为序贯临床决策提供信息:一项实证研究。
Mach Learn. 2011 Jul 1;84(1-2):109-136. doi: 10.1007/s10994-010-5229-0.
6
Dynamic multidrug therapies for hiv: optimal and sti control approaches.动态多种药物疗法治疗艾滋病毒:最佳和性病控制方法。
Math Biosci Eng. 2004 Sep;1(2):223-41. doi: 10.3934/mbe.2004.1.223.