基于多智能体在线策略强化学习的自适应平均动脉压控制

Adaptive average arterial pressure control by multi-agent on-policy reinforcement learning.

作者信息

Hong Xiaofeng, Ayadi Walid, Alattas Khalid A, Mohammadzadeh Ardashir, Salimi Mohamad, Zhang Chunwei

机构信息

Zhejiang Guangsha Vocational and Technical University of Construction, Dongyang, 322100, China.

Mechatronics and Intelligent Systems, Abu Dhabi Polytechnic, Abu Dhabi, United Arab Emirates.

出版信息

Sci Rep. 2025 Jan 3;15(1):679. doi: 10.1038/s41598-024-84791-5.

DOI:10.1038/s41598-024-84791-5

PMID:39753883

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11699154/

Abstract

The current research introduces a model-free ultra-local model (MFULM) controller that utilizes the multi-agent on-policy reinforcement learning (MAOPRL) technique for remotely regulating blood pressure through precise drug dosing in a closed-loop system. Within the closed-loop system, there exists a MFULM controller, an observer, and an intelligent MAOPRL algorithm. Initially, a flexible MFULM controller is created to make adjustments to blood pressure and medication dosages. Following this, an observer is incorporated into the main controller to improve performance and stability by estimating states and disturbances. The controller parameters are optimized using MAOPRL in an adaptive manner, which involves the use of an actor-critic approach in an adaptive fashion. This approach enhances the adaptability of the controller by allowing for dynamic modifications to dosage and blood pressure control parameters. In the presence of disturbances or instabilities, the critic's feedback aids the actor in adjusting actions to reduce their impact, utilizing a complementary strategy to tackle deficiencies in the primary controller. Lastly, various evaluations, including assessments under normal conditions, adaptability between patients, and stability evaluations against mixed disturbances, have been carried out to confirm the efficiency and viability of the proposed method.

摘要

当前的研究引入了一种无模型超局部模型（MFULM）控制器，该控制器利用多智能体在线强化学习（MAOPRL）技术，通过在闭环系统中精确给药来远程调节血压。在闭环系统中，存在一个MFULM控制器、一个观测器和一种智能MAOPRL算法。首先，创建一个灵活的MFULM控制器来调整血压和药物剂量。在此之后，将一个观测器纳入主控制器，通过估计状态和干扰来提高性能和稳定性。使用MAOPRL以自适应方式优化控制器参数，这涉及以自适应方式使用actor-critic方法。这种方法通过允许动态修改剂量和血压控制参数来增强控制器的适应性。在存在干扰或不稳定的情况下，评论家的反馈有助于行动者调整行动以减少其影响，利用一种互补策略来解决主控制器中的不足。最后，进行了各种评估，包括正常条件下的评估、患者之间的适应性评估以及针对混合干扰的稳定性评估，以确认所提出方法的有效性和可行性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3fbe/11699154/956c5e6872e9/41598_2024_84791_Fig1_HTML.jpg

相似文献

Adaptive average arterial pressure control by multi-agent on-policy reinforcement learning.基于多智能体在线策略强化学习的自适应平均动脉压控制

Sci Rep. 2025 Jan 3;15(1):679. doi: 10.1038/s41598-024-84791-5.

An optimal interval type-2 fuzzy logic control based closed-loop drug administration to regulate the mean arterial blood pressure.基于最优区间型 2 模糊逻辑控制的闭环给药调节平均动脉血压。

Comput Methods Programs Biomed. 2020 Mar;185:105167. doi: 10.1016/j.cmpb.2019.105167. Epub 2019 Oct 31.

Meta attention for Off-Policy Actor-Critic.用于离策略演员-评论家的元注意力机制

Neural Netw. 2023 Jun;163:86-96. doi: 10.1016/j.neunet.2023.03.024. Epub 2023 Mar 28.

Stochastic Integrated Actor-Critic for Deep Reinforcement Learning.用于深度强化学习的随机集成演员-评论家算法

IEEE Trans Neural Netw Learn Syst. 2024 May;35(5):6654-6666. doi: 10.1109/TNNLS.2022.3212273. Epub 2024 May 2.

Reinforcement-learning-based dual-control methodology for complex nonlinear discrete-time systems with application to spark engine EGR operation.基于强化学习的复杂非线性离散时间系统双控制方法及其在火花发动机废气再循环操作中的应用

IEEE Trans Neural Netw. 2008 Aug;19(8):1369-88. doi: 10.1109/TNN.2008.2000452.

Towards autonomous neuroprosthetic control using Hebbian reinforcement learning.使用赫布强化学习实现自主神经假肢控制。

J Neural Eng. 2013 Dec;10(6):066005. doi: 10.1088/1741-2560/10/6/066005. Epub 2013 Oct 8.

Actor-Critic Reinforcement Learning Based Algorithm for Contaminant Type Identification in Surface Electromyography Data.

Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:186-189. doi: 10.1109/EMBC46164.2021.9629967.

Reinforcement learning for closed-loop regulation of cardiovascular system with vagus nerve stimulation: a computational study.基于迷走神经刺激的心血管系统闭环调节的强化学习：一项计算研究。

J Neural Eng. 2024 Jun 3;21(3):036027. doi: 10.1088/1741-2552/ad48bb.

Continuous action deep reinforcement learning for propofol dosing during general anesthesia.全身麻醉期间丙泊酚给药的连续动作深度强化学习

Artif Intell Med. 2022 Jan;123:102227. doi: 10.1016/j.artmed.2021.102227. Epub 2021 Dec 2.

Reinforcement-learning-based output-feedback control of nonstrict nonlinear discrete-time systems with application to engine emission control.基于强化学习的非严格非线性离散时间系统输出反馈控制及其在发动机排放控制中的应用

IEEE Trans Syst Man Cybern B Cybern. 2009 Oct;39(5):1162-79. doi: 10.1109/TSMCB.2009.2013272. Epub 2009 Mar 24.

引用本文的文献

The HM-TARGET personalised real-time haemodynamic targets in critical care.重症监护中的HM-TARGET个性化实时血流动力学目标

Nat Commun. 2025 Aug 7;16(1):7307. doi: 10.1038/s41467-025-62527-x.

Near real-time online reinforcement learning with synchronous or asynchronous updates.具有同步或异步更新的近实时在线强化学习。

Sci Rep. 2025 May 17;15(1):17158. doi: 10.1038/s41598-025-00492-7.

本文引用的文献

Automated Blood Pressure Control.自动血压控制

Semin Respir Crit Care Med. 2021 Feb;42(1):47-58. doi: 10.1055/s-0040-1713083. Epub 2020 Aug 3.

Automated closed-loop control of diabetes: the artificial pancreas.糖尿病的自动闭环控制：人工胰腺

Bioelectron Med. 2018 Nov 7;4:14. doi: 10.1186/s42234-018-0015-6. eCollection 2018.

Comput Methods Programs Biomed. 2020 Mar;185:105167. doi: 10.1016/j.cmpb.2019.105167. Epub 2019 Oct 31.

Computational and Pharmacogenomic Insights on Hypertension Treatment: Rational Drug Design and Optimization Strategies.计算与药物基因组学在高血压治疗中的研究进展：合理药物设计与优化策略。

Curr Drug Targets. 2020;21(1):18-33. doi: 10.2174/1389450120666190808101356.

The influence of mean arterial blood pressure during cardiopulmonary bypass on postoperative renal dysfunction in elderly patients.体外循环期间平均动脉血压对老年患者术后肾功能障碍的影响。

Perfusion. 2012 May;27(3):193-8. doi: 10.1177/0267659112436751. Epub 2012 Feb 15.

Pulse pressure and risk of adverse outcome in coronary bypass surgery.冠状动脉搭桥手术中的脉压与不良结局风险

Anesth Analg. 2008 Oct;107(4):1122-9. doi: 10.1213/ane.0b013e31816ba404.

Improving regulation of mean arterial blood pressure during anesthesia through estimates of surgery effects.

IEEE Trans Biomed Eng. 2000 Nov;47(11):1456-64. doi: 10.1109/10.880097.

Sodium nitroprusside: twenty years and counting.硝普钠：二十年仍在继续。

Anesth Analg. 1995 Jul;81(1):152-62. doi: 10.1097/00000539-199507000-00031.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于多智能体在线策略强化学习的自适应平均动脉压控制

Adaptive average arterial pressure control by multi-agent on-policy reinforcement learning.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献