一种用于重症监护中优化治疗策略的强化学习模型：心肺特征作用的评估

A Reinforcement Learning Model for Optimal Treatment Strategies in Intensive Care: Assessment of the Role of Cardiorespiratory Features.

作者信息

Drudi Cristian, Mollura Maximiliano, Lehman Li-Wei H, Barbieri Riccardo

机构信息

Department of Electronics, Informatics and EngineeringPolitecnico di Milano 20133 Milano Italy.

Institute for Medical Engineering and ScienceMassachusetts Institute of Technology Cambridge MA 02139 USA.

出版信息

IEEE Open J Eng Med Biol. 2024 Feb 19;5:806-815. doi: 10.1109/OJEMB.2024.3367236. eCollection 2024.

DOI:10.1109/OJEMB.2024.3367236

PMID:39559781

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11573419/

Abstract

The purpose of this study is to evaluate the importance of cardiorespiratory variables within a Reinforcement Learning (RL) recommendation system aimed at establishing optimal strategies for drug treatment of septic patients in the intensive care unit (ICU). We developed a RL model in order to establish drug administration strategies for septic patients using only a set of cardiorespiratory variables. We then compared this model with other RL models trained with a different set of features. We selected patients meeting the Sepsis-3 criteria from the Multi-parameter Intelligent Monitoring in Intensive Care (MIMIC III) database, resulting in a total of 20,496 ICU admissions. A Markov Decision Process (MDP) was built on the extracted discrete time-series. A policy iteration algorithm was used to obtain the optimal AI policy for the MDP. The policy performance was then evaluated using the WIS estimator. The process was repeated for each set of variables and compared to a set of baseline benchmark policies. The model trained with cardiorespiratory variables outperformed all other models considered, resulting in a 95% confidence lower bound score of 97.48. This finding highlights the importance of cardiovascular variables in the clinical RL recommendation system. We established an efficient RL model for sepsis treatment in the ICU and demonstrated that cardiorespiratory variables provides critical information in devising optimal policies. Given the potentially continuous availability of cardiorespiratory features extracted from bedside physiological waveform monitoring, the proposed framework paves the way for a real time recommendation system for sepsis treatment.

摘要

本研究的目的是评估强化学习（RL）推荐系统中心肺变量的重要性，该系统旨在为重症监护病房（ICU）的脓毒症患者建立最佳药物治疗策略。我们开发了一个RL模型，以便仅使用一组心肺变量为脓毒症患者建立给药策略。然后，我们将该模型与使用不同特征集训练的其他RL模型进行比较。我们从多参数智能重症监护监测（MIMIC III）数据库中选择符合脓毒症-3标准的患者，共有20496例ICU入院病例。基于提取的离散时间序列构建了马尔可夫决策过程（MDP）。使用策略迭代算法获得MDP的最优AI策略。然后使用WIS估计器评估策略性能。对每组变量重复该过程，并与一组基线基准策略进行比较。使用心肺变量训练的模型优于所有其他考虑的模型，95%置信下限评分为97.48。这一发现突出了心血管变量在临床RL推荐系统中的重要性。我们建立了一个用于ICU脓毒症治疗的高效RL模型，并证明心肺变量在制定最优策略时提供了关键信息。鉴于从床边生理波形监测中提取的心肺特征可能持续可用，所提出的框架为脓毒症治疗的实时推荐系统铺平了道路。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad04/11573419/dd1dda639ca9/drudi1-3367236.jpg

相似文献

A Reinforcement Learning Model for Optimal Treatment Strategies in Intensive Care: Assessment of the Role of Cardiorespiratory Features.一种用于重症监护中优化治疗策略的强化学习模型：心肺特征作用的评估

IEEE Open J Eng Med Biol. 2024 Feb 19;5:806-815. doi: 10.1109/OJEMB.2024.3367236. eCollection 2024.

Systemic Inflammatory Response Syndrome全身炎症反应综合征

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

Confusion Assessment Method for the Intensive Care Unit (CAM-ICU) for the diagnosis of delirium in adults in critical care settings.**用于** ICU 成人患者的意识模糊评估方法（CAM-ICU）**用于** 诊断重症监护环境下成人的意识障碍。

Cochrane Database Syst Rev. 2023 Nov 21;11(11):CD013126. doi: 10.1002/14651858.CD013126.pub2.

Melatonin for the promotion of sleep in adults in the intensive care unit.褪黑素用于促进重症监护病房成年患者的睡眠。

Cochrane Database Syst Rev. 2018 May 10;5(5):CD012455. doi: 10.1002/14651858.CD012455.pub2.

Higher versus lower fractions of inspired oxygen or targets of arterial oxygenation for adults admitted to the intensive care unit.对于入住重症监护病房的成年人，较高与较低吸氧分数或动脉血氧目标。

Cochrane Database Syst Rev. 2023 Sep 13;9(9):CD012631. doi: 10.1002/14651858.CD012631.pub3.

Prediction cardiovascular deterioration in a paediatric intensive care unit (PicEWS): a machine learning modelling study of routinely collected health-care data.儿科重症监护病房心血管恶化的预测（PicEWS）：一项基于常规收集的医疗数据的机器学习建模研究

EClinicalMedicine. 2025 Jun 18;85:103255. doi: 10.1016/j.eclinm.2025.103255. eCollection 2025 Jul.

Exercise rehabilitation following intensive care unit discharge for recovery from critical illness.重症监护病房出院后进行运动康复以促进危重症恢复。

Cochrane Database Syst Rev. 2015 Jun 22;2015(6):CD008632. doi: 10.1002/14651858.CD008632.pub2.

Automated monitoring compared to standard care for the early detection of sepsis in critically ill patients.与标准护理相比，自动监测用于危重症患者脓毒症的早期检测

Cochrane Database Syst Rev. 2018 Jun 25;6(6):CD012404. doi: 10.1002/14651858.CD012404.pub2.

Health professionals' experience of teamwork education in acute hospital settings: a systematic review of qualitative literature.医疗专业人员在急症医院环境中团队合作教育的经验：对定性文献的系统综述

JBI Database System Rev Implement Rep. 2016 Apr;14(4):96-137. doi: 10.11124/JBISRIR-2016-1843.

引用本文的文献

Predictive Modeling of Acute Respiratory Distress Syndrome Using Machine Learning: Systematic Review and Meta-Analysis.使用机器学习对急性呼吸窘迫综合征进行预测建模：系统评价与荟萃分析

J Med Internet Res. 2025 May 13;27:e66615. doi: 10.2196/66615.

本文引用的文献

Characterization of Physiologic Patients' Response to Fluid Interventions in the Intensive Care Unit.描述 ICU 中生理患者对液体干预的反应特征。

Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:1402-1405. doi: 10.1109/EMBC48229.2022.9871512.

A Reinforcement Learning Application for Optimal Fluid and Vasopressor Interventions in Septic ICU Patients.强化学习在脓毒症 ICU 患者最佳液体和血管加压素干预中的应用。

Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:321-324. doi: 10.1109/EMBC48229.2022.9871055.

A novel artificial intelligence based intensive care unit monitoring system: using physiological waveforms to identify sepsis.一种新型人工智能重症监护监测系统：利用生理波形识别脓毒症。

Philos Trans A Math Phys Eng Sci. 2021 Dec 13;379(2212):20200252. doi: 10.1098/rsta.2020.0252. Epub 2021 Oct 25.

The autonomic nervous system in septic shock and its role as a future therapeutic target: a narrative review.脓毒性休克中的自主神经系统及其作为未来治疗靶点的作用：一篇叙述性综述。

Ann Intensive Care. 2021 May 17;11(1):80. doi: 10.1186/s13613-021-00869-7.

Reinforcement Learning for Clinical Decision Support in Critical Care: Comprehensive Review.强化学习在重症监护临床决策支持中的应用：全面综述。

J Med Internet Res. 2020 Jul 20;22(7):e18477. doi: 10.2196/18477.

Using Artificial Intelligence to Detect COVID-19 and Community-acquired Pneumonia Based on Pulmonary CT: Evaluation of the Diagnostic Accuracy.基于肺部 CT 的人工智能检测 COVID-19 和社区获得性肺炎：诊断准确性评估。

Radiology. 2020 Aug;296(2):E65-E71. doi: 10.1148/radiol.2020200905. Epub 2020 Mar 19.

Deep Reinforcement Learning for Optimal Critical Care Pain Management with Morphine using Dueling Double-Deep Q Networks.使用决斗双深度Q网络的深度强化学习实现吗啡对重症监护疼痛的优化管理

Annu Int Conf IEEE Eng Med Biol Soc. 2019 Jul;2019:3960-3963. doi: 10.1109/EMBC.2019.8857295.

Use of machine learning to analyse routinely collected intensive care unit data: a systematic review.运用机器学习分析常规收集的重症监护病房数据：系统评价。

Crit Care. 2019 Aug 22;23(1):284. doi: 10.1186/s13054-019-2564-9.

Derivation, Validation, and Potential Treatment Implications of Novel Clinical Phenotypes for Sepsis.新型败血症临床表型的推导、验证及潜在治疗意义。

JAMA. 2019 May 28;321(20):2003-2017. doi: 10.1001/jama.2019.5791.

Deep Reinforcement Learning and Simulation as a Path Toward Precision Medicine.深度强化学习与模拟：通往精准医学之路

J Comput Biol. 2019 Jun;26(6):597-604. doi: 10.1089/cmb.2018.0168. Epub 2019 Jan 25.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种用于重症监护中优化治疗策略的强化学习模型：心肺特征作用的评估

A Reinforcement Learning Model for Optimal Treatment Strategies in Intensive Care: Assessment of the Role of Cardiorespiratory Features.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献