An interpretable RL framework for pre-deployment modeling in ICU hypotension management.

Authors

Zhang Kristine, Wang Henry, Du Jianzhun, Chu Brian, Arévalo Aldo Robles, Kindle Ryan, Celi Leo Anthony, Doshi-Velez Finale

Affiliations

Harvard University, Cambridge, MA, USA.

IDMEC, Instituto Superior Técnico - Universidade de Lisboa, NTT DATA Portugal, Lisbon, Portugal.

Publication information

NPJ Digit Med. 2022 Nov 18;5(1):173. doi: 10.1038/s41746-022-00708-4.

DOI:10.1038/s41746-022-00708-4

PMID:36396808

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC9671896/

Abstract

Computational methods from reinforcement learning have shown promise in inferring treatment strategies for hypotension management and other clinical decision-making challenges. Unfortunately, the resulting models are often difficult for clinicians to interpret, which makes it hard to inspect and validate these computationally derived strategies before deployment. In this work, we develop a general framework for identifying succinct sets of clinical contexts in which clinicians make very different treatment choices, tracing the effects of those choices, and inferring a set of recommendations for those specific contexts. By focusing on these few key decision points, our framework produces succinct, interpretable treatment strategies that can each be easily visualized and verified by clinical experts. This interrogation process allows clinicians to combine the model's use of historical data with their own expertise to determine which recommendations are worth investigating further, for example at the bedside. We demonstrate the value of this approach by applying it to hypotension management in the ICU, an area with critical implications for patient outcomes that lacks data-driven individualized treatment strategies. That said, our framework has broad implications for how computational methods can assist with decision-making challenges across a wide range of clinical domains.
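The three steps named in the abstract (find contexts where clinicians' choices diverge, trace what followed each choice, and derive a recommendation for those contexts) can be illustrated with a toy sketch. This is not the authors' method: the abstract does not specify how contexts are defined, how divergence is measured, or how effects are estimated. The version below assumes discrete context labels, uses Shannon entropy of the recorded actions as a hypothetical divergence measure, and compares naive mean outcomes, which ignores confounding. All function names, thresholds, and records are made up for illustration.

```python
from collections import Counter, defaultdict
from math import log2


def action_entropy(actions):
    """Shannon entropy (bits) of the empirical action distribution."""
    counts = Counter(actions)
    total = sum(counts.values())
    return -sum((c / total) * log2(c / total) for c in counts.values())


def find_decision_points(records, min_entropy=0.9, min_count=4):
    """Return contexts whose recorded actions vary enough to be of interest.

    records: iterable of (context, action, outcome) tuples.
    min_entropy and min_count are illustrative thresholds, not from the paper.
    """
    by_context = defaultdict(list)
    for context, action, outcome in records:
        by_context[context].append((action, outcome))

    points = {}
    for context, rows in by_context.items():
        if len(rows) < min_count:
            continue  # too few observations to say anything
        if action_entropy([a for a, _ in rows]) >= min_entropy:
            points[context] = rows
    return points


def recommend(points):
    """For each decision point, pick the action with the best mean outcome.

    A naive observational comparison; a real analysis would need to
    account for confounding and uncertainty.
    """
    recs = {}
    for context, rows in points.items():
        outcomes = defaultdict(list)
        for action, outcome in rows:
            outcomes[action].append(outcome)
        means = {a: sum(v) / len(v) for a, v in outcomes.items()}
        recs[context] = (max(means, key=means.get), means)
    return recs


# Made-up records: (context label, action taken, outcome score in [0, 1]).
records = [
    ("low_MAP_high_lactate", "vasopressor", 1.0),
    ("low_MAP_high_lactate", "vasopressor", 1.0),
    ("low_MAP_high_lactate", "fluid", 0.0),
    ("low_MAP_high_lactate", "fluid", 1.0),
    ("normal_MAP", "none", 1.0),
    ("normal_MAP", "none", 1.0),
    ("normal_MAP", "none", 1.0),
    ("normal_MAP", "none", 0.0),
]

points = find_decision_points(records)
recommendations = recommend(points)
```

In this toy data only the context where clinicians split between two actions is flagged as a decision point. The per-action means returned alongside each recommendation are the kind of small, inspectable summary a clinician could check against their own judgment.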

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc71/9671896/c2630a4cfe5a/41746_2022_708_Fig1_HTML.jpg
