• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用强化学习优化心脏瓣膜置换术后住院患者华法林抗凝的动态治疗方案。

Optimizing the dynamic treatment regime of in-hospital warfarin anticoagulation in patients after surgical valve replacement using reinforcement learning.

机构信息

National Clinical Research Center of Cardiovascular Diseases, Fuwai Hospital, National Center for Cardiovascular Diseases, Beijing, People's Republic of China.

State Key Laboratory of Cardiovascular Disease, Fuwai Hospital, National Center for Cardiovascular Diseases, Beijing, People's Republic of China.

出版信息

J Am Med Inform Assoc. 2022 Sep 12;29(10):1722-1732. doi: 10.1093/jamia/ocac088.

DOI:10.1093/jamia/ocac088
PMID:35864720
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9471704/
Abstract

OBJECTIVE

Warfarin anticoagulation management requires sequential decision-making to adjust dosages based on patients' evolving states continuously. We aimed to leverage reinforcement learning (RL) to optimize the dynamic in-hospital warfarin dosing in patients after surgical valve replacement (SVR).

MATERIALS AND METHODS

10 408 SVR cases with warfarin dosage-response data were retrospectively collected to develop and test an RL algorithm that can continuously recommend daily warfarin doses based on patients' evolving multidimensional states. The RL algorithm was compared with clinicians' actual practice and other machine learning and clinical decision rule-based algorithms. The primary outcome was the ratio of patients without in-hospital INRs >3.0 and the INR at discharge within the target range (1.8-2.5) (excellent responders). The secondary outcomes were the safety responder ratio (no INRs >3.0) and the target responder ratio (the discharge INR within 1.8-2.5).

RESULTS

In the test set (n = 1260), the excellent responder ratio under clinicians' guidance was significantly lower than the RL algorithm: 41.6% versus 80.8% (relative risk [RR], 0.51; 95% confidence interval [CI], 0.48-0.55), also the safety responder ratio: 83.1% versus 99.5% (RR, 0.83; 95% CI, 0.81-0.86), and the target responder ratio: 49.7% versus 81.1% (RR, 0.61; 95% CI, 0.58-0.65). The RL algorithms performed significantly better than all the other algorithms. Compared with clinicians' actual practice, the RL-optimized INR trajectory reached and maintained within the target range significantly faster and longer.

DISCUSSION

RL could offer interactive, practical clinical decision support for sequential decision-making tasks and is potentially adaptable for varied clinical scenarios. Prospective validation is needed.

CONCLUSION

An RL algorithm significantly optimized the post-operation warfarin anticoagulation quality compared with clinicians' actual practice, suggesting its potential for challenging sequential decision-making tasks.

摘要

目的

华法林抗凝管理需要连续决策,根据患者不断变化的状态调整剂量。本研究旨在利用强化学习(RL)优化心脏瓣膜置换术后患者住院期间华法林的动态剂量。

材料和方法

回顾性收集了 10408 例心脏瓣膜置换术后患者的华法林剂量反应数据,用于开发和测试一种 RL 算法,该算法可以根据患者不断变化的多维状态持续推荐每日华法林剂量。将 RL 算法与临床医生的实际实践以及其他基于机器学习和临床决策规则的算法进行比较。主要结局为住院期间 INR >3.0 和出院时 INR 处于目标范围(1.8-2.5)(优秀反应者)的患者比例。次要结局为安全反应者比例(无 INR >3.0)和目标反应者比例(出院 INR 处于 1.8-2.5)。

结果

在测试集(n=1260)中,临床医生指导下的优秀反应者比例明显低于 RL 算法:41.6%比 80.8%(相对风险 [RR],0.51;95%置信区间 [CI],0.48-0.55),安全反应者比例:83.1%比 99.5%(RR,0.83;95% CI,0.81-0.86),目标反应者比例:49.7%比 81.1%(RR,0.61;95% CI,0.58-0.65)。RL 算法的性能明显优于所有其他算法。与临床医生的实际实践相比,RL 优化的 INR 轨迹更快、更长时间地达到并保持在目标范围内。

讨论

RL 可以为连续决策任务提供互动式、实用的临床决策支持,并且具有潜在的适应性,可以应用于各种临床场景。需要前瞻性验证。

结论

与临床医生的实际实践相比,RL 算法显著优化了术后华法林抗凝质量,这表明其在具有挑战性的连续决策任务中有应用潜力。

相似文献

1
Optimizing the dynamic treatment regime of in-hospital warfarin anticoagulation in patients after surgical valve replacement using reinforcement learning.利用强化学习优化心脏瓣膜置换术后住院患者华法林抗凝的动态治疗方案。
J Am Med Inform Assoc. 2022 Sep 12;29(10):1722-1732. doi: 10.1093/jamia/ocac088.
2
Evaluation of Machine Learning Approaches for Predicting Warfarin Discharge Dose in Cardiac Surgery Patients: Retrospective Algorithm Development and Validation Study.评估用于预测心脏手术患者华法林出院剂量的机器学习方法:回顾性算法开发与验证研究。
JMIR Cardio. 2023 Dec 6;7:e47262. doi: 10.2196/47262.
3
Effect of post-operative anticoagulation management in patients who have undergone On-X mechanical heart valve replacement surgery on post-discharge warfarin therapy.经 On-X 机械心脏瓣膜置换术后患者的术后抗凝管理对出院后华法林治疗的影响。
J Clin Pharm Ther. 2020 Aug;45(4):767-773. doi: 10.1111/jcpt.13164. Epub 2020 May 15.
4
Pattern of anticoagulation control after heart valve surgery at the Kenyatta National Hospital, Nairobi.内罗毕肯雅塔国家医院心脏瓣膜手术后的抗凝控制模式。
East Afr Med J. 2000 Jul;77(7):354-8. doi: 10.4314/eamj.v77i7.46667.
5
Efficacy and safety of a pharmacist-managed inpatient anticoagulation service for warfarin initiation and titration.药师管理的住院患者抗凝治疗启动和滴定服务的疗效和安全性。
J Clin Pharm Ther. 2011 Oct;36(5):585-91. doi: 10.1111/j.1365-2710.2010.01216.x. Epub 2010 Nov 12.
6
Fiix-prothrombin time versus standard prothrombin time for monitoring of warfarin anticoagulation: a single centre, double-blind, randomised, non-inferiority trial.用于监测华法林抗凝治疗的Fiix-凝血酶原时间与标准凝血酶原时间对比:一项单中心、双盲、随机、非劣效性试验
Lancet Haematol. 2015 Jun;2(6):e231-40. doi: 10.1016/S2352-3026(15)00073-3. Epub 2015 May 25.
7
[The multicenter study on the registration and follow-up of low anticoagulation therapy for the heart valve operation in China].[中国心脏瓣膜手术低抗凝治疗登记与随访多中心研究]
Zhonghua Yi Xue Za Zhi. 2016 May 24;96(19):1489-94. doi: 10.3760/cma.j.issn.0376-2491.2016.19.006.
8
Association between CYP2C9 genetic variants and anticoagulation-related outcomes during warfarin therapy.华法林治疗期间CYP2C9基因变异与抗凝相关结局的关联。
JAMA. 2002 Apr 3;287(13):1690-8. doi: 10.1001/jama.287.13.1690.
9
Optimizing warfarin dosing using deep reinforcement learning.使用深度强化学习优化华法林剂量。
J Biomed Inform. 2023 Jan;137:104267. doi: 10.1016/j.jbi.2022.104267. Epub 2022 Dec 7.
10
Anticoagulation-related outcomes in patients receiving warfarin after starting levofloxacin or gatifloxacin.开始使用左氧氟沙星或加替沙星后接受华法林治疗的患者的抗凝相关结局。
Pharmacotherapy. 2006 Oct;26(10):1446-52. doi: 10.1592/phco.26.10.1446.

引用本文的文献

1
Reinforcement Learning and Its Clinical Applications Within Healthcare: A Systematic Review of Precision Medicine and Dynamic Treatment Regimes.强化学习及其在医疗保健领域的临床应用:精准医学与动态治疗方案的系统综述
Healthcare (Basel). 2025 Jul 19;13(14):1752. doi: 10.3390/healthcare13141752.
2
Efficacy, Safety, and Cost-Effectiveness of "Internet + Pharmacy Care" Via the Alfalfa App in Warfarin Therapy Management After Cardiac Valve Replacement: Randomized Controlled Trial.通过紫花苜蓿应用程序进行“互联网 + 药学服务”在心脏瓣膜置换术后华法林治疗管理中的疗效、安全性及成本效益:随机对照试验
JMIR Mhealth Uhealth. 2025 May 20;13:e53586. doi: 10.2196/53586.
3
Chronic Kidney Disease-Mineral and Bone Disorder Management in 4D: The Case for Dynamic Treatment Regime Methods to Optimize Care.4D 模式下的慢性肾脏病 - 矿物质和骨异常管理:采用动态治疗方案方法优化护理的理由
Curr Osteoporos Rep. 2025 Mar 25;23(1):16. doi: 10.1007/s11914-025-00911-8.
4
Cardiac Surgery-Specific Subtle Perioperative Serum Creatinine Change in Defining Acute Kidney Injury After Coronary Surgery.心脏手术特异性围手术期血清肌酐细微变化在冠状动脉手术后急性肾损伤定义中的应用
JACC Adv. 2024 Oct 15;3(11):101326. doi: 10.1016/j.jacadv.2024.101326. eCollection 2024 Nov.
5
Individualized decision making in on-scene resuscitation time for out-of-hospital cardiac arrest using reinforcement learning.使用强化学习进行院外心脏骤停现场复苏时间的个体化决策
NPJ Digit Med. 2024 Oct 9;7(1):276. doi: 10.1038/s41746-024-01278-3.
6
A Mobile Application for Anticoagulation Management in Patients After Heart Valve Replacement: A Usability Study.心脏瓣膜置换术后患者抗凝管理的移动应用程序:一项可用性研究。
Patient Prefer Adherence. 2024 Sep 30;18:2055-2066. doi: 10.2147/PPA.S471577. eCollection 2024.
7
Machine learning guided prediction of warfarin blood levels for personalized medicine based on clinical longitudinal data from cardiac surgery patients: a prospective observational study.基于心脏手术患者临床纵向数据的机器学习指导对华法林血药浓度进行预测以实现个性化医疗:一项前瞻性观察性研究。
Int J Surg. 2024 Oct 1;110(10):6528-6540. doi: 10.1097/JS9.0000000000001734.
8
Prediction of coronary artery disease based on facial temperature information captured by non-contact infrared thermography.基于非接触式红外热成像技术获取的面部温度信息预测冠状动脉疾病。
BMJ Health Care Inform. 2024 Jun 3;31(1):e100942. doi: 10.1136/bmjhci-2023-100942.
9
Personalizing renal replacement therapy initiation in the intensive care unit: a reinforcement learning-based strategy with external validation on the AKIKI randomized controlled trials.在重症监护病房中进行个体化肾脏替代治疗起始:基于强化学习的策略,并在 AKIKI 随机对照试验上进行了外部验证。
J Am Med Inform Assoc. 2024 Apr 19;31(5):1074-1083. doi: 10.1093/jamia/ocae004.
10
Application of eHealth Tools in Anticoagulation Management After Cardiac Valve Replacement: Scoping Review Coupled With Bibliometric Analysis.电子健康工具在心脏瓣膜置换术后抗凝管理中的应用:结合文献计量分析的系统评价。
JMIR Mhealth Uhealth. 2024 Jan 5;12:e48716. doi: 10.2196/48716.

本文引用的文献

1
2021 ESC/EACTS Guidelines for the management of valvular heart disease.2021年欧洲心脏病学会/欧洲心胸外科学会瓣膜性心脏病管理指南。
Eur Heart J. 2022 Feb 12;43(7):561-632. doi: 10.1093/eurheartj/ehab395.
2
Development of a system to support warfarin dose decisions using deep neural networks.开发一个使用深度神经网络支持华法林剂量决策的系统。
Sci Rep. 2021 Jul 20;11(1):14745. doi: 10.1038/s41598-021-94305-2.
3
Warfarin maintenance dose prediction for Chinese after heart valve replacement by a feedforward neural network with equal stratified sampling.基于等分层抽样的前馈神经网络预测中国心脏瓣膜置换术后华法林维持剂量。
Sci Rep. 2021 Jul 2;11(1):13778. doi: 10.1038/s41598-021-93317-2.
4
An Adapted Neural-Fuzzy Inference System Model Using Preprocessed Balance Data to Improve the Predictive Accuracy of Warfarin Maintenance Dosing in Patients After Heart Valve Replacement.一种使用预处理平衡数据的自适应神经模糊推理系统模型,以提高心脏瓣膜置换术后患者华法林维持剂量的预测准确性。
Cardiovasc Drugs Ther. 2022 Oct;36(5):879-889. doi: 10.1007/s10557-021-07191-1. Epub 2021 Apr 20.
5
Personalized Multimorbidity Management for Patients with Type 2 Diabetes Using Reinforcement Learning of Electronic Health Records.基于电子健康记录的强化学习实现 2 型糖尿病患者多病种的个体化管理。
Drugs. 2021 Mar;81(4):471-482. doi: 10.1007/s40265-020-01435-4.
6
Evaluating warfarin dosing models on multiple datasets with a novel software framework and evolutionary optimisation.运用新型软件框架和进化优化算法,在多个数据集上评估华法林剂量模型。
J Biomed Inform. 2021 Jan;113:103634. doi: 10.1016/j.jbi.2020.103634. Epub 2020 Nov 30.
7
A Reinforcement Learning-Based Method for Management of Type 1 Diabetes: Exploratory Study.一种基于强化学习的1型糖尿病管理方法:探索性研究。
JMIR Diabetes. 2019 Aug 28;4(3):e12905. doi: 10.2196/12905.
8
Inverse reinforcement learning for intelligent mechanical ventilation and sedative dosing in intensive care units.重症监护病房中智能机械通气和镇静药物剂量的逆强化学习。
BMC Med Inform Decis Mak. 2019 Apr 9;19(Suppl 2):57. doi: 10.1186/s12911-019-0763-6.
9
High-performance medicine: the convergence of human and artificial intelligence.高性能医学:人机智能融合。
Nat Med. 2019 Jan;25(1):44-56. doi: 10.1038/s41591-018-0300-7. Epub 2019 Jan 7.
10
The Artificial Intelligence Clinician learns optimal treatment strategies for sepsis in intensive care.人工智能临床医生学习重症监护中脓毒症的最佳治疗策略。
Nat Med. 2018 Nov;24(11):1716-1720. doi: 10.1038/s41591-018-0213-5. Epub 2018 Oct 22.