• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过与训练不佳的强化学习智能体交互从错误中学习来加强医学培训。

Enhancing Medical Training Through Learning From Mistakes by Interacting With an Ill-Trained Reinforcement Learning Agent.

作者信息

Kakdas Yasar C, Kockara Sinan, Halic Tansel, Demirel Doga

机构信息

Florida Polytechnic Univ., Dept. of Computer Science, Lakeland, FL, USA 33805.

Rice Univ., Dept. of Computer Science, Houston, TX, USA 77005.

出版信息

IEEE Trans Learn Technol. 2024;17:1248-1260. doi: 10.1109/tlt.2024.3372508. Epub 2024 Mar 4.

DOI:10.1109/tlt.2024.3372508
PMID:39431279
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11486497/
Abstract

This study presents a 3D medical simulation that employs reinforcement learning (RL) and interactive reinforcement learning (IRL) to teach and assess the procedure of donning and doffing personal protective equipment (PPE). The simulation is motivated by the need for effective, safe, and remote training techniques in medicine, particularly in light of the COVID-19 pandemic. The simulation has two modes: a tutorial mode and an assessment mode. In the tutorial mode, a computer-based, ill-trained RL agent utilizes RL to learn the correct sequence of donning the PPE by trial and error. This allows students to experience many outlier cases they might not encounter in an in-class educational model. In the assessment mode, an IRL-based method is used to evaluate how effective the participant is at correcting the mistakes performed by the RL agent. Each time the RL agent interacts with the environment and performs an action, the participants provide positive or negative feedback regarding the action taken. Following the assessment, participants receive a score based on the accuracy of their feedback and the time taken for the RL agent to learn the correct sequence. An experiment was conducted using two groups, each consisting of 10 participants. The first group received RL-assisted training for donning PPE, followed by an IRL-based assessment. Meanwhile, the second group observed a video featuring the RL agent demonstrating only the correct donning order without outlier cases, replicating traditional training, before undergoing the same assessment as the first group. Results showed that RL-assisted training with many outlier cases was more effective than traditional training with only regular cases. Moreover, combining RL with IRL significantly enhanced the participants' performance. Notably, 90% of the participants finished the assessment with perfect scores within three iterations. In contrast, only 10% of those who did not engage in RL-assisted training finished the assessment with a perfect score, highlighting the substantial impact of RL and IRL integration on participants' overall achievement.

摘要

本研究展示了一种三维医学模拟,该模拟采用强化学习(RL)和交互式强化学习(IRL)来教授和评估穿脱个人防护装备(PPE)的过程。鉴于COVID-19大流行,对医学领域有效、安全和远程培训技术的需求推动了该模拟的发展。该模拟有两种模式:教程模式和评估模式。在教程模式中,一个基于计算机的、训练不足的强化学习智能体利用强化学习通过试错来学习正确的PPE穿戴顺序。这使学生能够体验到他们在课堂教育模式中可能不会遇到的许多异常情况。在评估模式中,一种基于交互式强化学习的方法用于评估参与者纠正强化学习智能体所犯错误的效果。每次强化学习智能体与环境交互并执行一个动作时,参与者会对所采取的动作提供正面或负面反馈。评估结束后,参与者会根据其反馈的准确性以及强化学习智能体学习正确顺序所需的时间获得一个分数。使用两组进行了一项实验,每组由10名参与者组成。第一组接受了穿戴PPE的强化学习辅助训练,随后进行基于交互式强化学习的评估。与此同时,第二组观看了一段视频,视频中强化学习智能体仅展示了正确的穿戴顺序,没有异常情况,这是传统训练的方式,然后与第一组进行相同的评估。结果表明,包含许多异常情况的强化学习辅助训练比仅包含常规情况的传统训练更有效。此外,将强化学习与交互式强化学习相结合显著提高了参与者的表现。值得注意的是,90%的参与者在三次迭代内以满分完成了评估。相比之下,未参与强化学习辅助训练的参与者中只有10%以满分完成了评估,这凸显了强化学习与交互式强化学习整合对参与者整体成绩的重大影响。

相似文献

1
Enhancing Medical Training Through Learning From Mistakes by Interacting With an Ill-Trained Reinforcement Learning Agent.通过与训练不佳的强化学习智能体交互从错误中学习来加强医学培训。
IEEE Trans Learn Technol. 2024;17:1248-1260. doi: 10.1109/tlt.2024.3372508. Epub 2024 Mar 4.
2
The Comparative Effectiveness of Virtual Reality Versus E-Module on the Training of Donning and Doffing Personal Protective Equipment: A Randomized, Simulation-Based Educational Study.虚拟现实与电子模块在个人防护装备穿脱培训中的比较效果:一项基于模拟的随机教育研究。
Cureus. 2022 Mar 30;14(3):e23655. doi: 10.7759/cureus.23655. eCollection 2022 Mar.
3
Personal protective equipment for preventing highly infectious diseases due to exposure to contaminated body fluids in healthcare staff.医护人员用于预防因接触受污染体液而感染高度传染性疾病的个人防护装备。
Cochrane Database Syst Rev. 2020 May 15;5(5):CD011621. doi: 10.1002/14651858.CD011621.pub5.
4
Personal protective equipment for preventing highly infectious diseases due to exposure to contaminated body fluids in healthcare staff.用于医护人员预防因接触受污染体液而感染高传染性疾病的个人防护装备。
Cochrane Database Syst Rev. 2020 Apr 15;4(4):CD011621. doi: 10.1002/14651858.CD011621.pub4.
5
Personal Protection Equipment Training as a Virtual Reality Game in Immersive Environments: Development Study and Pilot Randomized Controlled Trial.在沉浸式环境中将个人防护装备培训作为虚拟现实游戏:开发研究与试点随机对照试验
JMIR Serious Games. 2025 Mar 20;13:e69021. doi: 10.2196/69021.
6
Comparison of Repeated Video Display vs Combined Video Display and Live Demonstration as Training Methods to Healthcare Providers for Donning and Doffing Personal Protective Equipment: A Randomized Controlled Trial.重复视频展示与视频展示结合现场演示作为医疗保健提供者穿戴和脱卸个人防护装备培训方法的比较:一项随机对照试验。
Risk Manag Healthc Policy. 2020 Oct 29;13:2325-2335. doi: 10.2147/RMHP.S267514. eCollection 2020.
7
Comparing Training Techniques in Personal Protective Equipment Use.比较个人防护设备使用中的培训技巧。
Prehosp Disaster Med. 2020 Aug;35(4):364-371. doi: 10.1017/S1049023X20000564. Epub 2020 May 11.
8
Simulation-Based Mastery Learning Improves the Performance of Donning and Doffing of Personal Protective Equipment by Medical Students.基于模拟的掌握学习提高了医学生穿脱个人防护装备的表现。
West J Emerg Med. 2022 May 2;23(3):318-323. doi: 10.5811/westjem.2022.2.54748.
9
Personal protective equipment for preventing highly infectious diseases due to exposure to contaminated body fluids in healthcare staff.用于防止医护人员因接触受污染体液而感染高传染性疾病的个人防护装备。
Cochrane Database Syst Rev. 2019 Jul 1;7(7):CD011621. doi: 10.1002/14651858.CD011621.pub3.
10
Donning and Doffing of Personal Protective Equipment: Perceived Effectiveness of Virtual Simulation Training to Decrease COVID-19 Transmission and Contraction.个人防护装备的穿脱:虚拟模拟训练对降低新冠病毒传播与感染的感知有效性
Cureus. 2022 Mar 7;14(3):e22943. doi: 10.7759/cureus.22943. eCollection 2022 Mar.

引用本文的文献

1
Personal Protection Equipment Training as a Virtual Reality Game in Immersive Environments: Development Study and Pilot Randomized Controlled Trial.在沉浸式环境中将个人防护装备培训作为虚拟现实游戏:开发研究与试点随机对照试验
JMIR Serious Games. 2025 Mar 20;13:e69021. doi: 10.2196/69021.

本文引用的文献

1
Endoscopic sleeve gastroplasty: stomach location and task classification for evaluation using artificial intelligence.内镜袖状胃成形术:利用人工智能评估的胃位置和任务分类。
Int J Comput Assist Radiol Surg. 2024 Apr;19(4):635-644. doi: 10.1007/s11548-023-03054-2. Epub 2024 Jan 11.
2
An experimental machine learning study investigating the decision-making process of students and qualified radiographers when interpreting radiographic images.一项实验性机器学习研究,旨在调查学生和合格放射技师在解读X光图像时的决策过程。
PLOS Digit Health. 2023 Oct 25;2(10):e0000229. doi: 10.1371/journal.pdig.0000229. eCollection 2023 Oct.
3
Revolutionizing healthcare: the role of artificial intelligence in clinical practice.人工智能在临床实践中的应用:医疗保健的革命。
BMC Med Educ. 2023 Sep 22;23(1):689. doi: 10.1186/s12909-023-04698-z.
4
The applications of machine learning techniques in medical data processing based on distributed computing and the Internet of Things.基于分布式计算和物联网的机器学习技术在医学数据处理中的应用。
Comput Methods Programs Biomed. 2023 Nov;241:107745. doi: 10.1016/j.cmpb.2023.107745. Epub 2023 Aug 9.
5
Beyond Supervised Learning for Pervasive Healthcare.超越监督学习的普及医疗保健。
IEEE Rev Biomed Eng. 2024;17:42-62. doi: 10.1109/RBME.2023.3296938. Epub 2024 Jan 12.
6
Multi-stage optimal dynamic treatment regimes for survival outcomes with dependent censoring.用于具有相依删失的生存结局的多阶段最优动态治疗方案
Biometrika. 2022 Aug 13;110(2):395-410. doi: 10.1093/biomet/asac047. eCollection 2023 Jun.
7
Towards more efficient and robust evaluation of sepsis treatment with deep reinforcement learning.用深度强化学习更有效地评估脓毒症治疗效果。
BMC Med Inform Decis Mak. 2023 Mar 1;23(1):43. doi: 10.1186/s12911-023-02126-2.
8
Artificial intelligence aids in development of nanomedicines for cancer management.人工智能助力癌症管理的纳米药物研发。
Semin Cancer Biol. 2023 Feb;89:61-75. doi: 10.1016/j.semcancer.2023.01.005. Epub 2023 Jan 20.
9
Reinforcement learning in medical image analysis: Concepts, applications, challenges, and future directions.医学图像分析中的强化学习:概念、应用、挑战和未来方向。
J Appl Clin Med Phys. 2023 Feb;24(2):e13898. doi: 10.1002/acm2.13898. Epub 2023 Jan 10.
10
Reinforcement learning using Deep [Formula: see text] networks and [Formula: see text] learning accurately localizes brain tumors on MRI with very small training sets.基于深度[公式:见正文]网络和[公式:见正文]学习的强化学习能够使用非常小的训练集准确地对 MRI 上的脑肿瘤进行定位。
BMC Med Imaging. 2022 Dec 23;22(1):224. doi: 10.1186/s12880-022-00919-x.