• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

强化学习在虚拟机器人手术模拟中的集成。

Integration of Reinforcement Learning in a Virtual Robotic Surgical Simulation.

机构信息

12228Yale University School of Medicine, New Haven, CT, USA.

Vector Institute and Department of Computer Science, University of Toronto, Toronto, ON, Canada.

出版信息

Surg Innov. 2023 Feb;30(1):94-102. doi: 10.1177/15533506221095298. Epub 2022 May 3.

DOI:10.1177/15533506221095298
PMID:35503302
Abstract

The revolutions in AI hold tremendous capacity to augment human achievements in surgery, but robust integration of deep learning algorithms with high-fidelity surgical simulation remains a challenge. We present a novel application of reinforcement learning (RL) for automating surgical maneuvers in a graphical simulation. In the Unity3D game engine, the Machine Learning-Agents package was integrated with the NVIDIA FleX particle simulator for developing autonomously behaving RL-trained scissors. Proximal Policy Optimization (PPO) was used to reward movements and desired behavior such as movement along desired trajectory and optimized cutting maneuvers along the deformable tissue-like object. Constant and proportional reward functions were tested, and TensorFlow analytics was used to informed hyperparameter tuning and evaluate performance. RL-trained scissors reliably manipulated the rendered tissue that was simulated with soft-tissue properties. A desirable trajectory of the autonomously behaving scissors was achieved along 1 axis. Proportional rewards performed better compared to constant rewards. Cumulative reward and PPO metrics did not consistently improve across RL-trained scissors in the setting for movement across 2 axes (horizontal and depth). Game engines hold promising potential for the design and implementation of RL-based solutions to simulated surgical subtasks. Task completion was sufficiently achieved in one-dimensional movement in simulations with and without tissue-rendering. Further work is needed to optimize network architecture and parameter tuning for increasing complexity.

摘要

人工智能的革命具有极大的潜力,可以增强人类在手术方面的成就,但将深度学习算法与高保真手术模拟进行稳健整合仍然是一个挑战。我们提出了一种在图形模拟中自动执行手术操作的强化学习 (RL) 的新应用。在 Unity3D 游戏引擎中,集成了 Machine Learning-Agents 包与 NVIDIA FleX 粒子模拟器,以开发自主行为的 RL 训练剪刀。使用近端策略优化 (PPO) 来奖励运动和期望行为,例如沿期望轨迹运动和沿可变形组织样物体优化切割操作。测试了常数和比例奖励函数,并使用 TensorFlow 分析来通知超参数调整和评估性能。RL 训练的剪刀可靠地操纵了具有软组织属性的渲染组织。自主行为的剪刀沿着 1 个轴实现了期望的轨迹。与常数奖励相比,比例奖励表现更好。在两个轴(水平和深度)上移动的设置中,累积奖励和 PPO 指标并没有随着 RL 训练剪刀的一致性提高而提高。游戏引擎为设计和实现基于 RL 的模拟手术子任务解决方案提供了很大的潜力。在有组织渲染和无组织渲染的模拟中,在一维运动中都可以充分完成任务。需要进一步的工作来优化网络架构和参数调整以增加复杂性。

相似文献

1
Integration of Reinforcement Learning in a Virtual Robotic Surgical Simulation.强化学习在虚拟机器人手术模拟中的集成。
Surg Innov. 2023 Feb;30(1):94-102. doi: 10.1177/15533506221095298. Epub 2022 May 3.
2
Human locomotion with reinforcement learning using bioinspired reward reshaping strategies.基于生物启发式奖励重塑策略的强化学习的人类运动。
Med Biol Eng Comput. 2021 Jan;59(1):243-256. doi: 10.1007/s11517-020-02309-3. Epub 2021 Jan 8.
3
Model-Based Reinforcement Learning with Automated Planning for Network Management.基于模型的强化学习与自动化规划在网络管理中的应用。
Sensors (Basel). 2022 Aug 22;22(16):6301. doi: 10.3390/s22166301.
4
Training an Actor-Critic Reinforcement Learning Controller for Arm Movement Using Human-Generated Rewards.使用人类生成的奖励训练用于手臂运动的 Actor-Critic 强化学习控制器。
IEEE Trans Neural Syst Rehabil Eng. 2017 Oct;25(10):1892-1905. doi: 10.1109/TNSRE.2017.2700395. Epub 2017 May 2.
5
Continuous action deep reinforcement learning for propofol dosing during general anesthesia.全身麻醉期间丙泊酚给药的连续动作深度强化学习
Artif Intell Med. 2022 Jan;123:102227. doi: 10.1016/j.artmed.2021.102227. Epub 2021 Dec 2.
6
Neuro-Inspired Reinforcement Learning to Improve Trajectory Prediction in Reward-Guided Behavior.神经启发式强化学习改进奖励导向行为中的轨迹预测。
Int J Neural Syst. 2022 Sep;32(9):2250038. doi: 10.1142/S0129065722500381. Epub 2022 Aug 19.
7
Learning intraoperative organ manipulation with context-based reinforcement learning.基于上下文的强化学习来学习术中器官操作。
Int J Comput Assist Radiol Surg. 2022 Aug;17(8):1419-1427. doi: 10.1007/s11548-022-02630-2. Epub 2022 May 3.
8
ASAP-CORPS: A Semi-Autonomous Platform for COntact-Rich Precision Surgery.ASAP-CORPS:一种用于接触丰富的精准手术的半自主平台。
Mil Med. 2023 Nov 8;188(Suppl 6):412-419. doi: 10.1093/milmed/usad175.
9
Robot-assisted motor training: assistance decreases exploration during reinforcement learning.机器人辅助运动训练:在强化学习过程中,辅助会减少探索行为。
Annu Int Conf IEEE Eng Med Biol Soc. 2014;2014:3516-20. doi: 10.1109/EMBC.2014.6944381.
10
Combining STDP and binary networks for reinforcement learning from images and sparse rewards.结合 STDP 和二进制网络,从图像和稀疏奖励中进行强化学习。
Neural Netw. 2021 Dec;144:496-506. doi: 10.1016/j.neunet.2021.09.010. Epub 2021 Sep 17.

引用本文的文献

1
AI for IMPACTS Framework for Evaluating the Long-Term Real-World Impacts of AI-Powered Clinician Tools: Systematic Review and Narrative Synthesis.用于评估人工智能驱动的临床医生工具长期现实世界影响的AI for IMPACTS框架:系统评价与叙述性综合分析
J Med Internet Res. 2025 Feb 5;27:e67485. doi: 10.2196/67485.
2
Enhancing Medical Training Through Learning From Mistakes by Interacting With an Ill-Trained Reinforcement Learning Agent.通过与训练不佳的强化学习智能体交互从错误中学习来加强医学培训。
IEEE Trans Learn Technol. 2024;17:1248-1260. doi: 10.1109/tlt.2024.3372508. Epub 2024 Mar 4.