• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用用于区域提议和检测的深度神经网络在机器人辅助手术视频中检测和定位机器人工具。

Detection and Localization of Robotic Tools in Robot-Assisted Surgery Videos Using Deep Neural Networks for Region Proposal and Detection.

出版信息

IEEE Trans Med Imaging. 2017 Jul;36(7):1542-1549. doi: 10.1109/TMI.2017.2665671. Epub 2017 Feb 8.

DOI:10.1109/TMI.2017.2665671
PMID:28186883
Abstract

Video understanding of robot-assisted surgery (RAS) videos is an active research area. Modeling the gestures and skill level of surgeons presents an interesting problem. The insights drawn may be applied in effective skill acquisition, objective skill assessment, real-time feedback, and human-robot collaborative surgeries. We propose a solution to the tool detection and localization open problem in RAS video understanding, using a strictly computer vision approach and the recent advances of deep learning. We propose an architecture using multimodal convolutional neural networks for fast detection and localization of tools in RAS videos. To the best of our knowledge, this approach will be the first to incorporate deep neural networks for tool detection and localization in RAS videos. Our architecture applies a region proposal network (RPN) and a multimodal two stream convolutional network for object detection to jointly predict objectness and localization on a fusion of image and temporal motion cues. Our results with an average precision of 91% and a mean computation time of 0.1 s per test frame detection indicate that our study is superior to conventionally used methods for medical imaging while also emphasizing the benefits of using RPN for precision and efficiency. We also introduce a new data set, ATLAS Dione, for RAS video understanding. Our data set provides video data of ten surgeons from Roswell Park Cancer Institute, Buffalo, NY, USA, performing six different surgical tasks on the daVinci Surgical System (dVSS) with annotations of robotic tools per frame.

摘要

机器人辅助手术(RAS)视频的理解是一个活跃的研究领域。对外科医生的手势和技能水平进行建模是一个有趣的问题。从中得出的见解可以应用于有效的技能获取、客观的技能评估、实时反馈和人机协作手术。我们提出了一种使用严格的计算机视觉方法和深度学习的最新进展来解决 RAS 视频理解中的工具检测和定位难题的解决方案。我们提出了一种使用多模态卷积神经网络的架构,用于快速检测和定位 RAS 视频中的工具。据我们所知,这种方法将是第一个将深度学习网络用于 RAS 视频中的工具检测和定位的方法。我们的架构应用了区域提议网络(RPN)和多模态双流卷积网络进行目标检测,以联合预测图像和时间运动线索融合上的目标性和定位。我们的结果平均精度为 91%,每个测试帧的平均计算时间为 0.1 秒,这表明我们的研究优于传统的医学成像方法,同时也强调了使用 RPN 提高精度和效率的好处。我们还引入了一个新的数据集 ATLAS Dione,用于 RAS 视频理解。我们的数据集提供了来自美国纽约州布法罗市罗斯韦尔公园癌症研究所的十位外科医生的视频数据,他们在达芬奇手术系统(dVSS)上执行了六个不同的手术任务,每个框架都有机器人工具的注释。

相似文献

1
Detection and Localization of Robotic Tools in Robot-Assisted Surgery Videos Using Deep Neural Networks for Region Proposal and Detection.使用用于区域提议和检测的深度神经网络在机器人辅助手术视频中检测和定位机器人工具。
IEEE Trans Med Imaging. 2017 Jul;36(7):1542-1549. doi: 10.1109/TMI.2017.2665671. Epub 2017 Feb 8.
2
Deep learning with convolutional neural network for objective skill evaluation in robot-assisted surgery.基于卷积神经网络的深度学习在机器人辅助手术中的客观技能评估。
Int J Comput Assist Radiol Surg. 2018 Dec;13(12):1959-1970. doi: 10.1007/s11548-018-1860-1. Epub 2018 Sep 25.
3
Video-based surgical skill assessment using 3D convolutional neural networks.基于视频的三维卷积神经网络手术技能评估。
Int J Comput Assist Radiol Surg. 2019 Jul;14(7):1217-1225. doi: 10.1007/s11548-019-01995-1. Epub 2019 May 18.
4
Real-time surgical instrument detection in robot-assisted surgery using a convolutional neural network cascade.使用卷积神经网络级联在机器人辅助手术中进行实时手术器械检测
Healthc Technol Lett. 2019 Nov 26;6(6):275-279. doi: 10.1049/htl.2019.0064. eCollection 2019 Dec.
5
Video recognition of simple mastoidectomy using convolutional neural networks: Detection and segmentation of surgical tools and anatomical regions.基于卷积神经网络的单纯乳突切除术视频识别:手术器械和解剖区域的检测和分割。
Comput Methods Programs Biomed. 2021 Sep;208:106251. doi: 10.1016/j.cmpb.2021.106251. Epub 2021 Jun 25.
6
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.
7
System events: readily accessible features for surgical phase detection.系统事件:用于手术阶段检测的易于访问的功能。
Int J Comput Assist Radiol Surg. 2016 Jun;11(6):1201-9. doi: 10.1007/s11548-016-1409-0. Epub 2016 May 13.
8
Video Salient Object Detection via Fully Convolutional Networks.基于全卷积网络的视频显著目标检测
IEEE Trans Image Process. 2018;27(1):38-49. doi: 10.1109/TIP.2017.2754941.
9
Real-time multiple spatiotemporal action localization and prediction approach using deep learning.基于深度学习的实时多时空动作定位与预测方法。
Neural Netw. 2020 Aug;128:331-344. doi: 10.1016/j.neunet.2020.05.017. Epub 2020 May 19.
10
Automated tool detection with deep learning for monitoring kinematics and eye-hand coordination in microsurgery.用于显微手术中运动学和眼手协调监测的深度学习自动工具检测
Comput Biol Med. 2022 Feb;141:105121. doi: 10.1016/j.compbiomed.2021.105121. Epub 2021 Dec 11.

引用本文的文献

1
The intelligent lift: Artificial Intelligence's growing role in plastic surgery - a comprehensive review.智能提升:人工智能在整形手术中日益重要的作用——全面综述
Front Surg. 2025 Aug 5;12:1640588. doi: 10.3389/fsurg.2025.1640588. eCollection 2025.
2
DeepVinci: Organ and Tool Segmentation with Edge Supervision and a Densely Multi-Scale Pyramid Module for Robot-Assisted Surgery.深度达芬奇:用于机器人辅助手术的基于边缘监督和密集多尺度金字塔模块的器官与工具分割
Diagnostics (Basel). 2025 Jul 30;15(15):1917. doi: 10.3390/diagnostics15151917.
3
Impact of regional digital transformation on public health: an empirical analysis based on 31 provinces in China.
区域数字转型对公共卫生的影响:基于中国31个省份的实证分析。
BMC Public Health. 2025 Jul 17;25(1):2485. doi: 10.1186/s12889-025-23670-8.
4
Automated assessment of simulated laparoscopic surgical skill performance using deep learning.使用深度学习对模拟腹腔镜手术技能表现进行自动评估。
Sci Rep. 2025 Apr 19;15(1):13591. doi: 10.1038/s41598-025-96336-5.
5
A dual-instrument Kalman-based tracker to enhance robustness of microsurgical tools tracking.基于双仪器卡尔曼滤波器的跟踪器,增强微创手术器械跟踪的鲁棒性。
Int J Comput Assist Radiol Surg. 2024 Dec;19(12):2351-2362. doi: 10.1007/s11548-024-03246-4. Epub 2024 Aug 12.
6
Real-Time Tool Localization for Laparoscopic Surgery Using Convolutional Neural Network.基于卷积神经网络的腹腔镜手术实时工具定位
Sensors (Basel). 2024 Jun 27;24(13):4191. doi: 10.3390/s24134191.
7
Real-time surgical tool detection with multi-scale positional encoding and contrastive learning.基于多尺度位置编码和对比学习的实时手术工具检测
Healthc Technol Lett. 2023 Dec 7;11(2-3):48-58. doi: 10.1049/htl2.12060. eCollection 2024 Apr-Jun.
8
YOLOv7-RepFPN: Improving real-time performance of laparoscopic tool detection on embedded systems.YOLOv7-RepFPN:提升嵌入式系统上腹腔镜工具检测的实时性能
Healthc Technol Lett. 2024 Jan 23;11(2-3):157-166. doi: 10.1049/htl2.12072. eCollection 2024 Apr-Jun.
9
DBH-YOLO: a surgical instrument detection method based on feature separation in laparoscopic surgery.DBH-YOLO:一种基于腔镜手术中特征分离的手术器械检测方法。
Int J Comput Assist Radiol Surg. 2024 Nov;19(11):2215-2225. doi: 10.1007/s11548-024-03115-0. Epub 2024 Apr 13.
10
CLAD-Net: cross-layer aggregation attention network for real-time endoscopic instrument detection.CLAD-Net:用于实时内镜器械检测的跨层聚合注意力网络。
Health Inf Sci Syst. 2023 Nov 27;11(1):58. doi: 10.1007/s13755-023-00260-9. eCollection 2023 Dec.