• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于视频和运动学数据的外科手势分类。

Surgical gesture classification from video and kinematic data.

机构信息

Johns Hopkins University, 3400 North Charles Street, Baltimore, MD 21218, USA.

出版信息

Med Image Anal. 2013 Oct;17(7):732-45. doi: 10.1016/j.media.2013.04.007. Epub 2013 Apr 28.

DOI:10.1016/j.media.2013.04.007
PMID:23706754
Abstract

Much of the existing work on automatic classification of gestures and skill in robotic surgery is based on dynamic cues (e.g., time to completion, speed, forces, torque) or kinematic data (e.g., robot trajectories and velocities). While videos could be equally or more discriminative (e.g., videos contain semantic information not present in kinematic data), they are typically not used because of the difficulties associated with automatic video interpretation. In this paper, we propose several methods for automatic surgical gesture classification from video data. We assume that the video of a surgical task (e.g., suturing) has been segmented into video clips corresponding to a single gesture (e.g., grabbing the needle, passing the needle) and propose three methods to classify the gesture of each video clip. In the first one, we model each video clip as the output of a linear dynamical system (LDS) and use metrics in the space of LDSs to classify new video clips. In the second one, we use spatio-temporal features extracted from each video clip to learn a dictionary of spatio-temporal words, and use a bag-of-features (BoF) approach to classify new video clips. In the third one, we use multiple kernel learning (MKL) to combine the LDS and BoF approaches. Since the LDS approach is also applicable to kinematic data, we also use MKL to combine both types of data in order to exploit their complementarity. Our experiments on a typical surgical training setup show that methods based on video data perform equally well, if not better, than state-of-the-art approaches based on kinematic data. In turn, the combination of both kinematic and video data outperforms any other algorithm based on one type of data alone.

摘要

现有的许多关于机器人手术中手势和技能的自动分类工作都是基于动态线索(例如,完成时间、速度、力、扭矩)或运动学数据(例如,机器人轨迹和速度)。虽然视频可能同样具有区分性(例如,视频包含运动学数据中不存在的语义信息),但由于自动视频解释所带来的困难,通常不会使用视频。在本文中,我们提出了几种从视频数据中自动分类手术手势的方法。我们假设手术任务(例如缝合)的视频已经被分割成对应单个手势(例如抓取针、传递针)的视频片段,并提出了三种方法来分类每个视频片段的手势。在第一种方法中,我们将每个视频片段建模为线性动力系统(LDS)的输出,并使用 LDS 空间中的度量来对新的视频片段进行分类。在第二种方法中,我们使用从每个视频片段中提取的时空特征来学习时空词字典,并使用特征袋(BoF)方法对新的视频片段进行分类。在第三种方法中,我们使用多核学习(MKL)来结合 LDS 和 BoF 方法。由于 LDS 方法也适用于运动学数据,我们还使用 MKL 来结合这两种类型的数据,以利用它们的互补性。我们在典型的手术培训设置上的实验表明,基于视频数据的方法表现同样出色,如果不是更好的话,比基于运动学数据的最新方法。反过来,运动学和视频数据的结合优于任何其他仅基于一种类型数据的算法。

相似文献

1
Surgical gesture classification from video and kinematic data.基于视频和运动学数据的外科手势分类。
Med Image Anal. 2013 Oct;17(7):732-45. doi: 10.1016/j.media.2013.04.007. Epub 2013 Apr 28.
2
Surgical gesture classification from video data.基于视频数据的手术手势分类
Med Image Comput Comput Assist Interv. 2012;15(Pt 1):34-41. doi: 10.1007/978-3-642-33415-3_5.
3
Categorizing dynamic textures using a bag of dynamical systems.使用动态系统袋对动态纹理进行分类。
IEEE Trans Pattern Anal Mach Intell. 2013 Feb;35(2):342-53. doi: 10.1109/TPAMI.2012.83.
4
Recognizing gestures by learning local motion signatures of HOG descriptors.通过学习 HOG 描述符的局部运动特征来识别手势。
IEEE Trans Pattern Anal Mach Intell. 2012 Nov;34(11):2247-58. doi: 10.1109/TPAMI.2012.19.
5
Surgical gesture segmentation and recognition.手术手势分割与识别。
Med Image Comput Comput Assist Interv. 2013;16(Pt 3):339-46. doi: 10.1007/978-3-642-40760-4_43.
6
Real-time recognition of surgical tasks in eye surgery videos.眼外科手术视频中手术任务的实时识别。
Med Image Anal. 2014 Apr;18(3):579-90. doi: 10.1016/j.media.2014.02.007. Epub 2014 Feb 26.
7
Automatic detection and segmentation of robot-assisted surgical motions.机器人辅助手术动作的自动检测与分割
Med Image Comput Comput Assist Interv. 2005;8(Pt 1):802-10. doi: 10.1007/11566465_99.
8
Most probable longest common subsequence for recognition of gesture character input.用于识别手势字符输入的最可能最长公共子序列。
IEEE Trans Cybern. 2013 Jun;43(3):871-80. doi: 10.1109/TSMCB.2012.2217324. Epub 2012 Oct 3.
9
Robust face recognition from multi-view videos.从多角度视频中进行稳健的人脸识别。
IEEE Trans Image Process. 2014 Mar;23(3):1105-17. doi: 10.1109/TIP.2014.2300812.
10
A discriminative learning framework with pairwise constraints for video object classification.一种用于视频对象分类的带有成对约束的判别式学习框架。
IEEE Trans Pattern Anal Mach Intell. 2006 Apr;28(4):578-93. doi: 10.1109/TPAMI.2006.65.

引用本文的文献

1
Untangling surgical gesture analysis-are we even speaking the same language? a systematic review.解析手术手势分析——我们说的是同一种语言吗?一项系统综述。
Surg Endosc. 2025 Sep;39(9):5538-5557. doi: 10.1007/s00464-025-11907-x. Epub 2025 Jul 31.
2
Transforming Surgery With Artificial Intelligence: An Early Analysis of Private Industry Trends.用人工智能变革手术:对私营行业趋势的早期分析
Cureus. 2025 Apr 15;17(4):e82328. doi: 10.7759/cureus.82328. eCollection 2025 Apr.
3
Automatic gesture recognition and evaluation in peg transfer tasks of laparoscopic surgery training.
腹腔镜手术训练中栓子转移任务的自动手势识别与评估
Surg Endosc. 2025 Jun;39(6):3749-3759. doi: 10.1007/s00464-025-11730-4. Epub 2025 May 2.
4
Automated assessment of simulated laparoscopic surgical skill performance using deep learning.使用深度学习对模拟腹腔镜手术技能表现进行自动评估。
Sci Rep. 2025 Apr 19;15(1):13591. doi: 10.1038/s41598-025-96336-5.
5
Zero-shot prompt-based video encoder for surgical gesture recognition.用于手术手势识别的基于零样本提示的视频编码器
Int J Comput Assist Radiol Surg. 2025 Feb;20(2):311-321. doi: 10.1007/s11548-024-03257-1. Epub 2024 Sep 17.
6
Classification of subtask types and skill levels in robot-assisted surgery using EEG, eye-tracking, and machine learning.使用 EEG、眼动追踪和机器学习对机器人辅助手术中的子任务类型和技能水平进行分类。
Surg Endosc. 2024 Sep;38(9):5137-5147. doi: 10.1007/s00464-024-11049-6. Epub 2024 Jul 22.
7
Advancements in robotic surgery: innovations, challenges and future prospects.机器人手术的进展:创新、挑战与未来展望。
J Robot Surg. 2024 Jan 17;18(1):28. doi: 10.1007/s11701-023-01801-w.
8
Recognition and Prediction of Surgical Gestures and Trajectories Using Transformer Models in Robot-Assisted Surgery.在机器人辅助手术中使用Transformer模型识别和预测手术手势与轨迹
Rep U S. 2022 Oct;2022:8017-8024. doi: 10.1109/IROS47612.2022.9981611. Epub 2022 Dec 26.
9
A vision transformer for decoding surgeon activity from surgical videos.一种从手术视频中解码外科医生活动的视觉转换器。
Nat Biomed Eng. 2023 Jun;7(6):780-796. doi: 10.1038/s41551-023-01010-8. Epub 2023 Mar 30.
10
Multi-Stage Temporal Convolutional Network with Moment Loss and Positional Encoding for Surgical Phase Recognition.基于矩损失和位置编码的多阶段时间卷积网络用于手术阶段识别
Diagnostics (Basel). 2022 Dec 29;13(1):107. doi: 10.3390/diagnostics13010107.