眼外科手术视频中手术任务的实时识别。

Real-time recognition of surgical tasks in eye surgery videos.

机构信息

Inserm, UMR 1101, Brest F-29200, France.

INSTITUT Mines-Télécom, TELECOM Bretagne, UEB, Dpt ITI, Brest F-29200, France; Inserm, UMR 1101, Brest F-29200, France.

出版信息

Med Image Anal. 2014 Apr;18(3):579-90. doi: 10.1016/j.media.2014.02.007. Epub 2014 Feb 26.

DOI:10.1016/j.media.2014.02.007

PMID:24637155

Abstract

Nowadays, many surgeries, including eye surgeries, are video-monitored. We present in this paper an automatic video analysis system able to recognize surgical tasks in real-time. The proposed system relies on the Content-Based Video Retrieval (CBVR) paradigm. It characterizes short subsequences in the video stream and searches for video subsequences with similar structures in a video archive. Fixed-length feature vectors are built for each subsequence: the feature vectors are unchanged by variations in duration and temporal structure among the target surgical tasks. Therefore, it is possible to perform fast nearest neighbor searches in the video archive. The retrieved video subsequences are used to recognize the current surgical task by analogy reasoning. The system can be trained to recognize any surgical task using weak annotations only. It was applied to a dataset of 23 epiretinal membrane surgeries and a dataset of 100 cataract surgeries. Three surgical tasks were annotated in the first dataset. Nine surgical tasks were annotated in the second dataset. To assess its generality, the system was also applied to a dataset of 1,707 movie clips in which 12 human actions were annotated. High task recognition scores were measured in all three datasets. Real-time task recognition will be used in future works to communicate with surgeons (trainees in particular) or with surgical devices.

摘要

如今，许多手术，包括眼部手术，都采用视频监控。我们在本文中提出了一种能够实时识别手术任务的自动视频分析系统。所提出的系统依赖于基于内容的视频检索 (CBVR) 范例。它对视频流中的短子序列进行特征化，并在视频档案中搜索具有相似结构的视频子序列。为每个子序列构建固定长度的特征向量：特征向量不受目标手术任务在持续时间和时间结构方面的变化影响。因此，可以在视频档案中快速执行最近邻搜索。检索到的视频子序列通过类比推理用于识别当前手术任务。该系统可以通过仅使用弱注释来训练识别任何手术任务。它应用于 23 例视网膜后膜手术数据集和 100 例白内障手术数据集。第一个数据集标记了三个手术任务。第二个数据集标记了九个手术任务。为了评估其通用性，该系统还应用于一个包含 12 个人类动作的 1707 个电影剪辑数据集。在所有三个数据集上都测量到了高任务识别分数。实时任务识别将在未来的工作中用于与外科医生（特别是学员）或手术设备进行通信。

相似文献

Real-time recognition of surgical tasks in eye surgery videos.眼外科手术视频中手术任务的实时识别。

Med Image Anal. 2014 Apr;18(3):579-90. doi: 10.1016/j.media.2014.02.007. Epub 2014 Feb 26.

Surgical gesture classification from video and kinematic data.基于视频和运动学数据的外科手势分类。

Med Image Anal. 2013 Oct;17(7):732-45. doi: 10.1016/j.media.2013.04.007. Epub 2013 Apr 28.

Real-time task recognition in cataract surgery videos using adaptive spatiotemporal polynomials.基于自适应时空多项式的白内障手术视频中的实时任务识别。

IEEE Trans Med Imaging. 2015 Apr;34(4):877-87. doi: 10.1109/TMI.2014.2366726. Epub 2014 Oct 31.

Markerless monocular tracking system for guided external eye surgery.用于引导性外眼手术的无标记单目跟踪系统。

Comput Med Imaging Graph. 2014 Dec;38(8):785-92. doi: 10.1016/j.compmedimag.2014.08.001. Epub 2014 Aug 23.

Robust face recognition from multi-view videos.从多角度视频中进行稳健的人脸识别。

IEEE Trans Image Process. 2014 Mar;23(3):1105-17. doi: 10.1109/TIP.2014.2300812.

A 3D shape constraint on video.视频上的三维形状约束

IEEE Trans Pattern Anal Mach Intell. 2006 Jun;28(6):1018-23. doi: 10.1109/TPAMI.2006.109.

Real-time segmentation and recognition of surgical tasks in cataract surgery videos.白内障手术视频中手术任务的实时分割与识别。

IEEE Trans Med Imaging. 2014 Dec;33(12):2352-60. doi: 10.1109/TMI.2014.2340473. Epub 2014 Jul 18.

Efficient subframe video alignment using short descriptors.利用短描述符实现高效子帧视频配准。

IEEE Trans Pattern Anal Mach Intell. 2013 Oct;35(10):2371-86. doi: 10.1109/TPAMI.2013.56.

Adaptive online performance evaluation of video trackers.视频跟踪器的自适应在线性能评估。

IEEE Trans Image Process. 2012 May;21(5):2812-23. doi: 10.1109/TIP.2011.2182520. Epub 2012 Jan 2.

Consistent depth maps recovery from a video sequence.从视频序列中恢复一致的深度图。

IEEE Trans Pattern Anal Mach Intell. 2009 Jun;31(6):974-88. doi: 10.1109/TPAMI.2009.52.

引用本文的文献

Deep learning-driven approach for cataract management: towards precise identification and predictive analytics.深度学习驱动的白内障管理方法：迈向精确识别和预测分析

Front Cell Dev Biol. 2025 May 30;13:1611216. doi: 10.3389/fcell.2025.1611216. eCollection 2025.

Transferable situation recognition system for scenario-independent context-aware surgical assistance systems: a proof of concept.用于与场景无关的情境感知手术辅助系统的可转移情境识别系统：概念验证

Int J Comput Assist Radiol Surg. 2025 Mar;20(3):579-590. doi: 10.1007/s11548-024-03283-z. Epub 2024 Nov 27.

Artificial Intelligence in Cataract Surgery: A Systematic Review.人工智能在白内障手术中的应用：系统评价。

Transl Vis Sci Technol. 2024 Apr 2;13(4):20. doi: 10.1167/tvst.13.4.20.

Multi-Stage Temporal Convolutional Network with Moment Loss and Positional Encoding for Surgical Phase Recognition.基于矩损失和位置编码的多阶段时间卷积网络用于手术阶段识别

Diagnostics (Basel). 2022 Dec 29;13(1):107. doi: 10.3390/diagnostics13010107.

State-of-the-art of situation recognition systems for intraoperative procedures.术中操作情境识别系统的最新技术。

Med Biol Eng Comput. 2022 Apr;60(4):921-939. doi: 10.1007/s11517-022-02520-4. Epub 2022 Feb 17.

PhacoTrainer: A Multicenter Study of Deep Learning for Activity Recognition in Cataract Surgical Videos.飞秒训练器：基于深度学习的白内障手术视频活动识别的多中心研究。

Transl Vis Sci Technol. 2021 Nov 1;10(13):23. doi: 10.1167/tvst.10.13.23.

LRTD: long-range temporal dependency based active learning for surgical workflow recognition.基于长程时间依赖的主动学习在手术流程识别中的应用

Int J Comput Assist Radiol Surg. 2020 Sep;15(9):1573-1584. doi: 10.1007/s11548-020-02198-9. Epub 2020 Jun 25.

Use of Artificial Intelligence for Medical Literature Search: Randomized Controlled Trial Using the Hackathon Format.利用人工智能进行医学文献检索：采用黑客松形式的随机对照试验

Interact J Med Res. 2020 Mar 30;9(1):e16606. doi: 10.2196/16606.

Surgical process modeling.手术过程建模

Innov Surg Sci. 2017 May 20;2(3):123-137. doi: 10.1515/iss-2017-0005. eCollection 2017 Sep.

Assessment of Automated Identification of Phases in Videos of Cataract Surgery Using Machine Learning and Deep Learning Techniques.使用机器学习和深度学习技术评估白内障手术视频中的相位自动识别。

JAMA Netw Open. 2019 Apr 5;2(4):e191860. doi: 10.1001/jamanetworkopen.2019.1860.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

眼外科手术视频中手术任务的实时识别。

Real-time recognition of surgical tasks in eye surgery videos.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献