Suppr超能文献

眼外科手术视频中手术任务的实时识别。

Real-time recognition of surgical tasks in eye surgery videos.

机构信息

Inserm, UMR 1101, Brest F-29200, France.

INSTITUT Mines-Télécom, TELECOM Bretagne, UEB, Dpt ITI, Brest F-29200, France; Inserm, UMR 1101, Brest F-29200, France.

出版信息

Med Image Anal. 2014 Apr;18(3):579-90. doi: 10.1016/j.media.2014.02.007. Epub 2014 Feb 26.

Abstract

Nowadays, many surgeries, including eye surgeries, are video-monitored. We present in this paper an automatic video analysis system able to recognize surgical tasks in real-time. The proposed system relies on the Content-Based Video Retrieval (CBVR) paradigm. It characterizes short subsequences in the video stream and searches for video subsequences with similar structures in a video archive. Fixed-length feature vectors are built for each subsequence: the feature vectors are unchanged by variations in duration and temporal structure among the target surgical tasks. Therefore, it is possible to perform fast nearest neighbor searches in the video archive. The retrieved video subsequences are used to recognize the current surgical task by analogy reasoning. The system can be trained to recognize any surgical task using weak annotations only. It was applied to a dataset of 23 epiretinal membrane surgeries and a dataset of 100 cataract surgeries. Three surgical tasks were annotated in the first dataset. Nine surgical tasks were annotated in the second dataset. To assess its generality, the system was also applied to a dataset of 1,707 movie clips in which 12 human actions were annotated. High task recognition scores were measured in all three datasets. Real-time task recognition will be used in future works to communicate with surgeons (trainees in particular) or with surgical devices.

摘要

如今,许多手术,包括眼部手术,都采用视频监控。我们在本文中提出了一种能够实时识别手术任务的自动视频分析系统。所提出的系统依赖于基于内容的视频检索 (CBVR) 范例。它对视频流中的短子序列进行特征化,并在视频档案中搜索具有相似结构的视频子序列。为每个子序列构建固定长度的特征向量:特征向量不受目标手术任务在持续时间和时间结构方面的变化影响。因此,可以在视频档案中快速执行最近邻搜索。检索到的视频子序列通过类比推理用于识别当前手术任务。该系统可以通过仅使用弱注释来训练识别任何手术任务。它应用于 23 例视网膜后膜手术数据集和 100 例白内障手术数据集。第一个数据集标记了三个手术任务。第二个数据集标记了九个手术任务。为了评估其通用性,该系统还应用于一个包含 12 个人类动作的 1707 个电影剪辑数据集。在所有三个数据集上都测量到了高任务识别分数。实时任务识别将在未来的工作中用于与外科医生(特别是学员)或手术设备进行通信。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验