Laboratory of Medical Physics, Medical School, National and Kapodistrian University of Athens, Mikras Asias 75 str., 11527, Athens, Greece.
Surg Endosc. 2018 Feb;32(2):553-568. doi: 10.1007/s00464-017-5878-1. Epub 2017 Oct 26.
In addition to its therapeutic benefits, minimally invasive surgery offers the potential for video recording of the operation. The videos may be archived and used later for reasons such as cognitive training, skills assessment, and workflow analysis. Methods from the major field of video content analysis and representation are increasingly applied in the surgical domain. In this paper, we review recent developments and analyze future directions in the field of content-based video analysis of surgical operations.
The review was obtained from PubMed and Google Scholar search on combinations of the following keywords: 'surgery', 'video', 'phase', 'task', 'skills', 'event', 'shot', 'analysis', 'retrieval', 'detection', 'classification', and 'recognition'. The collected articles were categorized and reviewed based on the technical goal sought, type of surgery performed, and structure of the operation.
A total of 81 articles were included. The publication activity is constantly increasing; more than 50% of these articles were published in the last 3 years. Significant research has been performed for video task detection and retrieval in eye surgery. In endoscopic surgery, the research activity is more diverse: gesture/task classification, skills assessment, tool type recognition, shot/event detection and retrieval. Recent works employ deep neural networks for phase and tool recognition as well as shot detection.
Content-based video analysis of surgical operations is a rapidly expanding field. Several future prospects for research exist including, inter alia, shot boundary detection, keyframe extraction, video summarization, pattern discovery, and video annotation. The development of publicly available benchmark datasets to evaluate and compare task-specific algorithms is essential.
微创外科除了具有治疗益处外,还具有操作视频记录的潜力。这些视频可以存档,并以后用于认知培训、技能评估和工作流程分析等目的。来自视频内容分析和表示主要领域的方法越来越多地应用于外科领域。在本文中,我们回顾了手术操作的基于内容的视频分析领域的最新发展,并分析了未来的方向。
通过对以下关键词的 PubMed 和 Google Scholar 搜索组合,获得了综述:“surgery”(手术)、“video”(视频)、“phase”(阶段)、“task”(任务)、“skills”(技能)、“event”(事件)、“shot”(镜头)、“analysis”(分析)、“retrieval”(检索)、“detection”(检测)、“classification”(分类)和“recognition”(识别)。收集的文章根据所寻求的技术目标、进行的手术类型和手术结构进行分类和综述。
共纳入 81 篇文章。发表活动不断增加;其中超过 50%的文章发表在过去 3 年。在眼科手术的视频任务检测和检索方面进行了大量研究。在内窥镜手术中,研究活动更加多样化:手势/任务分类、技能评估、工具类型识别、镜头/事件检测和检索。最近的工作使用深度神经网络进行阶段和工具识别以及镜头检测。
手术操作的基于内容的视频分析是一个快速发展的领域。未来的研究前景包括但不限于镜头边界检测、关键帧提取、视频摘要、模式发现和视频标注。开发用于评估和比较特定于任务的算法的公共可用基准数据集至关重要。