用于自动腹腔镜视频数据库组织的分类方法。

Classification approach for automatic laparoscopic video database organization.

作者信息

Twinanda Andru Putra, Marescaux Jacques, de Mathelin Michel, Padoy Nicolas

机构信息

ICube Laboratory, University of Strasbourg, CNRS, IHU, Strasbourg, France,

出版信息

Int J Comput Assist Radiol Surg. 2015 Sep;10(9):1449-60. doi: 10.1007/s11548-015-1183-4. Epub 2015 Apr 7.

DOI:10.1007/s11548-015-1183-4

PMID:25847668

Abstract

PURPOSE

One of the advantages of minimally invasive surgery (MIS) is that the underlying digitization provides invaluable information regarding the execution of procedures in various patient-specific conditions. However, such information can only be obtained conveniently if the laparoscopic video database comes with semantic annotations, which are typically provided manually by experts. Considering the growing popularity of MIS, manual annotation becomes a laborious and costly task. In this paper, we tackle the problem of laparoscopic video classification, which consists of automatically identifying the type of abdominal surgery performed in a video. In addition to performing classifications on the full recordings of the procedures, we also carry out sub-video and video clip classifications. These classifications are carried out to investigate how many frames from a video are needed to get a good classification performance and which parts of the procedures contain more discriminative features.

METHOD

Our classification pipeline is as follows. First, we reject the irrelevant frames from the videos using the color properties of the video frames. Second, we extract visual features from the relevant frames. Third, we quantize the features using several feature encoding methods, i.e., vector quantization, sparse coding (SC), and Fisher encoding. Fourth, we carry out the classification using support vector machines. While the sub-video classification is carried out by uniformly downsampling the video frames, the video clip classification is carried out by taking three parts of the videos (i.e., beginning, middle, and end) and running the classification pipeline separately for every video part. Ultimately, we build our final classification model by combining the features using a multiple kernel learning (MKL) approach.

RESULTS

To carry out the experiments, we use a dataset containing 208 videos of eight different surgeries performed by 10 different surgeons. The results show that SC with K-singular value decomposition (K-SVD) yields the best classification accuracy. The results also demonstrate that the classification accuracy only decreases by 3 % when solely 60 % of the video frames are utilized. Furthermore, it is also shown that the end part of the procedures is the most discriminative part of the surgery. Specifically, by using only the last 20 % of the video frames, a classification accuracy greater than 70 % can be achieved. Finally, the combination of all features yields the best performance of 90.38 % accuracy.

CONCLUSIONS

The SC with K-SVD provides the best representation of our videos, yielding the best accuracies for all features. In terms of information, the end part of the laparoscopic videos is the most discriminative compared to the other parts of the videos. In addition to their good performance individually, the features yield even better classification results when all of them are combined using the MKL approach.

摘要

目的

微创手术（MIS）的优势之一在于其潜在的数字化特性能够提供有关在各种特定患者条件下手术执行情况的宝贵信息。然而，只有当腹腔镜视频数据库带有语义注释时，此类信息才能方便地获取，而语义注释通常由专家手动提供。鉴于MIS的日益普及，手动注释成为一项艰巨且成本高昂的任务。在本文中，我们着手解决腹腔镜视频分类问题，即自动识别视频中所进行的腹部手术类型。除了对手术的完整记录进行分类之外，我们还进行子视频和视频片段分类。进行这些分类是为了研究视频需要多少帧才能获得良好的分类性能，以及手术的哪些部分包含更多的判别特征。

方法

我们的分类流程如下。首先，利用视频帧的颜色属性去除视频中的无关帧。其次，从相关帧中提取视觉特征。第三，使用多种特征编码方法对特征进行量化，即矢量量化、稀疏编码（SC）和Fisher编码。第四，使用支持向量机进行分类。子视频分类通过对视频帧进行均匀下采样来实现，而视频片段分类则通过选取视频的三个部分（即开头、中间和结尾）并分别对每个视频部分运行分类流程来进行。最终，我们使用多核学习（MKL）方法组合特征来构建最终的分类模型。

结果

为了进行实验，我们使用了一个包含由10位不同外科医生进行的8种不同手术的208个视频的数据集。结果表明，采用K奇异值分解（K-SVD）的SC产生了最佳的分类准确率。结果还表明，当仅使用60%的视频帧时，分类准确率仅下降3%。此外，还表明手术的结尾部分是手术中最具判别力的部分。具体而言，仅使用视频帧的最后20%，就可以实现大于70%的分类准确率。最后，所有特征的组合产生了90.

相似文献

Classification approach for automatic laparoscopic video database organization.用于自动腹腔镜视频数据库组织的分类方法。

Int J Comput Assist Radiol Surg. 2015 Sep;10(9):1449-60. doi: 10.1007/s11548-015-1183-4. Epub 2015 Apr 7.

Fisher kernel based task boundary retrieval in laparoscopic database with single video query.基于费舍尔核的单视频查询腹腔镜数据库任务边界检索

Med Image Comput Comput Assist Interv. 2014;17(Pt 3):409-16. doi: 10.1007/978-3-319-10443-0_52.

Surgical gesture classification from video and kinematic data.基于视频和运动学数据的外科手势分类。

Med Image Anal. 2013 Oct;17(7):732-45. doi: 10.1016/j.media.2013.04.007. Epub 2013 Apr 28.

Keyframe extraction from laparoscopic videos based on visual saliency detection.基于视觉显著性检测的腹腔镜视频关键帧提取。

Comput Methods Programs Biomed. 2018 Oct;165:13-23. doi: 10.1016/j.cmpb.2018.07.004. Epub 2018 Jul 18.

Automatic detection of informative frames from wireless capsule endoscopy images.无线胶囊内窥镜图像中信息帧的自动检测。

Med Image Anal. 2010 Jun;14(3):449-70. doi: 10.1016/j.media.2009.12.001. Epub 2010 Jan 4.

LRTD: long-range temporal dependency based active learning for surgical workflow recognition.基于长程时间依赖的主动学习在手术流程识别中的应用

Int J Comput Assist Radiol Surg. 2020 Sep;15(9):1573-1584. doi: 10.1007/s11548-020-02198-9. Epub 2020 Jun 25.

Visual event recognition in videos by learning from Web data.从网络数据中学习的视频中视觉事件识别。

IEEE Trans Pattern Anal Mach Intell. 2012 Sep;34(9):1667-80. doi: 10.1109/TPAMI.2011.265.

Tiny videos: a large data set for nonparametric video retrieval and frame classification.微小视频：用于非参数视频检索和帧分类的大数据集。

IEEE Trans Pattern Anal Mach Intell. 2011 Mar;33(3):618-30. doi: 10.1109/TPAMI.2010.118.

Video-based 3D reconstruction, laparoscope localization and deformation recovery for abdominal minimally invasive surgery: a survey.用于腹部微创手术的基于视频的3D重建、腹腔镜定位及变形恢复：综述

Int J Med Robot. 2016 Jun;12(2):158-78. doi: 10.1002/rcs.1661. Epub 2015 Apr 30.

Pornography classification: The hidden clues in video space-time.色情内容分类：视频时空里的隐藏线索。

Forensic Sci Int. 2016 Nov;268:46-61. doi: 10.1016/j.forsciint.2016.09.010. Epub 2016 Sep 21.

引用本文的文献

Use of artificial intelligence in the analysis of digital videos of invasive surgical procedures: scoping review.人工智能在侵入性外科手术数字视频分析中的应用：范围综述。

BJS Open. 2025 Jul 1;9(4). doi: 10.1093/bjsopen/zraf073.

Preserving privacy in surgical video analysis using a deep learning classifier to identify out-of-body scenes in endoscopic videos.使用深度学习分类器在手术视频分析中保护隐私，以识别内窥镜视频中的体外场景。

Sci Rep. 2023 Jun 7;13(1):9235. doi: 10.1038/s41598-023-36453-1.

Multispectral Image under Tissue Classification Algorithm in Screening of Cervical Cancer.多光谱图像在宫颈癌筛查中的组织分类算法。

J Healthc Eng. 2022 Jan 7;2022:9048123. doi: 10.1155/2022/9048123. eCollection 2022.

Utilising an Accelerated Delphi Process to Develop Guidance and Protocols for Telepresence Applications in Remote Robotic Surgery Training.利用加速德尔菲法制定远程机器人手术培训中远程呈现应用的指南和协议。

Eur Urol Open Sci. 2020 Nov 6;22:23-33. doi: 10.1016/j.euros.2020.09.005. eCollection 2020 Dec.

Video content analysis of surgical procedures.手术过程的视频内容分析。

Surg Endosc. 2018 Feb;32(2):553-568. doi: 10.1007/s00464-017-5878-1. Epub 2017 Oct 26.

Shot boundary detection in endoscopic surgery videos using a variational Bayesian framework.基于变分贝叶斯框架的内镜手术视频镜头边界检测

Int J Comput Assist Radiol Surg. 2016 Nov;11(11):1937-1949. doi: 10.1007/s11548-016-1431-2. Epub 2016 Jun 11.

System events: readily accessible features for surgical phase detection.系统事件：用于手术阶段检测的易于访问的功能。

Int J Comput Assist Radiol Surg. 2016 Jun;11(6):1201-9. doi: 10.1007/s11548-016-1409-0. Epub 2016 May 13.

本文引用的文献

Fisher kernel based task boundary retrieval in laparoscopic database with single video query.基于费舍尔核的单视频查询腹腔镜数据库任务边界检索

Med Image Comput Comput Assist Interv. 2014;17(Pt 3):409-16. doi: 10.1007/978-3-319-10443-0_52.

Good practice in large-scale learning for image classification.大规模图像分类学习的良好实践。

IEEE Trans Pattern Anal Mach Intell. 2014 Mar;36(3):507-20. doi: 10.1109/TPAMI.2013.146.

Surgical gesture classification from video and kinematic data.基于视频和运动学数据的外科手势分类。

Med Image Anal. 2013 Oct;17(7):732-45. doi: 10.1016/j.media.2013.04.007. Epub 2013 Apr 28.

Feature classification for tracking articulated surgical tools.用于跟踪关节式手术工具的特征分类

Med Image Comput Comput Assist Interv. 2012;15(Pt 2):592-600. doi: 10.1007/978-3-642-33418-4_73.

A framework for the recognition of high-level surgical tasks from video images for cataract surgeries.用于从视频图像中识别白内障手术中高级手术任务的框架。

IEEE Trans Biomed Eng. 2012 Apr;59(4):966-76. doi: 10.1109/TBME.2011.2181168. Epub 2011 Dec 23.

Endoscopic video manifolds for targeted optical biopsy.内镜视频流形用于靶向光学活检。

IEEE Trans Med Imaging. 2012 Mar;31(3):637-53. doi: 10.1109/TMI.2011.2174252. Epub 2011 Nov 2.

Modeling and segmentation of surgical workflow from laparoscopic video.基于腹腔镜视频的手术工作流程建模与分割

Med Image Comput Comput Assist Interv. 2010;13(Pt 3):400-7. doi: 10.1007/978-3-642-15711-0_50.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

用于自动腹腔镜视频数据库组织的分类方法。

Classification approach for automatic laparoscopic video database organization.

作者信息

机构信息

出版信息

PURPOSE

METHOD

RESULTS

CONCLUSIONS

目的

方法

结果

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献