

Laparoscopic Video Analysis Using Temporal, Attention, and Multi-Feature Fusion Based-Approaches.

Affiliations

Institute of Technical Medicine (ITeM), Furtwangen University, 78054 Villingen-Schwenningen, Germany.

Innovation Center Computer Assisted Surgery (ICCAS), University of Leipzig, 04103 Leipzig, Germany.

Publication Information

Sensors (Basel). 2023 Feb 9;23(4):1958. doi: 10.3390/s23041958.

Abstract

Adapting intelligent context-aware systems (CAS) to future operating rooms (OR) aims to improve situational awareness and provide surgical decision support systems to medical teams. CAS analyzes data streams from available devices during surgery and communicates real-time knowledge to clinicians. Indeed, recent advances in computer vision and machine learning, particularly deep learning, paved the way for extensive research to develop CAS. In this work, a deep learning approach for analyzing laparoscopic videos for surgical phase recognition, tool classification, and weakly-supervised tool localization in laparoscopic videos was proposed. The ResNet-50 convolutional neural network (CNN) architecture was adapted by adding attention modules and fusing features from multiple stages to generate better-focused, generalized, and well-representative features. Then, a multi-map convolutional layer followed by tool-wise and spatial pooling operations was utilized to perform tool localization and generate tool presence confidences. Finally, the long short-term memory (LSTM) network was employed to model temporal information and perform tool classification and phase recognition. The proposed approach was evaluated on the Cholec80 dataset. The experimental results (i.e., 88.5% and 89.0% mean precision and recall for phase recognition, respectively, 95.6% mean average precision for tool presence detection, and a 70.1% F1-score for tool localization) demonstrated the ability of the model to learn discriminative features for all tasks. The performances revealed the importance of integrating attention modules and multi-stage feature fusion for more robust and precise detection of surgical phases and tools.


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f9b/9964851/565bdb9b9562/sensors-23-01958-g001.jpg
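The weakly-supervised localization step described in the abstract — a multi-map convolutional layer followed by tool-wise and spatial pooling that turns per-class activation maps into presence confidences — can be sketched as follows. This is a minimal NumPy illustration, not the paper's implementation: the number of maps per tool, the averaging for tool-wise pooling, and the spatial max pooling are assumptions chosen for clarity.

```python
import numpy as np

def multimap_pooling(feature_maps, num_tools, maps_per_tool):
    """Collapse multi-map conv output into localization maps and presence scores.

    feature_maps: array of shape (num_tools * maps_per_tool, H, W),
                  i.e. the output of a 1x1 "multi-map" convolution.
    Returns:
      loc_maps:    (num_tools, H, W) one localization map per tool
                   (tool-wise pooling: average over that tool's maps).
      confidences: (num_tools,) tool-presence score per tool
                   (spatial pooling: max over each localization map).
    """
    c, h, w = feature_maps.shape
    assert c == num_tools * maps_per_tool, "channel count must factor exactly"

    # Group channels by tool, then average within each group (tool-wise pooling).
    grouped = feature_maps.reshape(num_tools, maps_per_tool, h, w)
    loc_maps = grouped.mean(axis=1)

    # Spatial max pooling: the strongest activation anywhere in the map
    # serves as the tool-presence confidence for that frame.
    confidences = loc_maps.max(axis=(1, 2))
    return loc_maps, confidences
```

Because only frame-level tool-presence labels are needed to train such a head, the intermediate `loc_maps` provide localization "for free": the spatial position of the peak activation indicates where the tool appears in the frame.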
