• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于时间记忆关系网络的手术视频流程识别

Temporal Memory Relation Network for Workflow Recognition From Surgical Video.

出版信息

IEEE Trans Med Imaging. 2021 Jul;40(7):1911-1923. doi: 10.1109/TMI.2021.3069471. Epub 2021 Jun 30.

DOI:10.1109/TMI.2021.3069471
PMID:33780335
Abstract

Automatic surgical workflow recognition is a key component for developing context-aware computer-assisted systems in the operating theatre. Previous works either jointly modeled the spatial features with short fixed-range temporal information, or separately learned visual and long temporal cues. In this paper, we propose a novel end-to-end temporal memory relation network (TMRNet) for relating long-range and multi-scale temporal patterns to augment the present features. We establish a long-range memory bank to serve as a memory cell storing the rich supportive information. Through our designed temporal variation layer, the supportive cues are further enhanced by multi-scale temporal-only convolutions. To effectively incorporate the two types of cues without disturbing the joint learning of spatio-temporal features, we introduce a non-local bank operator to attentively relate the past to the present. In this regard, our TMRNet enables the current feature to view the long-range temporal dependency, as well as tolerate complex temporal extents. We have extensively validated our approach on two benchmark surgical video datasets, M2CAI challenge dataset and Cholec80 dataset. Experimental results demonstrate the outstanding performance of our method, consistently exceeding the state-of-the-art methods by a large margin (e.g., 67.0% v.s. 78.9% Jaccard on Cholec80 dataset).

摘要

自动手术流程识别是开发手术室内上下文感知计算机辅助系统的关键组成部分。以前的工作要么联合建模空间特征和短期固定范围的时间信息,要么分别学习视觉和长期时间线索。在本文中,我们提出了一种新颖的端到端时间记忆关系网络(TMRNet),用于将远程和多尺度时间模式相关联,以增强当前的特征。我们建立了一个远程记忆库,作为存储丰富支持信息的存储单元。通过我们设计的时间变化层,通过多尺度时间卷积进一步增强支持线索。为了在不干扰时空特征联合学习的情况下有效地合并这两种类型的线索,我们引入了一个非局部银行算子来关注过去与现在的关系。在这方面,我们的 TMRNet 使当前特征能够查看远程时间依赖关系,并容忍复杂的时间范围。我们在两个基准手术视频数据集,M2CAI 挑战赛数据集和 Cholec80 数据集上广泛验证了我们的方法。实验结果表明,我们的方法表现出色,始终比最先进的方法有很大的优势(例如,Cholec80 数据集上的 Jaccard 为 67.0%,而 78.9%)。

相似文献

1
Temporal Memory Relation Network for Workflow Recognition From Surgical Video.基于时间记忆关系网络的手术视频流程识别
IEEE Trans Med Imaging. 2021 Jul;40(7):1911-1923. doi: 10.1109/TMI.2021.3069471. Epub 2021 Jun 30.
2
SV-RCNet: Workflow Recognition From Surgical Videos Using Recurrent Convolutional Network.SV-RCNet:基于递归卷积网络的手术视频工作流程识别
IEEE Trans Med Imaging. 2018 May;37(5):1114-1126. doi: 10.1109/TMI.2017.2787657.
3
Semi-supervised learning with progressive unlabeled data excavation for label-efficient surgical workflow recognition.基于渐进式未标记数据挖掘的半监督学习在标签高效手术流程识别中的应用。
Med Image Anal. 2021 Oct;73:102158. doi: 10.1016/j.media.2021.102158. Epub 2021 Jul 8.
4
LRTD: long-range temporal dependency based active learning for surgical workflow recognition.基于长程时间依赖的主动学习在手术流程识别中的应用
Int J Comput Assist Radiol Surg. 2020 Sep;15(9):1573-1584. doi: 10.1007/s11548-020-02198-9. Epub 2020 Jun 25.
5
Temporal-based Swin Transformer network for workflow recognition of surgical video.用于手术视频工作流识别的基于时间的Swin Transformer网络
Int J Comput Assist Radiol Surg. 2023 Jan;18(1):139-147. doi: 10.1007/s11548-022-02785-y. Epub 2022 Nov 4.
6
Multi-task recurrent convolutional network with correlation loss for surgical video analysis.基于相关损失的多任务递归卷积网络在手术视频分析中的应用。
Med Image Anal. 2020 Jan;59:101572. doi: 10.1016/j.media.2019.101572. Epub 2019 Oct 10.
7
Trans-SVNet: hybrid embedding aggregation Transformer for surgical workflow analysis.跨模态 SVNet:用于手术流程分析的混合嵌入聚合 Transformer。
Int J Comput Assist Radiol Surg. 2022 Dec;17(12):2193-2202. doi: 10.1007/s11548-022-02743-8. Epub 2022 Sep 21.
8
Cascade Multi-Level Transformer Network for Surgical Workflow Analysis.级联多层变换网络用于手术流程分析。
IEEE Trans Med Imaging. 2023 Oct;42(10):2817-2831. doi: 10.1109/TMI.2023.3265354. Epub 2023 Oct 2.
9
Data-driven spatio-temporal RGBD feature encoding for action recognition in operating rooms.用于手术室动作识别的数据驱动时空RGB-D特征编码
Int J Comput Assist Radiol Surg. 2015 Jun;10(6):737-47. doi: 10.1007/s11548-015-1186-1. Epub 2015 Apr 7.
10
Against spatial-temporal discrepancy: contrastive learning-based network for surgical workflow recognition.对抗时空差异:基于对比学习的手术流程识别网络。
Int J Comput Assist Radiol Surg. 2021 May;16(5):839-848. doi: 10.1007/s11548-021-02382-5. Epub 2021 May 5.

引用本文的文献

1
A weakly supervised method for surgical scene components detection with visual foundation model.一种基于视觉基础模型的手术场景组件检测弱监督方法。
PLoS One. 2025 May 27;20(5):e0322751. doi: 10.1371/journal.pone.0322751. eCollection 2025.
2
Artificial intelligence-assisted phase recognition and skill assessment in laparoscopic surgery: a systematic review.腹腔镜手术中人工智能辅助的阶段识别与技能评估:一项系统综述
Front Surg. 2025 Apr 11;12:1551838. doi: 10.3389/fsurg.2025.1551838. eCollection 2025.
3
Automated surgical action recognition and competency assessment in laparoscopic cholecystectomy: a proof-of-concept study.
腹腔镜胆囊切除术中自动手术动作识别与能力评估:一项概念验证研究。
Surg Endosc. 2025 May;39(5):3006-3016. doi: 10.1007/s00464-025-11663-y. Epub 2025 Mar 21.
4
LoViT: Long Video Transformer for surgical phase recognition.LoViT:用于手术阶段识别的长视频 Transformer。
Med Image Anal. 2025 Jan;99:103366. doi: 10.1016/j.media.2024.103366. Epub 2024 Oct 5.
5
Automated segmentation of phases, steps, and tasks in laparoscopic cholecystectomy using deep learning.使用深度学习对腹腔镜胆囊切除术的阶段、步骤和任务进行自动分割。
Surg Endosc. 2024 Jan;38(1):158-170. doi: 10.1007/s00464-023-10482-3. Epub 2023 Nov 9.
6
Intelligent surgical workflow recognition for endoscopic submucosal dissection with real-time animal study.基于实时动物研究的内镜黏膜下剥离术智能手术流程识别。
Nat Commun. 2023 Oct 21;14(1):6676. doi: 10.1038/s41467-023-42451-8.
7
Bringing Artificial Intelligence to the operating room: edge computing for real-time surgical phase recognition.将人工智能引入手术室:边缘计算实现实时手术阶段识别。
Surg Endosc. 2023 Nov;37(11):8778-8784. doi: 10.1007/s00464-023-10322-4. Epub 2023 Aug 14.
8
Surgical Phase Recognition in Inguinal Hernia Repair-AI-Based Confirmatory Baseline and Exploration of Competitive Models.腹股沟疝修补术中手术阶段的识别——基于人工智能的确证性基线及竞争性模型探索
Bioengineering (Basel). 2023 May 27;10(6):654. doi: 10.3390/bioengineering10060654.
9
Surgical workflow recognition with temporal convolution and transformer for action segmentation.基于时间卷积和Transformer的手术流程识别用于动作分割
Int J Comput Assist Radiol Surg. 2023 Apr;18(4):785-794. doi: 10.1007/s11548-022-02811-z. Epub 2022 Dec 21.
10
Automatic Detection Algorithm of Football Events in Videos.足球视频事件的自动检测算法。
Comput Intell Neurosci. 2022 May 14;2022:2839244. doi: 10.1155/2022/2839244. eCollection 2022.