• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于手术视频中视频语义分割的时空网络。

A spatio-temporal network for video semantic segmentation in surgical videos.

机构信息

Medtronic plc, London, UK.

Wellcome/EPSRC Centre for Interventional and Surgical Sciences, University College London, London, UK.

出版信息

Int J Comput Assist Radiol Surg. 2024 Feb;19(2):375-382. doi: 10.1007/s11548-023-02971-6. Epub 2023 Jun 22.

DOI:10.1007/s11548-023-02971-6
PMID:37347345
Abstract

PURPOSE

Semantic segmentation in surgical videos has applications in intra-operative guidance, post-operative analytics and surgical education. Models need to provide accurate predictions since temporally inconsistent identification of anatomy can hinder patient safety. We propose a novel architecture for modelling temporal relationships in videos to address these issues.

METHODS

We developed a temporal segmentation model that includes a static encoder and a spatio-temporal decoder. The encoder processes individual frames whilst the decoder learns spatio-temporal relationships from frame sequences. The decoder can be used with any suitable encoder to improve temporal consistency.

RESULTS

Model performance was evaluated on the CholecSeg8k dataset and a private dataset of robotic Partial Nephrectomy procedures. Mean Intersection over Union improved by 1.30% and 4.27% respectively for each dataset when the temporal decoder was applied. Our model also displayed improvements in temporal consistency up to 7.23%.

CONCLUSIONS

This work demonstrates an advance in video segmentation of surgical scenes with potential applications in surgery with a view to improve patient outcomes. The proposed decoder can extend state-of-the-art static models, and it is shown that it can improve per-frame segmentation output and video temporal consistency.

摘要

目的

手术视频中的语义分割在术中指导、术后分析和手术教育中具有应用价值。由于对解剖结构的时间不一致识别可能会妨碍患者安全,因此模型需要提供准确的预测。我们提出了一种新的架构来解决这些问题,用于对视频中的时间关系进行建模。

方法

我们开发了一种时间分割模型,该模型包括静态编码器和时空解码器。编码器处理单个帧,而解码器则从帧序列中学习时空关系。该解码器可以与任何合适的编码器一起使用,以提高时间一致性。

结果

在 CholecSeg8k 数据集和机器人部分肾切除术的私人数据集上评估了模型性能。当应用时间解码器时,每个数据集的平均交并比分别提高了 1.30%和 4.27%。我们的模型还显示出在时间一致性方面提高了 7.23%。

结论

这项工作展示了在手术场景视频分割方面的进展,具有改善患者预后的手术应用潜力。所提出的解码器可以扩展最新的静态模型,并且已经表明它可以提高每帧分割输出和视频时间一致性。

相似文献

1
A spatio-temporal network for video semantic segmentation in surgical videos.用于手术视频中视频语义分割的时空网络。
Int J Comput Assist Radiol Surg. 2024 Feb;19(2):375-382. doi: 10.1007/s11548-023-02971-6. Epub 2023 Jun 22.
2
U-NetPlus: A Modified Encoder-Decoder U-Net Architecture for Semantic and Instance Segmentation of Surgical Instruments from Laparoscopic Images.U-NetPlus:一种用于从腹腔镜图像中对手术器械进行语义和实例分割的改进型编码器-解码器U-Net架构。
Annu Int Conf IEEE Eng Med Biol Soc. 2019 Jul;2019:7205-7211. doi: 10.1109/EMBC.2019.8856791.
3
Feature-guided attention network for medical image segmentation.基于特征引导的注意力网络的医学图像分割。
Med Phys. 2023 Aug;50(8):4871-4886. doi: 10.1002/mp.16253. Epub 2023 Feb 16.
4
An Unified Recurrent Video Object Segmentation Framework for Various Surveillance Environments.一种适用于各种监控环境的统一递归视频对象分割框架。
IEEE Trans Image Process. 2021;30:7889-7902. doi: 10.1109/TIP.2021.3108405. Epub 2021 Sep 20.
5
Bilateral attention decoder: A lightweight decoder for real-time semantic segmentation.双边注意解码器:用于实时语义分割的轻量级解码器。
Neural Netw. 2021 May;137:188-199. doi: 10.1016/j.neunet.2021.01.021. Epub 2021 Jan 30.
6
Mask then classify: multi-instance segmentation for surgical instruments.掩膜分类:手术器械的多实例分割。
Int J Comput Assist Radiol Surg. 2021 Jul;16(7):1227-1236. doi: 10.1007/s11548-021-02404-2. Epub 2021 Jun 18.
7
High Efficiency Video Coding (HEVC)-Based Surgical Telementoring System Using Shallow Convolutional Neural Network.基于高效视频编码 (HEVC) 的浅层卷积神经网络手术远程指导系统。
J Digit Imaging. 2019 Dec;32(6):1027-1043. doi: 10.1007/s10278-019-00206-2.
8
ST-MTL: Spatio-Temporal multitask learning model to predict scanpath while tracking instruments in robotic surgery.ST-MTL:用于在机器人手术中跟踪器械时预测扫描路径的时空多任务学习模型。
Med Image Anal. 2021 Jan;67:101837. doi: 10.1016/j.media.2020.101837. Epub 2020 Oct 15.
9
Refined Feature-based Multi-frame and Multi-scale Fusing Gate network for accurate segmentation of plaques in ultrasound videos.基于精炼特征的多帧和多尺度融合门控网络,用于超声视频中斑块的精确分割。
Comput Biol Med. 2023 Sep;163:107091. doi: 10.1016/j.compbiomed.2023.107091. Epub 2023 Jun 7.
10
CaDIS: Cataract dataset for surgical RGB-image segmentation.CaDIS:用于手术 RGB 图像分割的白内障数据集。
Med Image Anal. 2021 Jul;71:102053. doi: 10.1016/j.media.2021.102053. Epub 2021 Mar 31.

引用本文的文献

1
Kinematic Adaptive Frame Recognition (KAFR): A Novel Framework for Video Segmentation via Frame Similarity and Surgical Tool Tracking.运动自适应帧识别(KAFR):一种通过帧相似度和手术工具跟踪进行视频分割的新型框架。
IEEE Access. 2025;13:101681-101697. doi: 10.1109/access.2025.3573264. Epub 2025 May 23.
2
Vision techniques for anatomical structures in laparoscopic surgery: a comprehensive review.腹腔镜手术中解剖结构的视觉技术:全面综述。
Front Surg. 2025 Apr 14;12:1557153. doi: 10.3389/fsurg.2025.1557153. eCollection 2025.
3
Training-free temporal object tracking in surgical videos.

本文引用的文献

1
Advancing Surgical Education: The Use of Artificial Intelligence in Surgical Training.推进外科教育:人工智能在外科培训中的应用。
Am Surg. 2023 Jan;89(1):49-54. doi: 10.1177/00031348221101503. Epub 2022 May 15.
2
Artificial Intelligence in Surgery: Promises and Perils.人工智能在外科手术中的应用:前景与风险。
Ann Surg. 2018 Jul;268(1):70-76. doi: 10.1097/SLA.0000000000002693.
手术视频中无需训练的时域目标跟踪
Int J Comput Assist Radiol Surg. 2025 Jun;20(6):1067-1075. doi: 10.1007/s11548-025-03349-6. Epub 2025 Apr 1.
4
Neural fields for 3D tracking of anatomy and surgical instruments in monocular laparoscopic video clips.用于单目腹腔镜视频片段中解剖结构和手术器械三维跟踪的神经场
Healthc Technol Lett. 2024 Dec 12;11(6):411-417. doi: 10.1049/htl2.12113. eCollection 2024 Dec.
5
Improving Surgical Scene Semantic Segmentation through a Deep Learning Architecture with Attention to Class Imbalance.通过关注类别不平衡的深度学习架构改进手术场景语义分割
Biomedicines. 2024 Jun 13;12(6):1309. doi: 10.3390/biomedicines12061309.
6
Towards better laparoscopic video segmentation: A class-wise contrastive learning approach with multi-scale feature extraction.迈向更好的腹腔镜视频分割:一种具有多尺度特征提取的逐类对比学习方法。
Healthc Technol Lett. 2024 Jan 13;11(2-3):126-136. doi: 10.1049/htl2.12069. eCollection 2024 Apr-Jun.