• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

手术视频理解中的器械-组织相互作用检测框架。

Instrument-Tissue Interaction Detection Framework for Surgical Video Understanding.

出版信息

IEEE Trans Med Imaging. 2024 Aug;43(8):2803-2813. doi: 10.1109/TMI.2024.3381209. Epub 2024 Aug 1.

DOI:10.1109/TMI.2024.3381209
PMID:38530715
Abstract

Instrument-tissue interaction detection task, which helps understand surgical activities, is vital for constructing computer-assisted surgery systems but with many challenges. Firstly, most models represent instrument-tissue interaction in a coarse-grained way which only focuses on classification and lacks the ability to automatically detect instruments and tissues. Secondly, existing works do not fully consider relations between intra- and inter-frame of instruments and tissues. In the paper, we propose to represent instrument-tissue interaction as 〈 instrument class, instrument bounding box, tissue class, tissue bounding box, action class 〉 quintuple and present an Instrument-Tissue Interaction Detection Network (ITIDNet) to detect the quintuple for surgery videos understanding. Specifically, we propose a Snippet Consecutive Feature (SCF) Layer to enhance features by modeling relationships of proposals in the current frame using global context information in the video snippet. We also propose a Spatial Corresponding Attention (SCA) Layer to incorporate features of proposals between adjacent frames through spatial encoding. To reason relationships between instruments and tissues, a Temporal Graph (TG) Layer is proposed with intra-frame connections to exploit relationships between instruments and tissues in the same frame and inter-frame connections to model the temporal information for the same instance. For evaluation, we build a cataract surgery video (PhacoQ) dataset and a cholecystectomy surgery video (CholecQ) dataset. Experimental results demonstrate the promising performance of our model, which outperforms other state-of-the-art models on both datasets.

摘要

器械-组织相互作用检测任务有助于理解手术活动,对于构建计算机辅助手术系统至关重要,但也面临许多挑战。首先,大多数模型以粗粒度的方式表示器械-组织相互作用,仅关注分类,缺乏自动检测器械和组织的能力。其次,现有的工作没有充分考虑器械和组织在帧内和帧间的关系。在本文中,我们提出将器械-组织相互作用表示为〈器械类别、器械边界框、组织类别、组织边界框、动作类别〉五元组,并提出一种器械-组织相互作用检测网络(ITIDNet),用于检测手术视频理解中的五元组。具体来说,我们提出了一个 Snippet Consecutive Feature (SCF) 层,通过使用视频片段中的全局上下文信息来建模当前帧中提议之间的关系,从而增强特征。我们还提出了一个 Spatial Corresponding Attention (SCA) 层,通过空间编码将相邻帧之间的提议特征结合起来。为了推理器械和组织之间的关系,提出了一个 Temporal Graph (TG) 层,其中包含帧内连接,以利用同一帧中器械和组织之间的关系,以及帧间连接,以对同一实例的时间信息进行建模。为了进行评估,我们构建了白内障手术视频(PhacoQ)数据集和胆囊切除术手术视频(CholecQ)数据集。实验结果表明,我们的模型性能优异,在这两个数据集上均优于其他最先进的模型。

相似文献

1
Instrument-Tissue Interaction Detection Framework for Surgical Video Understanding.手术视频理解中的器械-组织相互作用检测框架。
IEEE Trans Med Imaging. 2024 Aug;43(8):2803-2813. doi: 10.1109/TMI.2024.3381209. Epub 2024 Aug 1.
2
Video-Instrument Synergistic Network for Referring Video Instrument Segmentation in Robotic Surgery.用于机器人手术中参考视频器械分割的视频-器械协同网络
IEEE Trans Med Imaging. 2024 Dec;43(12):4457-4469. doi: 10.1109/TMI.2024.3426953. Epub 2024 Dec 2.
3
DSTAN: A Deformable Spatial-temporal Attention Network with Bidirectional Sequence Feature Refinement for Speckle Noise Removal in Thyroid Ultrasound Video.DSTAN:一种具有双向序列特征细化的可变形时空注意力网络,用于去除甲状腺超声视频中的斑点噪声。
J Imaging Inform Med. 2024 Dec;37(6):3264-3281. doi: 10.1007/s10278-023-00935-5. Epub 2024 Jun 5.
4
LACOSTE: Exploiting stereo and temporal contexts for surgical instrument segmentation.拉科斯特:利用立体和时间上下文进行手术器械分割。
Med Image Anal. 2025 Jan;99:103387. doi: 10.1016/j.media.2024.103387. Epub 2024 Nov 12.
5
Patch-based adaptive weighting with segmentation and scale (PAWSS) for visual tracking in surgical video.用于手术视频视觉跟踪的基于补丁的带分割和尺度的自适应加权(PAWSS)
Med Image Anal. 2019 Oct;57:120-135. doi: 10.1016/j.media.2019.07.002. Epub 2019 Jul 4.
6
Dual-stage semantic segmentation of endoscopic surgical instruments.内窥镜手术器械的双阶段语义分割
Med Phys. 2024 Dec;51(12):9125-9137. doi: 10.1002/mp.17397. Epub 2024 Sep 10.
7
Enhancing space-time video super-resolution via spatial-temporal feature interaction.通过时空特征交互增强时空视频超分辨率
Neural Netw. 2025 Apr;184:107033. doi: 10.1016/j.neunet.2024.107033. Epub 2024 Dec 13.
8
SF-TMN: SlowFast temporal modeling network for surgical phase recognition.SF-TMN:用于手术阶段识别的慢快时变建模网络。
Int J Comput Assist Radiol Surg. 2024 May;19(5):871-880. doi: 10.1007/s11548-024-03095-1. Epub 2024 Mar 21.
9
Multi-level feature aggregation network for instrument identification of endoscopic images.用于内镜图像仪器识别的多层次特征聚合网络。
Phys Med Biol. 2020 Aug 31;65(16):165004. doi: 10.1088/1361-6560/ab8dda.
10
IG-Net: An Instrument-guided real-time semantic segmentation framework for prostate dissection during surgery for low rectal cancer.IG-Net:一种用于低位直肠癌手术中前列腺解剖的仪器引导实时语义分割框架。
Comput Methods Programs Biomed. 2024 Dec;257:108443. doi: 10.1016/j.cmpb.2024.108443. Epub 2024 Sep 28.