• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

LSKANet:用于机器人手术场景分割的长条形核注意力网络。

LSKANet: Long Strip Kernel Attention Network for Robotic Surgical Scene Segmentation.

出版信息

IEEE Trans Med Imaging. 2024 Apr;43(4):1308-1322. doi: 10.1109/TMI.2023.3335406. Epub 2024 Apr 3.

DOI:10.1109/TMI.2023.3335406
PMID:38015689
Abstract

Surgical scene segmentation is a critical task in Robotic-assisted surgery. However, the complexity of the surgical scene, which mainly includes local feature similarity (e.g., between different anatomical tissues), intraoperative complex artifacts, and indistinguishable boundaries, poses significant challenges to accurate segmentation. To tackle these problems, we propose the Long Strip Kernel Attention network (LSKANet), including two well-designed modules named Dual-block Large Kernel Attention module (DLKA) and Multiscale Affinity Feature Fusion module (MAFF), which can implement precise segmentation of surgical images. Specifically, by introducing strip convolutions with different topologies (cascaded and parallel) in two blocks and a large kernel design, DLKA can make full use of region- and strip-like surgical features and extract both visual and structural information to reduce the false segmentation caused by local feature similarity. In MAFF, affinity matrices calculated from multiscale feature maps are applied as feature fusion weights, which helps to address the interference of artifacts by suppressing the activations of irrelevant regions. Besides, the hybrid loss with Boundary Guided Head (BGH) is proposed to help the network segment indistinguishable boundaries effectively. We evaluate the proposed LSKANet on three datasets with different surgical scenes. The experimental results show that our method achieves new state-of-the-art results on all three datasets with improvements of 2.6%, 1.4%, and 3.4% mIoU, respectively. Furthermore, our method is compatible with different backbones and can significantly increase their segmentation accuracy. Code is available at https://github.com/YubinHan73/LSKANet.

摘要

手术场景分割是机器人辅助手术中的关键任务。然而,手术场景的复杂性主要包括局部特征相似性(例如,不同解剖组织之间)、术中复杂伪影和难以区分的边界,这给准确分割带来了重大挑战。为了解决这些问题,我们提出了长带核注意力网络(LSKANet),包括两个精心设计的模块,分别是双块大核注意力模块(DLKA)和多尺度亲和特征融合模块(MAFF),可以实现手术图像的精确分割。具体来说,通过在两个块中引入具有不同拓扑结构(级联和并行)的带卷积和大核设计,DLKA 可以充分利用区域和带状手术特征,并提取视觉和结构信息,以减少局部特征相似性引起的错误分割。在 MAFF 中,从多尺度特征图计算的亲和矩阵被用作特征融合权重,有助于通过抑制不相关区域的激活来消除伪影的干扰。此外,还提出了具有边界引导头(BGH)的混合损失,以帮助网络有效地分割难以区分的边界。我们在具有不同手术场景的三个数据集上评估了所提出的 LSKANet。实验结果表明,我们的方法在所有三个数据集上都取得了新的最先进的结果,分别提高了 2.6%、1.4%和 3.4%的 mIoU。此外,我们的方法与不同的骨干网络兼容,可以显著提高它们的分割精度。代码可在 https://github.com/YubinHan73/LSKANet 上获得。

相似文献

1
LSKANet: Long Strip Kernel Attention Network for Robotic Surgical Scene Segmentation.LSKANet:用于机器人手术场景分割的长条形核注意力网络。
IEEE Trans Med Imaging. 2024 Apr;43(4):1308-1322. doi: 10.1109/TMI.2023.3335406. Epub 2024 Apr 3.
2
Branch Aggregation Attention Network for Robotic Surgical Instrument Segmentation.分支聚合注意力网络在手术器械分割中的应用。
IEEE Trans Med Imaging. 2023 Nov;42(11):3408-3419. doi: 10.1109/TMI.2023.3288127. Epub 2023 Oct 27.
3
An attention-guided network for surgical instrument segmentation from endoscopic images.基于注意力引导的内窥镜图像手术器械分割网络。
Comput Biol Med. 2022 Dec;151(Pt A):106216. doi: 10.1016/j.compbiomed.2022.106216. Epub 2022 Oct 24.
4
Fast instruments and tissues segmentation of micro-neurosurgical scene using high correlative non-local network.使用高相关非局部网络快速分割微神经外科场景中的器械和组织。
Comput Biol Med. 2023 Feb;153:106531. doi: 10.1016/j.compbiomed.2022.106531. Epub 2023 Jan 3.
5
CGBA-Net: context-guided bidirectional attention network for surgical instrument segmentation.CGBA-Net:用于手术器械分割的上下文引导双向注意网络。
Int J Comput Assist Radiol Surg. 2023 Oct;18(10):1769-1781. doi: 10.1007/s11548-023-02906-1. Epub 2023 May 18.
6
Automatic segmentation of spine x-ray images based on multiscale feature enhancement network.基于多尺度特征增强网络的脊柱 X 射线图像自动分割。
Med Phys. 2024 Oct;51(10):7282-7294. doi: 10.1002/mp.17278. Epub 2024 Jun 30.
7
TGDAUNet: Transformer and GCNN based dual-branch attention UNet for medical image segmentation.TGDAUNet:基于 Transformer 和 GCNN 的双分支注意力 U-Net 用于医学图像分割。
Comput Biol Med. 2023 Dec;167:107583. doi: 10.1016/j.compbiomed.2023.107583. Epub 2023 Oct 21.
8
PA-ResSeg: A phase attention residual network for liver tumor segmentation from multiphase CT images.PA-ResSeg:一种用于多期 CT 图像中肝脏肿瘤分割的相位注意残差网络。
Med Phys. 2021 Jul;48(7):3752-3766. doi: 10.1002/mp.14922. Epub 2021 May 30.
9
MCAFNet: multiscale cross-layer attention fusion network for honeycomb lung lesion segmentation.MCAFNet:用于蜂窝状肺病变分割的多尺度跨层注意力融合网络
Med Biol Eng Comput. 2024 Apr;62(4):1121-1137. doi: 10.1007/s11517-023-02995-9. Epub 2023 Dec 27.
10
Medical image segmentation using boundary-enhanced guided packet rotation dual attention decoder network.基于边界增强引导包旋转双注意力解码器网络的医学图像分割。
Technol Health Care. 2022;30(1):129-143. doi: 10.3233/THC-202789.

引用本文的文献

1
Medical image segmentation model based on local enhancement driven global optimization.基于局部增强驱动全局优化的医学图像分割模型
Sci Rep. 2025 May 25;15(1):18281. doi: 10.1038/s41598-025-02393-1.
2
Object detection model design for tiny road surface damage.微小路面损伤的目标检测模型设计
Sci Rep. 2025 Apr 1;15(1):11032. doi: 10.1038/s41598-025-95502-z.