Optimizing transformer-based network via advanced decoder design for medical image segmentation.

Authors

Yang Weibin, Dong Zhiqi, Xu Mingyuan, Xu Longwei, Geng Dehua, Li Yusong, Wang Pengwei

Affiliation

School of Information Science and Engineering, Shandong University, Tsingtao, 266237, People's Republic of China.

Publication

Biomed Phys Eng Express. 2025 Feb 5;11(2). doi: 10.1088/2057-1976/adaec7.

DOI: 10.1088/2057-1976/adaec7
PMID: 39869936

Abstract

U-Net is widely used in medical image segmentation because of its simple and flexible architecture. To address the challenges of scale and complexity in medical tasks, several U-Net variants have been proposed. In particular, methods based on the Vision Transformer (ViT), represented by Swin UNETR, have gained widespread attention in recent years. However, these improvements often focus on the encoder and overlook the crucial role of the decoder in optimizing segmentation details. This design imbalance limits further gains in segmentation performance. To address this issue, we analyze the roles of various decoder components, including the upsampling method, the skip connection, and the feature extraction module, as well as the shortcomings of existing methods. We then propose Swin DER (i.e., Swin UNETR Decoder Enhanced and Refined), which specifically optimizes the design of these three components. Swin DER performs upsampling with a learnable interpolation algorithm called offset coordinate neighborhood weighted up sampling (Onsampling) and replaces the traditional skip connection with a spatial-channel parallel attention gate (SCP AG). Additionally, Swin DER introduces deformable convolution along with an attention mechanism in the feature extraction module of the decoder. Our model achieves excellent results, surpassing other state-of-the-art methods on both the Synapse dataset and the MSD brain tumor segmentation task. Code is available at:.

Similar articles

1
Optimizing transformer-based network via advanced decoder design for medical image segmentation.
Biomed Phys Eng Express. 2025 Feb 5;11(2). doi: 10.1088/2057-1976/adaec7.
2
Narrowing the semantic gaps in U-Net with learnable skip connections: The case of medical image segmentation.
Neural Netw. 2024 Oct;178:106546. doi: 10.1016/j.neunet.2024.106546. Epub 2024 Jul 17.
3
MSA-MaxNet: Multi-Scale Attention Enhanced Multi-Axis Vision Transformer Network for Medical Image Segmentation.
J Cell Mol Med. 2024 Dec;28(24):e70315. doi: 10.1111/jcmm.70315.
4
SwinCross: Cross-modal Swin transformer for head-and-neck tumor segmentation in PET/CT images.
Med Phys. 2024 Mar;51(3):2096-2107. doi: 10.1002/mp.16703. Epub 2023 Sep 30.
5
Swin Unet3D: a three-dimensional medical image segmentation network combining vision transformer and convolution.
BMC Med Inform Decis Mak. 2023 Feb 14;23(1):33. doi: 10.1186/s12911-023-02129-z.
6
Efficient brain tumor segmentation using Swin transformer and enhanced local self-attention.
Int J Comput Assist Radiol Surg. 2024 Feb;19(2):273-281. doi: 10.1007/s11548-023-03024-8. Epub 2023 Oct 5.
7
SwinBTS: A Method for 3D Multimodal Brain Tumor Segmentation Using Swin Transformer.
Brain Sci. 2022 Jun 17;12(6):797. doi: 10.3390/brainsci12060797.
8
DECTNet: Dual Encoder Network combined convolution and Transformer architecture for medical image segmentation.
PLoS One. 2024 Apr 4;19(4):e0301019. doi: 10.1371/journal.pone.0301019. eCollection 2024.
9
VSmTrans: A hybrid paradigm integrating self-attention and convolution for 3D medical image segmentation.
Med Image Anal. 2024 Dec;98:103295. doi: 10.1016/j.media.2024.103295. Epub 2024 Aug 24.
10
MG-Net: A fetal brain tissue segmentation method based on multiscale feature fusion and graph convolution attention mechanisms.
Comput Methods Programs Biomed. 2024 Dec;257:108451. doi: 10.1016/j.cmpb.2024.108451. Epub 2024 Oct 5.

Cited by

1
Leveraging advanced feature extraction for improved kidney biopsy segmentation.
Front Med (Lausanne). 2025 Jun 18;12:1591999. doi: 10.3389/fmed.2025.1591999. eCollection 2025.