使用可学习的跳过连接缩小 U-Net 中的语义差距：以医学图像分割为例。

Narrowing the semantic gaps in U-Net with learnable skip connections: The case of medical image segmentation.

机构信息

School of Computer Science and Engineering, Northeastern University, Shenyang, China; Key Laboratory of Intelligent Computing in Medical Image of Ministry of Education, Northeastern University, Shenyang, China.

出版信息

Neural Netw. 2024 Oct;178:106546. doi: 10.1016/j.neunet.2024.106546. Epub 2024 Jul 17.

DOI:10.1016/j.neunet.2024.106546

PMID:39053196

Abstract

Current state-of-the-art medical image segmentation techniques predominantly employ the encoder-decoder architecture. Despite its widespread use, this U-shaped framework exhibits limitations in effectively capturing multi-scale features through simple skip connections. In this study, we made a thorough analysis to investigate the potential weaknesses of connections across various segmentation tasks, and suggest two key aspects of potential semantic gaps crucial to be considered: the semantic gap among multi-scale features in different encoding stages and the semantic gap between the encoder and the decoder. To bridge these semantic gaps, we introduce a novel segmentation framework, which incorporates a Dual Attention Transformer module for capturing channel-wise and spatial-wise relationships, and a Decoder-guided Recalibration Attention module for fusing DAT tokens and decoder features. These modules establish a principle of learnable connection that resolves the semantic gaps, leading to a high-performance segmentation model for medical images. Furthermore, it provides a new paradigm for effectively incorporating the attention mechanism into the traditional convolution-based architecture. Comprehensive experimental results demonstrate that our model achieves consistent, significant gains and outperforms state-of-the-art methods with relatively fewer parameters. This study contributes to the advancement of medical image segmentation by offering a more effective and efficient framework for addressing the limitations of current encoder-decoder architectures. Code: https://github.com/McGregorWwww/UDTransNet.

摘要

当前最先进的医学图像分割技术主要采用编码器-解码器架构。尽管这种 U 型框架被广泛应用，但它通过简单的跳过连接来有效捕获多尺度特征的能力有限。在这项研究中，我们进行了全面的分析，研究了跨各种分割任务的连接的潜在弱点，并提出了两个需要考虑的关键潜在语义差距方面：不同编码阶段的多尺度特征之间的语义差距，以及编码器和解码器之间的语义差距。为了弥合这些语义差距，我们引入了一种新的分割框架，该框架包含一个双注意转换器模块，用于捕获通道和空间关系，以及一个解码器引导的再校准注意模块，用于融合 DAT 令牌和解码器特征。这些模块建立了一个可学习连接的原则，解决了语义差距问题，为医学图像提供了一个高性能的分割模型。此外，它为有效地将注意力机制融入传统的基于卷积的架构提供了一个新的范例。全面的实验结果表明，我们的模型在具有相对较少参数的情况下实现了一致的、显著的增益，并优于最先进的方法。这项研究通过提供一个更有效和高效的框架来解决当前编码器-解码器架构的局限性，为医学图像分割的发展做出了贡献。代码：https://github.com/McGregorWwww/UDTransNet。

相似文献

Narrowing the semantic gaps in U-Net with learnable skip connections: The case of medical image segmentation.使用可学习的跳过连接缩小 U-Net 中的语义差距：以医学图像分割为例。

Neural Netw. 2024 Oct;178:106546. doi: 10.1016/j.neunet.2024.106546. Epub 2024 Jul 17.

CFATransUnet: Channel-wise cross fusion attention and transformer for 2D medical image segmentation.CFATransUnet：用于二维医学图像分割的通道式交叉融合注意力和转换器。

Comput Biol Med. 2024 Jan;168:107803. doi: 10.1016/j.compbiomed.2023.107803. Epub 2023 Dec 4.

D-SAT: dual semantic aggregation transformer with dual attention for medical image segmentation.D-SAT：用于医学图像分割的具有双重注意力的双重语义聚合转换器。

Phys Med Biol. 2023 Dec 22;69(1). doi: 10.1088/1361-6560/acf2e5.

ETUNet:Exploring efficient transformer enhanced UNet for 3D brain tumor segmentation.ETUNet：探索高效的基于Transformer 的增强型 UNet 进行 3D 脑肿瘤分割。

Comput Biol Med. 2024 Mar;171:108005. doi: 10.1016/j.compbiomed.2024.108005. Epub 2024 Jan 23.

A Novel Skip-Connection Strategy by Fusing Spatial and Channel Wise Features for Multi-Region Medical Image Segmentation.一种融合空间和通道特征的新型 Skip-Connection 策略，用于多区域医学图像分割。

IEEE J Biomed Health Inform. 2024 Sep;28(9):5396-5409. doi: 10.1109/JBHI.2024.3406786. Epub 2024 Sep 5.

USCT-UNet: Rethinking the Semantic Gap in U-Net Network From U-Shaped Skip Connections With Multichannel Fusion Transformer.USCT-UNet：从具有多通道融合变换的 U 形跳跃连接重新思考 U-Net 网络中的语义鸿沟

IEEE Trans Neural Syst Rehabil Eng. 2024;32:3782-3793. doi: 10.1109/TNSRE.2024.3468339. Epub 2024 Oct 16.

MADR-Net: multi-level attention dilated residual neural network for segmentation of medical images.MADR-Net：用于医学图像分割的多层次注意扩张残差神经网络。

Sci Rep. 2024 Jun 3;14(1):12699. doi: 10.1038/s41598-024-63538-2.

Dense gate network for biomedical image segmentation.密集门网络用于生物医学图像分割。

Int J Comput Assist Radiol Surg. 2020 Aug;15(8):1247-1255. doi: 10.1007/s11548-020-02138-7. Epub 2020 Apr 8.

E-DU: Deep neural network for multimodal medical image segmentation based on semantic gap compensation.基于语义鸿沟补偿的多模态医学图像分割的深度神经网络

Comput Biol Med. 2022 Dec;151(Pt A):106206. doi: 10.1016/j.compbiomed.2022.106206. Epub 2022 Oct 12.

FMD-UNet: fine-grained feature squeeze and multiscale cascade dilated semantic aggregation dual-decoder UNet for COVID-19 lung infection segmentation from CT images.FMD-UNet：用于从 CT 图像中 COVID-19 肺部感染分割的细粒度特征挤压和多尺度级联扩张语义聚合双解码器 UNet。

Biomed Phys Eng Express. 2024 Aug 27;10(5). doi: 10.1088/2057-1976/ad6f12.

引用本文的文献

Liver Semantic Segmentation Method Based on Multi-Channel Feature Extraction and Cross Fusion.基于多通道特征提取与交叉融合的肝脏语义分割方法

Bioengineering (Basel). 2025 Jun 11;12(6):636. doi: 10.3390/bioengineering12060636.

An Algorithm for Mining the Living Habits of Elderly People Living Alone Based on AIoT.一种基于人工智能物联网挖掘独居老人生活习惯的算法。

Sensors (Basel). 2025 Apr 4;25(7):2299. doi: 10.3390/s25072299.

Advances in Deep Learning for Semantic Segmentation of Low-Contrast Images: A Systematic Review of Methods, Challenges, and Future Directions.低对比度图像语义分割的深度学习进展：方法、挑战及未来方向的系统综述

Sensors (Basel). 2025 Mar 25;25(7):2043. doi: 10.3390/s25072043.

Towards Investigating Residual Hearing Loss: Quantification of Fibrosis in a Novel Cochlear OCT Dataset.迈向残余听力损失的研究：新型耳蜗光学相干断层扫描数据集纤维化的量化

IEEE Trans Biomed Eng. 2025 Jul;72(7):2218-2228. doi: 10.1109/TBME.2025.3537868.

Enhancing signal-to-noise ratio in real-time LED-based photoacoustic imaging: A comparative study of CNN-based deep learning architectures.实时基于LED的光声成像中提高信噪比：基于卷积神经网络的深度学习架构的比较研究

Photoacoustics. 2024 Nov 30;41:100674. doi: 10.1016/j.pacs.2024.100674. eCollection 2025 Feb.

DRA-Net: Medical image segmentation based on adaptive feature extraction and region-level information fusion.DRA-Net：基于自适应特征提取和区域级信息融合的医学图像分割

Sci Rep. 2024 Apr 27;14(1):9714. doi: 10.1038/s41598-024-60475-y.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用可学习的跳过连接缩小 U-Net 中的语义差距：以医学图像分割为例。

Narrowing the semantic gaps in U-Net with learnable skip connections: The case of medical image segmentation.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献