MSR-UNet: enhancing multi-scale and long-range dependencies in medical image segmentation.

Author information

Wang Shuai, Liu Lei, Wang Jun, Peng Xinyue, Liu Baosen

Affiliations

School of Computer Science and Technology, Huaibei Normal University, Huaibei, China.

Huaibei Key Laboratory of Digital Multimedia Intelligent Information Processing, Huaibei, China.

Publication information

PeerJ Comput Sci. 2024 Dec 3;10:e2563. doi: 10.7717/peerj-cs.2563. eCollection 2024.

DOI:10.7717/peerj-cs.2563

PMID:39650414

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC11623095/

Abstract

Transformer-based methods have attracted widespread attention in medical image segmentation. Due to the diversity of organs, effectively modeling multi-scale information and establishing long-range dependencies between pixels are both crucial for successful medical image segmentation. However, most studies rely on a fixed single-scale window for modeling, which ignores the potential impact of window size on performance. This limitation can hinder the ability of window-based models to fully explore multi-scale and long-range relationships within medical images. To address this issue, we propose a multi-scale reconfiguration self-attention (MSR-SA) module that accurately models multi-scale information and long-range dependencies in medical images. The MSR-SA module first divides the attention heads into multiple groups, each assigned an ascending dilation rate. These groups are then uniformly split into several non-overlapping local windows. Using dilated sampling, we gather the same number of keys in every group to obtain both long-range and multi-scale information. Finally, dynamic information fusion is achieved by integrating features from the sampling points at corresponding positions across different windows. Based on the MSR-SA module, we propose a multi-scale reconfiguration U-Net (MSR-UNet) framework for medical image segmentation. Experiments on the Synapse and Automated Cardiac Diagnosis Challenge (ACDC) datasets show that MSR-UNet can achieve satisfactory segmentation results. The code is available at https://github.com/davidsmithwj/MSR-UNet (DOI: 10.5281/zenodo.13969855).
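The dilated sampling step described in the abstract can be illustrated with a short NumPy sketch. This is not the authors' implementation (see the linked repository for that); it only shows, under stated assumptions, how different head groups could gather the same number of keys while covering different spatial extents. The helper names `dilated_key_grid` and `group_keys`, the window size of 4, the dilation rates (1, 2, 3), and the wrap-around border handling are all hypothetical choices made for illustration.

```python
import numpy as np


def dilated_key_grid(top, left, window, dilation, height, width):
    """Return row/column index grids for window*window dilated key positions.

    Hypothetical helper. Positions wrap around the feature-map border
    (modulo), an assumption made here so the key count stays fixed.
    """
    offsets = np.arange(window) * dilation
    rows = (top + offsets) % height
    cols = (left + offsets) % width
    return np.meshgrid(rows, cols, indexing="ij")


def group_keys(feature, window=4, dilations=(1, 2, 3)):
    """Gather keys for each head group, one dilation rate per group.

    `feature` has shape (height, width, channels). Every group receives
    window*window keys; only the spacing between them changes.
    """
    height, width, _ = feature.shape
    keys = {}
    for rate in dilations:
        rows, cols = dilated_key_grid(0, 0, window, rate, height, width)
        # Integer-array indexing picks one feature vector per sampled position.
        keys[rate] = feature[rows, cols].reshape(window * window, -1)
    return keys


# Toy feature map: 16 x 16 positions with 2 channels each.
feature = np.arange(16 * 16 * 2, dtype=np.float32).reshape(16, 16, 2)
keys = group_keys(feature)
for rate, k in keys.items():
    span = (4 - 1) * rate + 1  # side length covered by the sampled grid
    print(f"dilation={rate}: {k.shape[0]} keys, span {span}x{span}")
```

In this sketch each group attends over 16 keys, but the covered area grows with the dilation rate, which is one way to read the abstract's claim that a fixed key budget can capture both local and long-range context.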
