• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用多变压器U-Net增强医学图像分割

Enhancing medical image segmentation with a multi-transformer U-Net.

作者信息

Dan Yongping, Jin Weishou, Yue Xuebin, Wang Zhida

机构信息

School of Electronic and Information, Zhongyuan University Of Technology, Zhengzhou, Henan, China.

Research Organization of Science and Technology, Ritsumeikan University, Kusatsu, Japan.

出版信息

PeerJ. 2024 Feb 29;12:e17005. doi: 10.7717/peerj.17005. eCollection 2024.

DOI:10.7717/peerj.17005
PMID:38435997
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10909362/
Abstract

Various segmentation networks based on Swin Transformer have shown promise in medical segmentation tasks. Nonetheless, challenges such as lower accuracy and slower training convergence have persisted. To tackle these issues, we introduce a novel approach that combines the Swin Transformer and Deformable Transformer to enhance overall model performance. We leverage the Swin Transformer's window attention mechanism to capture local feature information and employ the Deformable Transformer to adjust sampling positions dynamically, accelerating model convergence and aligning it more closely with object shapes and sizes. By amalgamating both Transformer modules and incorporating additional skip connections to minimize information loss, our proposed model excels at rapidly and accurately segmenting CT or X-ray lung images. Experimental results demonstrate the remarkable, showcasing the significant prowess of our model. It surpasses the performance of the standalone Swin Transformer's Swin Unet and converges more rapidly under identical conditions, yielding accuracy improvements of 0.7% (resulting in 88.18%) and 2.7% (resulting in 98.01%) on the COVID-19 CT scan lesion segmentation dataset and Chest X-ray Masks and Labels dataset, respectively. This advancement has the potential to aid medical practitioners in early diagnosis and treatment decision-making.

摘要

基于Swin Transformer的各种分割网络在医学分割任务中已展现出潜力。尽管如此,诸如准确率较低和训练收敛较慢等挑战依然存在。为解决这些问题,我们引入了一种新颖的方法,将Swin Transformer和可变形Transformer相结合,以提升整体模型性能。我们利用Swin Transformer的窗口注意力机制来捕捉局部特征信息,并采用可变形Transformer动态调整采样位置,加速模型收敛并使其更紧密地与物体形状和大小对齐。通过融合这两个Transformer模块并纳入额外的跳跃连接以最小化信息损失,我们提出的模型在快速准确地分割CT或X光肺部图像方面表现出色。实验结果令人瞩目,展示了我们模型的强大能力。它超越了独立的Swin Transformer的Swin Unet的性能,并且在相同条件下收敛更快,在COVID-19 CT扫描病变分割数据集和胸部X光掩码与标签数据集上分别实现了0.7%(达到88.18%)和2.7%(达到98.01%)的准确率提升。这一进展有可能帮助医学从业者进行早期诊断和治疗决策。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/137b/10909362/b55a6e56c776/peerj-12-17005-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/137b/10909362/c7a3213b431b/peerj-12-17005-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/137b/10909362/a92d5ded5d7c/peerj-12-17005-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/137b/10909362/12489e3b05a6/peerj-12-17005-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/137b/10909362/73517dbf8f0e/peerj-12-17005-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/137b/10909362/b55a6e56c776/peerj-12-17005-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/137b/10909362/c7a3213b431b/peerj-12-17005-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/137b/10909362/a92d5ded5d7c/peerj-12-17005-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/137b/10909362/12489e3b05a6/peerj-12-17005-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/137b/10909362/73517dbf8f0e/peerj-12-17005-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/137b/10909362/b55a6e56c776/peerj-12-17005-g005.jpg

相似文献

1
Enhancing medical image segmentation with a multi-transformer U-Net.使用多变压器U-Net增强医学图像分割
PeerJ. 2024 Feb 29;12:e17005. doi: 10.7717/peerj.17005. eCollection 2024.
2
SwinCross: Cross-modal Swin transformer for head-and-neck tumor segmentation in PET/CT images.SwinCross:用于 PET/CT 图像中头颈部肿瘤分割的跨模态 Swin 变换器。
Med Phys. 2024 Mar;51(3):2096-2107. doi: 10.1002/mp.16703. Epub 2023 Sep 30.
3
Improved UNet with Attention for Medical Image Segmentation.基于注意力机制的改进型 UNet 用于医学图像分割。
Sensors (Basel). 2023 Oct 20;23(20):8589. doi: 10.3390/s23208589.
4
Dual encoder network with transformer-CNN for multi-organ segmentation.基于 Transformer-CNN 的双编码器网络的多器官分割。
Med Biol Eng Comput. 2023 Mar;61(3):661-671. doi: 10.1007/s11517-022-02723-9. Epub 2022 Dec 29.
5
High-Resolution Swin Transformer for Automatic Medical Image Segmentation.基于高分辨率 Swin Transformer 的医学图像自动分割。
Sensors (Basel). 2023 Mar 24;23(7):3420. doi: 10.3390/s23073420.
6
CPFTransformer: transformer fusion context pyramid medical image segmentation network.CPFTransformer:变换器融合上下文金字塔医学图像分割网络。
Front Neurosci. 2023 Dec 7;17:1288366. doi: 10.3389/fnins.2023.1288366. eCollection 2023.
7
Swin-Net: A Swin-Transformer-Based Network Combing with Multi-Scale Features for Segmentation of Breast Tumor Ultrasound Images.Swin-Net:一种基于Swin-Transformer并结合多尺度特征的用于乳腺肿瘤超声图像分割的网络。
Diagnostics (Basel). 2024 Jan 26;14(3):269. doi: 10.3390/diagnostics14030269.
8
Object Detection Based on Swin Deformable Transformer-BiPAFPN-YOLOX.基于 Swin 变形 Transformer-BiPAFPN-YOLOX 的目标检测。
Comput Intell Neurosci. 2023 Mar 9;2023:4228610. doi: 10.1155/2023/4228610. eCollection 2023.
9
MESTrans: Multi-scale embedding spatial transformer for medical image segmentation.MESTrans:用于医学图像分割的多尺度嵌入空间变换器
Comput Methods Programs Biomed. 2023 May;233:107493. doi: 10.1016/j.cmpb.2023.107493. Epub 2023 Mar 17.
10
ConvWin-UNet: UNet-like hierarchical vision Transformer combined with convolution for medical image segmentation.卷积窗口U-Net:结合卷积的类U-Net层次视觉Transformer用于医学图像分割
Math Biosci Eng. 2023 Jan;20(1):128-144. doi: 10.3934/mbe.2023007. Epub 2022 Sep 29.

引用本文的文献

1
IDCC-SAM: A Zero-Shot Approach for Cell Counting in Immunocytochemistry Dataset Using the Segment Anything Model.IDCC-SAM:一种使用Segment Anything模型对免疫细胞化学数据集中的细胞进行计数的零样本方法。
Bioengineering (Basel). 2025 Feb 14;12(2):184. doi: 10.3390/bioengineering12020184.
2
Flood change detection model based on an improved U-net network and multi-head attention mechanism.基于改进的U-net网络和多头注意力机制的洪水变化检测模型
Sci Rep. 2025 Jan 26;15(1):3295. doi: 10.1038/s41598-025-87851-6.
3
Joint segmentation of sternocleidomastoid and skeletal muscles in computed tomography images using a multiclass learning approach.

本文引用的文献

1
Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives.基于 Transformer 的医学影像变革?关键特性、当前进展和未来展望的对比综述。
Med Image Anal. 2023 Apr;85:102762. doi: 10.1016/j.media.2023.102762. Epub 2023 Jan 31.
2
Chest X-ray pneumothorax segmentation using U-Net with EfficientNet and ResNet architectures.使用具有EfficientNet和ResNet架构的U-Net进行胸部X光气胸分割。
PeerJ Comput Sci. 2021 Jun 29;7:e607. doi: 10.7717/peerj-cs.607. eCollection 2021.
3
Efficient Medical Image Segmentation Based on Knowledge Distillation.
使用多类学习方法对 CT 图像中的胸锁乳突肌和骨骼肌进行联合分割。
Radiol Phys Technol. 2024 Dec;17(4):854-861. doi: 10.1007/s12194-024-00839-1. Epub 2024 Sep 6.
基于知识蒸馏的高效医学图像分割。
IEEE Trans Med Imaging. 2021 Dec;40(12):3820-3831. doi: 10.1109/TMI.2021.3098703. Epub 2021 Nov 30.
4
A review and comparison of breast tumor cell nuclei segmentation performances using deep convolutional neural networks.基于深度卷积神经网络的乳腺肿瘤细胞细胞核分割性能的评估与比较。
Sci Rep. 2021 Apr 13;11(1):8025. doi: 10.1038/s41598-021-87496-1.
5
Dense U-net Based on Patch-Based Learning for Retinal Vessel Segmentation.基于基于块学习的密集U型网络用于视网膜血管分割
Entropy (Basel). 2019 Feb 12;21(2):168. doi: 10.3390/e21020168.
6
Unsupervised 3D End-to-End Medical Image Registration With Volume Tweening Network.无监督的 3D 端到端医学图像配准方法,采用体素插值网络。
IEEE J Biomed Health Inform. 2020 May;24(5):1394-1404. doi: 10.1109/JBHI.2019.2951024. Epub 2019 Nov 1.
7
A review on brain tumor segmentation of MRI images.磁共振成像脑肿瘤分割的研究综述。
Magn Reson Imaging. 2019 Sep;61:247-259. doi: 10.1016/j.mri.2019.05.043. Epub 2019 Jun 11.
8
CE-Net: Context Encoder Network for 2D Medical Image Segmentation.CE-Net:用于二维医学图像分割的上下文编码器网络。
IEEE Trans Med Imaging. 2019 Oct;38(10):2281-2292. doi: 10.1109/TMI.2019.2903562. Epub 2019 Mar 7.
9
Deep Semantic Segmentation of Kidney and Space-Occupying Lesion Area Based on SCNN and ResNet Models Combined with SIFT-Flow Algorithm.基于 SCNN 和 ResNet 模型结合 SIFT-Flow 算法的肾脏及占位性病变区域的深度语义分割。
J Med Syst. 2018 Nov 19;43(1):2. doi: 10.1007/s10916-018-1116-1.
10
Multiple Resolution Residually Connected Feature Streams for Automatic Lung Tumor Segmentation From CT Images.多分辨率残差连接特征流用于从 CT 图像中自动分割肺肿瘤。
IEEE Trans Med Imaging. 2019 Jan;38(1):134-144. doi: 10.1109/TMI.2018.2857800. Epub 2018 Jul 23.