Computer School, University of South China, Hengyang, 421001, China.
College of Mechanical and Vehicle Engineering, Hunan University, Changsha, 410082, China.
Med Biol Eng Comput. 2023 Mar;61(3):661-671. doi: 10.1007/s11517-022-02723-9. Epub 2022 Dec 29.
Medical image segmentation is a critical step in many imaging applications. Automatic segmentation using convolutional neural networks (CNNs) has attracted extensive attention. However, traditional CNN-based methods fail to extract global and long-range contextual information because convolution is a local operation. Transformers overcome this limitation of CNN-based models. Inspired by the success of transformers in computer vision (CV), many researchers have focused on designing transformer-based U-shaped architectures for medical image segmentation. However, purely transformer-based approaches cannot effectively capture fine-grained local details. This paper proposes a dual-encoder network combining a transformer and a CNN for multi-organ segmentation. The new segmentation framework takes full advantage of both the CNN and the transformer to enhance segmentation accuracy. The Swin Transformer encoder extracts global information, and the CNN encoder captures local information. We introduce fusion modules that fuse the convolutional features with the sequence of features from the transformer. The fused features are passed through skip connections, which effectively smooths the decision boundary. We extensively evaluate our method on the Synapse multi-organ CT dataset and the Automated Cardiac Diagnosis Challenge (ACDC) dataset. The results show that the proposed method achieves Dice similarity coefficient (DSC) scores of 80.68% and 91.12% on the Synapse multi-organ CT and ACDC datasets, respectively. Ablation studies on the ACDC dataset demonstrate the effectiveness of the critical components of our method. Our results match the ground-truth boundaries more consistently than existing models, and our approach produces more accurate results on challenging 2D images for multi-organ segmentation. Compared with state-of-the-art methods, the proposed method achieves superior performance on multi-organ segmentation tasks. Graphical Abstract: The key process in medical image segmentation.
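The fusion step described in the abstract, reshaping the transformer's token sequence back to a spatial grid, concatenating it with the CNN feature map, and projecting the result, can be illustrated with a minimal NumPy sketch. The shapes, the 1x1 projection, and the ReLU activation below are illustrative assumptions, not the paper's actual fusion module.

```python
import numpy as np

rng = np.random.default_rng(0)

# Assumed illustrative dimensions: an 8x8 feature grid, 32 CNN channels,
# 48 transformer channels, 64 output channels after fusion.
H, W = 8, 8
c_cnn, c_tr, c_out = 32, 48, 64

# Local features from the CNN encoder: (H, W, c_cnn).
cnn_feat = rng.standard_normal((H, W, c_cnn))

# Global features from the Swin Transformer encoder arrive as a token
# sequence of length H*W; reshape them back onto the spatial grid.
tokens = rng.standard_normal((H * W, c_tr))
tr_feat = tokens.reshape(H, W, c_tr)

# Fuse by channel-wise concatenation, as the abstract describes.
fused = np.concatenate([cnn_feat, tr_feat], axis=-1)  # (H, W, c_cnn + c_tr)

# A 1x1 convolution is equivalent to a per-pixel matrix multiply; apply
# a random projection plus ReLU as a stand-in for the learned fusion layer.
W_proj = rng.standard_normal((c_cnn + c_tr, c_out)) * 0.02
out = np.maximum(fused @ W_proj, 0.0)  # (H, W, c_out)

print(fused.shape, out.shape)
```

The fused map `out` would then be concatenated into the decoder through a skip connection at the matching resolution.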