CoTrFuse: a novel framework by fusing CNN and transformer for medical image segmentation.

Affiliations

College of Physics and Information Engineering, Fuzhou University, Fuzhou 350116, People's Republic of China.

Fujian Key Lab of Medical Instrumentation & Pharmaceutical Technology, Fuzhou University, Fuzhou 350116, People's Republic of China.

Publication information

Phys Med Biol. 2023 Aug 22;68(17). doi: 10.1088/1361-6560/acede8.

DOI:10.1088/1361-6560/acede8

PMID:37605997

Abstract

Medical image segmentation is a crucial and intricate step in medical image processing and analysis. With advances in artificial intelligence, deep learning techniques have been widely used for medical image segmentation in recent years. One such technique is U-Net, a U-shaped convolutional neural network (CNN), together with its variants. However, these methods struggle to capture global and long-range semantic information because the convolution operation inherently restricts the receptive field. Transformers are attention-based models with excellent global modeling capabilities, but their ability to capture local information is limited. To address this, we propose CoTrFuse, a network that combines the strengths of both CNNs and Transformers. CoTrFuse uses EfficientNet and Swin Transformer as dual encoders, and a Swin Transformer and CNN Fusion module fuses the features of the two branches before the skip connections. We evaluated the proposed network on two datasets: the ISIC-2017 challenge dataset and the COVID-QU-Ex dataset. The experimental results show that CoTrFuse outperforms several state-of-the-art segmentation methods on these datasets. The code is available at https://github.com/BinYCn/CoTrFuse.
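
The idea of fusing two encoder branches before a skip connection can be illustrated with a small sketch. The example below is not the fusion module from the paper; it assumes a generic stand-in (channel concatenation followed by a 1x1 projection) and uses plain Python lists instead of a deep learning library. The names `concat_channels`, `pointwise_project`, and `fuse`, as well as the toy feature values and weights, are hypothetical.

```python
from typing import List

# A feature map is indexed as [channel][row][col].
FeatureMap = List[List[List[float]]]


def concat_channels(a: FeatureMap, b: FeatureMap) -> FeatureMap:
    """Stack two feature maps of equal spatial size along the channel axis."""
    if len(a[0]) != len(b[0]) or len(a[0][0]) != len(b[0][0]):
        raise ValueError("spatial sizes must match before fusion")
    return a + b


def pointwise_project(x: FeatureMap, weights: List[List[float]]) -> FeatureMap:
    """Apply a 1x1 convolution: every output channel is a weighted sum
    of the input channels, computed independently at each pixel."""
    if any(len(row) != len(x) for row in weights):
        raise ValueError("each weight row needs one entry per input channel")
    height, width = len(x[0]), len(x[0][0])
    return [
        [
            [sum(w * x[c][i][j] for c, w in enumerate(row)) for j in range(width)]
            for i in range(height)
        ]
        for row in weights
    ]


def fuse(cnn_feat: FeatureMap, trans_feat: FeatureMap,
         weights: List[List[float]]) -> FeatureMap:
    """Assumed fusion step: concatenate both branches, then mix channels."""
    return pointwise_project(concat_channels(cnn_feat, trans_feat), weights)


# Toy features: one channel per branch on a 2x2 grid (made-up values).
cnn_feat = [[[1.0, 2.0], [3.0, 4.0]]]          # stands in for the CNN branch
trans_feat = [[[10.0, 20.0], [30.0, 40.0]]]    # stands in for the Transformer branch
weights = [[0.5, 0.5]]                         # one output channel, equal mix

fused = fuse(cnn_feat, trans_feat, weights)
print(fused)  # [[[5.5, 11.0], [16.5, 22.0]]]
```

In a real network the projection weights would be learned, and the fused map would be passed to the decoder through the skip connection at the matching resolution.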

Similar articles

O-Net: A Novel Framework With Deep Fusion of CNN and Transformer for Simultaneous Segmentation and Classification.

Front Neurosci. 2022 Jun 2;16:876065. doi: 10.3389/fnins.2022.876065. eCollection 2022.

Dual encoder network with transformer-CNN for multi-organ segmentation.

Med Biol Eng Comput. 2023 Mar;61(3):661-671. doi: 10.1007/s11517-022-02723-9. Epub 2022 Dec 29.

ETU-Net: edge enhancement-guided U-Net with transformer for skin lesion segmentation.

Phys Med Biol. 2023 Dec 22;69(1). doi: 10.1088/1361-6560/ad13d2.

LumVertCancNet: A novel 3D lumbar vertebral body cancellous bone location and segmentation method based on hybrid Swin-transformer.

Comput Biol Med. 2024 Mar;171:108237. doi: 10.1016/j.compbiomed.2024.108237. Epub 2024 Feb 28.

D-SAT: dual semantic aggregation transformer with dual attention for medical image segmentation.

Phys Med Biol. 2023 Dec 22;69(1). doi: 10.1088/1361-6560/acf2e5.

FAFuse: A Four-Axis Fusion framework of CNN and Transformer for medical image segmentation.

Comput Biol Med. 2023 Nov;166:107567. doi: 10.1016/j.compbiomed.2023.107567. Epub 2023 Oct 13.

SwinCross: Cross-modal Swin transformer for head-and-neck tumor segmentation in PET/CT images.

Med Phys. 2024 Mar;51(3):2096-2107. doi: 10.1002/mp.16703. Epub 2023 Sep 30.

TGDAUNet: Transformer and GCNN based dual-branch attention UNet for medical image segmentation.

Comput Biol Med. 2023 Dec;167:107583. doi: 10.1016/j.compbiomed.2023.107583. Epub 2023 Oct 21.

FDB-Net: Fusion double branch network combining CNN and transformer for medical image segmentation.

J Xray Sci Technol. 2024;32(4):931-951. doi: 10.3233/XST-230413.

Cited by

Efficient 3D Biomedical Image Segmentation by Parallelly Multiscale Transformer-CNN Aggregation Network.

Chem Biomed Imaging. 2025 Apr 8;3(8):522-533. doi: 10.1021/cbmi.4c00102. eCollection 2025 Aug 25.

A novel framework for segmentation of small targets in medical images.

Sci Rep. 2025 Mar 22;15(1):9924. doi: 10.1038/s41598-025-94437-9.

A semi-supervised domain adaptation method with scale-aware and global-local fusion for abdominal multi-organ segmentation.

J Appl Clin Med Phys. 2025 Mar;26(3):e70008. doi: 10.1002/acm2.70008. Epub 2025 Feb 9.

Hybrid transformer-CNN and LSTM model for lung disease segmentation and classification.

PeerJ Comput Sci. 2024 Dec 13;10:e2444. doi: 10.7717/peerj-cs.2444. eCollection 2024.

HDS-Net: Achieving fine-grained skin lesion segmentation using hybrid encoding and dynamic sparse attention.

PLoS One. 2024 Mar 21;19(3):e0299392. doi: 10.1371/journal.pone.0299392. eCollection 2024.

A deep learning-based framework (Co-ReTr) for auto-segmentation of non-small cell-lung cancer in computed tomography images.

J Appl Clin Med Phys. 2024 Mar;25(3):e14297. doi: 10.1002/acm2.14297. Epub 2024 Feb 19.
