O-Net: A Novel Framework With Deep Fusion of CNN and Transformer for Simultaneous Segmentation and Classification.

Author Information

Wang Tao, Lan Junlin, Han Zixin, Hu Ziwei, Huang Yuxiu, Deng Yanglin, Zhang Hejun, Wang Jianchao, Chen Musheng, Jiang Haiyan, Lee Ren-Guey, Gao Qinquan, Du Ming, Tong Tong, Chen Gang

Affiliation Information

College of Physics and Information Engineering, Fuzhou University, Fuzhou, China.

Fujian Key Lab of Medical Instrumentation and Pharmaceutical Technology, Fuzhou University, Fuzhou, China.

Publication Information

Front Neurosci. 2022 Jun 2;16:876065. doi: 10.3389/fnins.2022.876065. eCollection 2022.

Abstract

Deep learning has made continual breakthroughs in the medical field in recent years. Based on the convolutional neural network (CNN), the U-Net framework has become the benchmark for medical image segmentation. However, this framework cannot fully learn global information and long-range semantic dependencies. The transformer architecture has been shown to capture global information better than the U-Net, but it learns local information less effectively than a CNN. We therefore propose a novel network, referred to as the O-Net, which combines the advantages of the CNN and the transformer to exploit both global and local information for improved medical image segmentation and classification. In the encoder of the proposed O-Net framework, the CNN and the Swin Transformer are combined to acquire both global and local contextual features. In the decoder, the outputs of the Swin Transformer and the CNN blocks are fused to produce the final result. We evaluated the proposed network on the Synapse multi-organ CT dataset and the ISIC 2017 challenge dataset for the segmentation task. The classification network is trained simultaneously, reusing the encoder weights of the segmentation network. The experimental results show that the proposed O-Net outperforms state-of-the-art segmentation approaches, and that the segmentation results help improve the accuracy of the classification task. The code and models of this study are available at https://github.com/ortonwang/O-Net.
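
The dual-branch design described above can be illustrated with a minimal PyTorch sketch. This is not the authors' implementation (that is available at the GitHub link above); the module names, channel sizes, and the use of nn.TransformerEncoder as a stand-in for the Swin Transformer branch are illustrative assumptions. The sketch only shows the overall idea: two parallel encoder branches (convolutional for local context, transformer for global context), feature fusion feeding the segmentation decoder, and a classification head driven by the same fused encoder features.

```python
# Minimal sketch of the O-Net idea from the abstract: a CNN branch and a
# transformer branch encode the image in parallel, their features are fused
# for dense segmentation, and the shared encoder also feeds a classification
# head. Channel sizes and the nn.TransformerEncoder stand-in for the Swin
# Transformer are illustrative assumptions, not the authors' architecture.
import torch
import torch.nn as nn


class CNNBranch(nn.Module):
    """Plain convolutional encoder that captures local context."""
    def __init__(self, in_ch=3, dim=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_ch, dim, 3, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(dim, dim, 3, stride=2, padding=1), nn.ReLU(inplace=True),
        )

    def forward(self, x):                        # (B, C, H, W) -> (B, dim, H/4, W/4)
        return self.net(x)


class TransformerBranch(nn.Module):
    """Patch embedding + transformer encoder for global context
    (a stand-in for the Swin Transformer used in the paper)."""
    def __init__(self, in_ch=3, dim=64, patch=4, depth=2, heads=4):
        super().__init__()
        self.embed = nn.Conv2d(in_ch, dim, kernel_size=patch, stride=patch)
        layer = nn.TransformerEncoderLayer(d_model=dim, nhead=heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=depth)

    def forward(self, x):                        # (B, C, H, W) -> (B, dim, H/4, W/4)
        tokens = self.embed(x)                   # (B, dim, H/4, W/4)
        b, d, h, w = tokens.shape
        seq = tokens.flatten(2).transpose(1, 2)  # (B, H/4 * W/4, dim)
        seq = self.encoder(seq)
        return seq.transpose(1, 2).reshape(b, d, h, w)


class ONetSketch(nn.Module):
    """Fuses both branches for segmentation and reuses them for classification."""
    def __init__(self, in_ch=3, dim=64, n_seg_classes=9, n_cls_classes=2):
        super().__init__()
        self.cnn = CNNBranch(in_ch, dim)
        self.vit = TransformerBranch(in_ch, dim)
        self.fuse = nn.Conv2d(2 * dim, dim, kernel_size=1)
        # Decoder: upsample fused features back to the input resolution.
        self.decoder = nn.Sequential(
            nn.Upsample(scale_factor=4, mode="bilinear", align_corners=False),
            nn.Conv2d(dim, n_seg_classes, kernel_size=1),
        )
        # Classification head driven by the same shared encoder features.
        self.cls_head = nn.Sequential(
            nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(dim, n_cls_classes)
        )

    def forward(self, x):
        fused = self.fuse(torch.cat([self.cnn(x), self.vit(x)], dim=1))
        return self.decoder(fused), self.cls_head(fused)


if __name__ == "__main__":
    model = ONetSketch()
    seg_logits, cls_logits = model(torch.randn(2, 3, 224, 224))
    print(seg_logits.shape, cls_logits.shape)    # (2, 9, 224, 224) (2, 2)
```

In this sketch the classification head consumes the same fused encoder features used for segmentation, mirroring the abstract's statement that the classification network is trained with the segmentation encoder's weights.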


Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/868c/9201625/b8b58cefbf9f/fnins-16-876065-g0001.jpg
