DECTNet：用于医学图像分割的双编码器网络结合卷积和 Transformer 架构。

DECTNet: Dual Encoder Network combined convolution and Transformer architecture for medical image segmentation.

机构信息

Department of Control Science and Engineering, Harbin Institute of Technology, Harbin, Heilongjiang, China.

Sergeant Schools of Army Academy of Armored Forces, Changchun, Jilin, China.

出版信息

PLoS One. 2024 Apr 4;19(4):e0301019. doi: 10.1371/journal.pone.0301019. eCollection 2024.

DOI:10.1371/journal.pone.0301019

PMID:38573957

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10994332/

Abstract

Automatic and accurate segmentation of medical images plays an essential role in disease diagnosis and treatment planning. Convolution neural networks have achieved remarkable results in medical image segmentation in the past decade. Meanwhile, deep learning models based on Transformer architecture also succeeded tremendously in this domain. However, due to the ambiguity of the medical image boundary and the high complexity of physical organization structures, implementing effective structure extraction and accurate segmentation remains a problem requiring a solution. In this paper, we propose a novel Dual Encoder Network named DECTNet to alleviate this problem. Specifically, the DECTNet embraces four components, which are a convolution-based encoder, a Transformer-based encoder, a feature fusion decoder, and a deep supervision module. The convolutional structure encoder can extract fine spatial contextual details in images. Meanwhile, the Transformer structure encoder is designed using a hierarchical Swin Transformer architecture to model global contextual information. The novel feature fusion decoder integrates the multi-scale representation from two encoders and selects features that focus on segmentation tasks by channel attention mechanism. Further, a deep supervision module is used to accelerate the convergence of the proposed method. Extensive experiments demonstrate that, compared to the other seven models, the proposed method achieves state-of-the-art results on four segmentation tasks: skin lesion segmentation, polyp segmentation, Covid-19 lesion segmentation, and MRI cardiac segmentation.

摘要

自动且准确的医学图像分割在疾病诊断和治疗规划中起着至关重要的作用。在过去十年中，卷积神经网络在医学图像分割方面取得了显著的成果。与此同时，基于 Transformer 架构的深度学习模型在该领域也取得了巨大的成功。然而，由于医学图像边界的模糊性和组织结构的高度复杂性，实现有效的结构提取和精确的分割仍然是一个需要解决的问题。在本文中，我们提出了一种名为 DECTNet 的新型双编码器网络来缓解这个问题。具体来说，DECTNet 包含四个组件，分别是基于卷积的编码器、基于 Transformer 的编码器、特征融合解码器和深度监督模块。卷积结构编码器可以提取图像中的精细空间上下文细节。同时，基于分层 Swin Transformer 架构设计的 Transformer 结构编码器用于对全局上下文信息进行建模。新颖的特征融合解码器集成了来自两个编码器的多尺度表示，并通过通道注意力机制选择专注于分割任务的特征。此外，深度监督模块用于加速所提出方法的收敛。大量实验表明，与其他七个模型相比，所提出的方法在四个分割任务上取得了最先进的结果：皮肤病变分割、息肉分割、Covid-19 病变分割和 MRI 心脏分割。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8739/10994332/c61555e9a4c1/pone.0301019.g001.jpg

相似文献

DECTNet: Dual Encoder Network combined convolution and Transformer architecture for medical image segmentation.

PLoS One. 2024 Apr 4;19(4):e0301019. doi: 10.1371/journal.pone.0301019. eCollection 2024.

Dual encoder network with transformer-CNN for multi-organ segmentation.

Med Biol Eng Comput. 2023 Mar;61(3):661-671. doi: 10.1007/s11517-022-02723-9. Epub 2022 Dec 29.

TransConver: transformer and convolution parallel network for developing automatic brain tumor segmentation in MRI images.

Quant Imaging Med Surg. 2022 Apr;12(4):2397-2415. doi: 10.21037/qims-21-919.

TLTNet: A novel transscale cascade layered transformer network for enhanced retinal blood vessel segmentation.

Comput Biol Med. 2024 Aug;178:108773. doi: 10.1016/j.compbiomed.2024.108773. Epub 2024 Jun 25.

TPFR-Net: U-shaped model for lung nodule segmentation based on transformer pooling and dual-attention feature reorganization.

Med Biol Eng Comput. 2023 Aug;61(8):1929-1946. doi: 10.1007/s11517-023-02852-9. Epub 2023 May 27.

MC-DC: An MLP-CNN Based Dual-path Complementary Network for Medical Image Segmentation.

Comput Methods Programs Biomed. 2023 Dec;242:107846. doi: 10.1016/j.cmpb.2023.107846. Epub 2023 Oct 5.

MESTrans: Multi-scale embedding spatial transformer for medical image segmentation.

Comput Methods Programs Biomed. 2023 May;233:107493. doi: 10.1016/j.cmpb.2023.107493. Epub 2023 Mar 17.

MS-TCNet: An effective Transformer-CNN combined network using multi-scale feature learning for 3D medical image segmentation.

Comput Biol Med. 2024 Mar;170:108057. doi: 10.1016/j.compbiomed.2024.108057. Epub 2024 Jan 28.

D-SAT: dual semantic aggregation transformer with dual attention for medical image segmentation.

Phys Med Biol. 2023 Dec 22;69(1). doi: 10.1088/1361-6560/acf2e5.

FDR-TransUNet: A novel encoder-decoder architecture with vision transformer for improved medical image segmentation.

Comput Biol Med. 2024 Feb;169:107858. doi: 10.1016/j.compbiomed.2023.107858. Epub 2023 Dec 14.

本文引用的文献

A modality-collaborative convolution and transformer hybrid network for unpaired multi-modal medical image segmentation with limited annotations.

Med Phys. 2023 Sep;50(9):5460-5478. doi: 10.1002/mp.16338. Epub 2023 Mar 15.

CAT-Net: A Cross-Slice Attention Transformer Model for Prostate Zonal Segmentation in MRI.

IEEE Trans Med Imaging. 2023 Jan;42(1):291-303. doi: 10.1109/TMI.2022.3211764. Epub 2022 Dec 29.

Learning COVID-19 Pneumonia Lesion Segmentation From Imperfect Annotations via Divergence-Aware Selective Training.

IEEE J Biomed Health Inform. 2022 Aug;26(8):3673-3684. doi: 10.1109/JBHI.2022.3172978. Epub 2022 Aug 11.

Global and Local Feature Reconstruction for Medical Image Segmentation.

IEEE Trans Med Imaging. 2022 Sep;41(9):2273-2284. doi: 10.1109/TMI.2022.3162111. Epub 2022 Aug 31.

FAT-Net: Feature adaptive transformers for automated skin lesion segmentation.

Med Image Anal. 2022 Feb;76:102327. doi: 10.1016/j.media.2021.102327. Epub 2021 Dec 4.

Exploiting Shared Knowledge From Non-COVID Lesions for Annotation-Efficient COVID-19 CT Lung Infection Segmentation.

IEEE J Biomed Health Inform. 2021 Nov;25(11):4152-4162. doi: 10.1109/JBHI.2021.3106341. Epub 2021 Nov 5.

Multi-Centre, Multi-Vendor and Multi-Disease Cardiac Segmentation: The M&Ms Challenge.

IEEE Trans Med Imaging. 2021 Dec;40(12):3543-3554. doi: 10.1109/TMI.2021.3090082. Epub 2021 Nov 30.

Attention-RefNet: Interactive Attention Refinement Network for Infected Area Segmentation of COVID-19.

IEEE J Biomed Health Inform. 2021 Jul;25(7):2363-2373. doi: 10.1109/JBHI.2021.3082527. Epub 2021 Jul 27.

Learn to Threshold: ThresholdNet With Confidence-Guided Manifold Mixup for Polyp Segmentation.

IEEE Trans Med Imaging. 2021 Apr;40(4):1134-1146. doi: 10.1109/TMI.2020.3046843. Epub 2021 Apr 1.

UNet++: A Nested U-Net Architecture for Medical Image Segmentation.

Deep Learn Med Image Anal Multimodal Learn Clin Decis Support (2018). 2018 Sep;11045:3-11. doi: 10.1007/978-3-030-00889-5_1. Epub 2018 Sep 20.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

DECTNet：用于医学图像分割的双编码器网络结合卷积和 Transformer 架构。

DECTNet: Dual Encoder Network combined convolution and Transformer architecture for medical image segmentation.

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献