HyFormer：一种用于视网膜光学相干断层扫描（OCT）图像分割的混合变压器-卷积神经网络（CNN）架构

HyFormer: a hybrid transformer-CNN architecture for retinal OCT image segmentation.

作者信息

Jiang Qingxin, Fan Ying, Li Menghan, Fang Sheng, Zhu Weifang, Xiang Dehui, Peng Tao, Chen Xinjian, Xu Xun, Shi Fei

机构信息

MIPAV Lab, School of Electronic and Information Engineering, Soochow University, Suzhou 215006, China.

Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200080, China.

出版信息

Biomed Opt Express. 2024 Oct 2;15(11):6156-6170. doi: 10.1364/BOE.538959. eCollection 2024 Nov 1.

DOI:10.1364/BOE.538959

PMID:39553862

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11563338/

Abstract

Optical coherence tomography (OCT) has become the leading imaging technique in diagnosing and treatment planning for retinal diseases. Retinal OCT image segmentation involves extracting lesions and/or tissue structures to aid in the decisions of ophthalmologists, and multi-class segmentation is commonly needed. As the target regions often spread widely inside the retina, and the intensities and locations of different categories can be close, good segmentation networks must possess both global modeling capabilities and the ability to capture fine details. To address the challenge in capturing both global and local features simultaneously, we propose HyFormer, an efficient, lightweight, and robust hybrid network architecture. The proposed architecture features parallel Transformer and convolutional encoders for independent feature capture. A multi-scale gated attention block and a group positional embedding block are introduced within the Transformer encoder to enhance feature extraction. Feature integration is achieved in the decoder composed of the proposed three-path fusion modules. A class activation map-based cross-entropy loss function is also proposed to improve segmentation results. Evaluations are performed on a private dataset with myopic traction maculopathy lesions and the public AROI dataset for retinal layer and lesion segmentation with age-related degeneration. The results demonstrate HyFormer's superior segmentation performance and robustness compared to existing methods, showing promise for accurate and efficient OCT image segmentation. .

摘要

光学相干断层扫描（OCT）已成为视网膜疾病诊断和治疗规划中的领先成像技术。视网膜OCT图像分割涉及提取病变和/或组织结构，以协助眼科医生做出决策，通常需要进行多类别分割。由于目标区域在视网膜内往往广泛分布，且不同类别的强度和位置可能相近，因此良好的分割网络必须具备全局建模能力和捕捉精细细节的能力。为了应对同时捕捉全局和局部特征的挑战，我们提出了HyFormer，一种高效、轻量级且强大的混合网络架构。所提出的架构具有并行的Transformer和卷积编码器，用于独立特征捕捉。在Transformer编码器中引入了多尺度门控注意力块和组位置嵌入块，以增强特征提取。特征集成在由所提出的三路径融合模块组成的解码器中实现。还提出了一种基于类激活映射的交叉熵损失函数来改善分割结果。在一个包含近视牵引性黄斑病变的私有数据集以及用于年龄相关性黄斑变性的视网膜层和病变分割的公共AROI数据集上进行了评估。结果表明，与现有方法相比，HyFormer具有卓越的分割性能和鲁棒性，为准确高效的OCT图像分割展现出了前景。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/874a/11563338/f5bad95e1c70/boe-15-11-6156-g001.jpg

相似文献

HyFormer: a hybrid transformer-CNN architecture for retinal OCT image segmentation.HyFormer：一种用于视网膜光学相干断层扫描（OCT）图像分割的混合变压器-卷积神经网络（CNN）架构

Biomed Opt Express. 2024 Oct 2;15(11):6156-6170. doi: 10.1364/BOE.538959. eCollection 2024 Nov 1.

Transformer guided self-adaptive network for multi-scale skin lesion image segmentation.Transformer 引导的自适网络用于多尺度皮肤病变图像分割。

Comput Biol Med. 2024 Feb;169:107846. doi: 10.1016/j.compbiomed.2023.107846. Epub 2023 Dec 23.

TranSegNet: Hybrid CNN-Vision Transformers Encoder for Retina Segmentation of Optical Coherence Tomography.TranSegNet：用于光学相干断层扫描视网膜分割的混合卷积神经网络-视觉Transformer编码器

Life (Basel). 2023 Apr 10;13(4):976. doi: 10.3390/life13040976.

TLTNet: A novel transscale cascade layered transformer network for enhanced retinal blood vessel segmentation.TLTNet：一种新颖的跨尺度级联分层Transformer 网络，用于增强视网膜血管分割。

Comput Biol Med. 2024 Aug;178:108773. doi: 10.1016/j.compbiomed.2024.108773. Epub 2024 Jun 25.

FNeXter: A Multi-Scale Feature Fusion Network Based on ConvNeXt and Transformer for Retinal OCT Fluid Segmentation.FNeXter：一种基于ConvNeXt和Transformer的多尺度特征融合网络用于视网膜光学相干断层扫描液体分割

Sensors (Basel). 2024 Apr 10;24(8):2425. doi: 10.3390/s24082425.

HTC-retina: A hybrid retinal diseases classification model using transformer-Convolutional Neural Network from optical coherence tomography images.HTC-retina：一种使用来自光学相干断层扫描图像的变压器-卷积神经网络的混合视网膜疾病分类模型。

Comput Biol Med. 2024 Aug;178:108726. doi: 10.1016/j.compbiomed.2024.108726. Epub 2024 Jun 9.

Dual encoder network with transformer-CNN for multi-organ segmentation.基于 Transformer-CNN 的双编码器网络的多器官分割。

Med Biol Eng Comput. 2023 Mar;61(3):661-671. doi: 10.1007/s11517-022-02723-9. Epub 2022 Dec 29.

MC-DC: An MLP-CNN Based Dual-path Complementary Network for Medical Image Segmentation.MC-DC：一种基于多层感知器-卷积神经网络的医学图像分割双路径互补网络

Comput Methods Programs Biomed. 2023 Dec;242:107846. doi: 10.1016/j.cmpb.2023.107846. Epub 2023 Oct 5.

OCTFormer: A retinal OCT-angiography vessel segmentation transformer.OCTFormer：一种用于视网膜光学相干断层扫描血管造影的血管分割变压器

Comput Methods Programs Biomed. 2023 May;233:107454. doi: 10.1016/j.cmpb.2023.107454. Epub 2023 Mar 5.

MSCT-UNET: multi-scale contrastive transformer within U-shaped network for medical image segmentation.MSCT-UNET：U 形网络中的多尺度对比变换用于医学图像分割。

Phys Med Biol. 2023 Dec 28;69(1). doi: 10.1088/1361-6560/ad135d.

引用本文的文献

ISOSNet: a unified framework for cone photoreceptor detection and inner segment and outer segment length measurement from AO-OCT B-scans.ISOSNet：一种用于从自适应光学光学相干断层扫描（AO-OCT）B扫描中检测视锥光感受器以及测量内节和外节长度的统一框架。

Biomed Opt Express. 2025 Jul 17;16(8):3237-3254. doi: 10.1364/BOE.563128. eCollection 2025 Aug 1.

本文引用的文献

Segmentation of retinal detachment and retinoschisis in OCT images based on complementary multi-class segmentation networks.基于互补多类分割网络的 OCT 图像中视网膜脱离和视网膜劈裂的分割。

Phys Med Biol. 2023 May 30;68(11). doi: 10.1088/1361-6560/acd223.

H2Former: An Efficient Hierarchical Hybrid Transformer for Medical Image Segmentation.H2Former：一种用于医学图像分割的高效分层混合 Transformer

IEEE Trans Med Imaging. 2023 Sep;42(9):2763-2775. doi: 10.1109/TMI.2023.3264513. Epub 2023 Aug 31.

RetiFluidNet: A Self-Adaptive and Multi-Attention Deep Convolutional Network for Retinal OCT Fluid Segmentation.RetiFluidNet：一种用于视网膜 OCT 流体分割的自适应多注意深度卷积网络。

IEEE Trans Med Imaging. 2023 May;42(5):1413-1423. doi: 10.1109/TMI.2022.3228285. Epub 2023 May 2.

LOCTseg: A lightweight fully convolutional network for end-to-end optical coherence tomography segmentation.LOCTseg：一种用于端到端光学相干断层扫描分割的轻量级全卷积网络。

Comput Biol Med. 2022 Nov;150:106174. doi: 10.1016/j.compbiomed.2022.106174. Epub 2022 Oct 4.

Multi-Scale Pathological Fluid Segmentation in OCT With a Novel Curvature Loss in Convolutional Neural Network.基于卷积神经网络新型曲率损失的 OCT 多尺度病理性流体分割。

IEEE Trans Med Imaging. 2022 Jun;41(6):1547-1559. doi: 10.1109/TMI.2022.3142048. Epub 2022 Jun 1.

MsTGANet: Automatic Drusen Segmentation From Retinal OCT Images.MsTGANet：从视网膜 OCT 图像中自动进行脉络膜新生血管分割。

IEEE Trans Med Imaging. 2022 Feb;41(2):394-406. doi: 10.1109/TMI.2021.3112716. Epub 2022 Feb 2.

UNet++: A Nested U-Net Architecture for Medical Image Segmentation.U-Net++：一种用于医学图像分割的嵌套U-Net架构。

Deep Learn Med Image Anal Multimodal Learn Clin Decis Support (2018). 2018 Sep;11045:3-11. doi: 10.1007/978-3-030-00889-5_1. Epub 2018 Sep 20.

CPFNet: Context Pyramid Fusion Network for Medical Image Segmentation.CPFNet：用于医学图像分割的上下文金字塔融合网络。

IEEE Trans Med Imaging. 2020 Oct;39(10):3008-3018. doi: 10.1109/TMI.2020.2983721. Epub 2020 Mar 27.

MultiResUNet : Rethinking the U-Net architecture for multimodal biomedical image segmentation.多模态生物医学图像分割的 U-Net 架构再思考：MultiResUNet

Neural Netw. 2020 Jan;121:74-87. doi: 10.1016/j.neunet.2019.08.025. Epub 2019 Sep 4.

CE-Net: Context Encoder Network for 2D Medical Image Segmentation.CE-Net：用于二维医学图像分割的上下文编码器网络。

IEEE Trans Med Imaging. 2019 Oct;38(10):2281-2292. doi: 10.1109/TMI.2019.2903562. Epub 2019 Mar 7.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

HyFormer：一种用于视网膜光学相干断层扫描（OCT）图像分割的混合变压器-卷积神经网络（CNN）架构

HyFormer: a hybrid transformer-CNN architecture for retinal OCT image segmentation.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献