HCTNet：一种用于视网膜光学相干断层扫描图像分类的混合卷积神经网络-Transformer 网络。

HCTNet: A Hybrid ConvNet-Transformer Network for Retinal Optical Coherence Tomography Image Classification.

机构信息

Key Laboratory of the Ministry of Education for Optoelectronic Measurement Technology and Instrument, Beijing Information Science and Technology University, Beijing 100192, China.

Beijing Laboratory of Biomedical Testing Technology and Instruments, Beijing Information Science and Technology University, Beijing 100192, China.

出版信息

Biosensors (Basel). 2022 Jul 20;12(7):542. doi: 10.3390/bios12070542.

DOI:10.3390/bios12070542

PMID:35884345

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9313149/

Abstract

Automatic and accurate optical coherence tomography (OCT) image classification is of great significance to computer-assisted diagnosis of retinal disease. In this study, we propose a hybrid ConvNet-Transformer network (HCTNet) and verify the feasibility of a Transformer-based method for retinal OCT image classification. The HCTNet first utilizes a low-level feature extraction module based on the residual dense block to generate low-level features for facilitating the network training. Then, two parallel branches of the Transformer and the ConvNet are designed to exploit the global and local context of the OCT images. Finally, a feature fusion module based on an adaptive re-weighting mechanism is employed to combine the extracted global and local features for predicting the category of OCT images in the testing datasets. The HCTNet combines the advantage of the convolutional neural network in extracting local features and the advantage of the vision Transformer in establishing long-range dependencies. A verification on two public retinal OCT datasets shows that our HCTNet method achieves an overall accuracy of 91.56% and 86.18%, respectively, outperforming the pure ViT and several ConvNet-based classification methods.

摘要

自动且准确的光学相干断层扫描（OCT）图像分类对于视网膜疾病的计算机辅助诊断具有重要意义。在本研究中，我们提出了一种混合卷积神经网络-Transformer 网络（HCTNet），并验证了基于 Transformer 的方法在视网膜 OCT 图像分类中的可行性。HCTNet 首先利用基于残差密集块的底层特征提取模块生成底层特征，以促进网络训练。然后，设计了两个并行分支的 Transformer 和 ConvNet，以利用 OCT 图像的全局和局部上下文。最后，采用基于自适应重加权机制的特征融合模块，融合提取的全局和局部特征，以预测测试数据集中 OCT 图像的类别。HCTNet 结合了卷积神经网络在提取局部特征方面的优势和视觉 Transformer 在建立长程依赖关系方面的优势。在两个公共的视网膜 OCT 数据集上的验证表明，我们的 HCTNet 方法分别实现了 91.56%和 86.18%的整体准确率，优于纯 ViT 和几种基于 ConvNet 的分类方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d6ee/9313149/20bde9048b23/biosensors-12-00542-g001.jpg

相似文献

HCTNet: A Hybrid ConvNet-Transformer Network for Retinal Optical Coherence Tomography Image Classification.

Biosensors (Basel). 2022 Jul 20;12(7):542. doi: 10.3390/bios12070542.

HTC-retina: A hybrid retinal diseases classification model using transformer-Convolutional Neural Network from optical coherence tomography images.

Comput Biol Med. 2024 Aug;178:108726. doi: 10.1016/j.compbiomed.2024.108726. Epub 2024 Jun 9.

HCTNet: A hybrid CNN-transformer network for breast ultrasound image segmentation.

Comput Biol Med. 2023 Mar;155:106629. doi: 10.1016/j.compbiomed.2023.106629. Epub 2023 Feb 9.

Attention to Lesion: Lesion-Aware Convolutional Neural Network for Retinal Optical Coherence Tomography Image Classification.

IEEE Trans Med Imaging. 2019 Aug;38(8):1959-1970. doi: 10.1109/TMI.2019.2898414. Epub 2019 Feb 8.

Stitched vision transformer for age-related macular degeneration detection using retinal optical coherence tomography images.

PLoS One. 2024 Jun 5;19(6):e0304943. doi: 10.1371/journal.pone.0304943. eCollection 2024.

Towards more efficient ophthalmic disease classification and lesion location via convolution transformer.

Comput Methods Programs Biomed. 2022 Jun;220:106832. doi: 10.1016/j.cmpb.2022.106832. Epub 2022 Apr 27.

FNeXter: A Multi-Scale Feature Fusion Network Based on ConvNeXt and Transformer for Retinal OCT Fluid Segmentation.

Sensors (Basel). 2024 Apr 10;24(8):2425. doi: 10.3390/s24082425.

MBT: Model-Based Transformer for retinal optical coherence tomography image and video multi-classification.

Int J Med Inform. 2023 Oct;178:105178. doi: 10.1016/j.ijmedinf.2023.105178. Epub 2023 Aug 21.

An interpretable transformer network for the retinal disease classification using optical coherence tomography.

Sci Rep. 2023 Mar 3;13(1):3637. doi: 10.1038/s41598-023-30853-z.

TranSegNet: Hybrid CNN-Vision Transformers Encoder for Retina Segmentation of Optical Coherence Tomography.

Life (Basel). 2023 Apr 10;13(4):976. doi: 10.3390/life13040976.

引用本文的文献

Residual self-attention vision transformer for detecting acquired vitelliform lesions and age-related macular drusen.

Sci Rep. 2025 May 16;15(1):17107. doi: 10.1038/s41598-025-02299-y.

Discriminative, generative artificial intelligence, and foundation models in retina imaging.

Taiwan J Ophthalmol. 2024 Nov 28;14(4):473-485. doi: 10.4103/tjo.TJO-D-24-00064. eCollection 2024 Oct-Dec.

L2NLF: a novel linear-to-nonlinear framework for multi-modal medical image registration.

Biomed Eng Lett. 2024 Jan 10;14(3):497-509. doi: 10.1007/s13534-023-00344-1. eCollection 2024 May.

Multi-Scale-Denoising Residual Convolutional Network for Retinal Disease Classification Using OCT.

Sensors (Basel). 2023 Dec 27;24(1):150. doi: 10.3390/s24010150.

Vision transformers: The next frontier for deep learning-based ophthalmic image analysis.

Saudi J Ophthalmol. 2023 Jul 14;37(3):173-178. doi: 10.4103/sjopt.sjopt_91_23. eCollection 2023 Jul-Sep.

Attention TurkerNeXt: Investigations into Bipolar Disorder Detection Using OCT Images.

Diagnostics (Basel). 2023 Nov 10;13(22):3422. doi: 10.3390/diagnostics13223422.

本文引用的文献

A novel multiscale and multipath convolutional neural network based age-related macular degeneration detection using OCT images.

Comput Methods Programs Biomed. 2021 Sep;209:106294. doi: 10.1016/j.cmpb.2021.106294. Epub 2021 Jul 27.

Multi-Modal Retinal Image Classification With Modality-Specific Attention Network.

IEEE Trans Med Imaging. 2021 Jun;40(6):1591-1602. doi: 10.1109/TMI.2021.3059956. Epub 2021 Jun 1.

Automated diagnoses of age-related macular degeneration and polypoidal choroidal vasculopathy using bi-modal deep convolutional neural networks.

Br J Ophthalmol. 2021 Apr;105(4):561-566. doi: 10.1136/bjophthalmol-2020-315817. Epub 2020 Jun 4.

Classification of optical coherence tomography images using a capsule network.

BMC Ophthalmol. 2020 Mar 19;20(1):114. doi: 10.1186/s12886-020-01382-4.

In vivo monitoring the dynamic process of acute retinal hemorrhage and repair in zebrafish with spectral-domain optical coherence tomography.

J Biophotonics. 2019 Dec;12(12):e201900235. doi: 10.1002/jbio.201900235. Epub 2019 Oct 2.

Automated detection and classification of early AMD biomarkers using deep learning.

Sci Rep. 2019 Jul 29;9(1):10990. doi: 10.1038/s41598-019-47390-3.

Attention to Lesion: Lesion-Aware Convolutional Neural Network for Retinal Optical Coherence Tomography Image Classification.

IEEE Trans Med Imaging. 2019 Aug;38(8):1959-1970. doi: 10.1109/TMI.2019.2898414. Epub 2019 Feb 8.

Deep learning is effective for the classification of OCT images of normal versus Age-related Macular Degeneration.

Ophthalmol Retina. 2017 Jul-Aug;1(4):322-327. doi: 10.1016/j.oret.2016.12.009. Epub 2017 Feb 13.

Artificial intelligence-based decision-making for age-related macular degeneration.

Theranostics. 2019 Jan 1;9(1):232-245. doi: 10.7150/thno.28447. eCollection 2019.

The possibility of the combination of OCT and fundus images for improving the diagnostic accuracy of deep learning for age-related macular degeneration: a preliminary experiment.

Med Biol Eng Comput. 2019 Mar;57(3):677-687. doi: 10.1007/s11517-018-1915-z. Epub 2018 Oct 22.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

HCTNet：一种用于视网膜光学相干断层扫描图像分类的混合卷积神经网络-Transformer 网络。

HCTNet: A Hybrid ConvNet-Transformer Network for Retinal Optical Coherence Tomography Image Classification.

机构信息

Key Laboratory of the Ministry of Education for Optoelectronic Measurement Technology and Instrument, Beijing Information Science and Technology University, Beijing 100192, China.

Beijing Laboratory of Biomedical Testing Technology and Instruments, Beijing Information Science and Technology University, Beijing 100192, China.

出版信息

Biosensors (Basel). 2022 Jul 20;12(7):542. doi: 10.3390/bios12070542.

DOI:10.3390/bios12070542

PMID:35884345

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9313149/

Abstract

摘要

HCTNet：一种用于视网膜光学相干断层扫描图像分类的混合卷积神经网络-Transformer 网络。

HCTNet: A Hybrid ConvNet-Transformer Network for Retinal Optical Coherence Tomography Image Classification.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

HCTNet：一种用于视网膜光学相干断层扫描图像分类的混合卷积神经网络-Transformer 网络。

HCTNet: A Hybrid ConvNet-Transformer Network for Retinal Optical Coherence Tomography Image Classification.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献