Ahmed Faisal, Uddin M D Joshem
Department of Data Science and Mathematics, Embry-Riddle Aeronautical University, 3700 Willow Creek Rd, Prescott, 86301, AZ, USA.
Department of Mathematical Sciences, The University of Texas at Dallas, 800 W Campbell Rd, Richardson, 75080, TX, USA.
J Imaging Inform Med. 2025 Sep 19. doi: 10.1007/s10278-025-01676-3.
Early detection and accurate classification of retinal diseases, such as diabetic retinopathy (DR) and age-related macular degeneration (AMD), are essential to preventing vision loss and improving patient outcomes. Traditional methods for analyzing retinal fundus images are often manual, time-consuming, and dependent on clinician expertise, leading to delays in diagnosis and treatment. Recent advances in machine learning, particularly deep learning, have introduced automated systems to assist in retinal disease detection; however, challenges such as computational inefficiency and limited robustness remain. This paper proposes a novel approach that applies vision transformers (ViT) through transfer learning to address these challenges in ophthalmic diagnostics. We fine-tune a pre-trained ViT-Base-Patch16-224 model for DR and AMD classification tasks. To adapt the model to retinal fundus images, we implement a streamlined preprocessing pipeline that converts the images into PyTorch tensors and standardizes them, ensuring compatibility with the ViT architecture and improving model performance. We validated our model, OcuViT, on two datasets: the APTOS dataset for binary and five-level DR severity classification, and the IChallenge-AMD dataset for AMD grading. In the five-class DR and AMD grading tasks, OcuViT outperforms existing CNN- and ViT-based methods across multiple metrics, achieving superior accuracy and robustness. For the binary DR task, it delivers highly competitive performance. These results demonstrate that OcuViT effectively leverages ViT-based transfer learning with an efficient preprocessing pipeline, significantly improving the precision and reliability of automated ophthalmic diagnosis.
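The preprocessing step described above can be sketched as follows. This is a minimal illustration, not the authors' released code: it assumes the standard 224x224 input size of ViT-Base-Patch16-224 and the usual ImageNet normalization statistics, and it uses PIL/NumPy in place of the paper's PyTorch tensor conversion so the sketch is self-contained (the resulting CHW array maps directly onto `torch.from_numpy`). The function name `preprocess_fundus_image` is hypothetical.

```python
import numpy as np
from PIL import Image

# Standard ImageNet statistics, commonly used when fine-tuning a
# pre-trained ViT; the paper's exact normalization may differ.
IMAGENET_MEAN = np.array([0.485, 0.456, 0.406], dtype=np.float32)
IMAGENET_STD = np.array([0.229, 0.224, 0.225], dtype=np.float32)


def preprocess_fundus_image(img: Image.Image) -> np.ndarray:
    """Resize a retinal fundus image to 224x224, scale pixel values to
    [0, 1], standardize channel-wise, and return a (3, 224, 224) CHW
    array ready to be wrapped as a PyTorch tensor for ViT-Base-Patch16-224."""
    img = img.convert("RGB").resize((224, 224), Image.BILINEAR)
    x = np.asarray(img, dtype=np.float32) / 255.0   # HWC array in [0, 1]
    x = (x - IMAGENET_MEAN) / IMAGENET_STD          # channel-wise standardization
    return x.transpose(2, 0, 1)                     # HWC -> CHW layout


if __name__ == "__main__":
    # Synthetic stand-in for a fundus photograph; real inputs would be
    # loaded from the APTOS or IChallenge-AMD datasets.
    demo = Image.new("RGB", (512, 512), color=(120, 60, 30))
    out = preprocess_fundus_image(demo)
    print(out.shape)  # (3, 224, 224)
```

In a PyTorch pipeline the returned array would typically be converted with `torch.from_numpy(...)` and batched before being passed to the fine-tuned model.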