• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

OcuViT:一种基于视觉Transformer的自动糖尿病视网膜病变和年龄相关性黄斑变性分类方法。

OcuViT: A Vision Transformer-Based Approach for Automated Diabetic Retinopathy and AMD Classification.

作者信息

Ahmed Faisal, Uddin M D Joshem

机构信息

Department of Data Science and Mathematics, Embry-Riddle Aeronautical University, 3700 Willow Creek Rd, Prescott, 86301, AZ, USA.

Department of Mathematical Sciences, The University of Texas at Dallas, 800 W Campbell Rd, Richardson, 75080, TX, USA.

出版信息

J Imaging Inform Med. 2025 Sep 19. doi: 10.1007/s10278-025-01676-3.

DOI:10.1007/s10278-025-01676-3
PMID:40973913
Abstract

Early detection and accurate classification of retinal diseases, such as diabetic retinopathy (DR) and age-related macular degeneration (AMD), are essential to preventing vision loss and improving patient outcomes. Traditional methods for analyzing retinal fundus images are often manual, prolonged, and rely on the expertise of the clinician, leading to delays in diagnosis and treatment. Recent advances in machine learning, particularly deep learning, have introduced automated systems to assist in retinal disease detection; however, challenges such as computational inefficiency and robustness still remain. This paper proposes a novel approach that utilizes vision transformers (ViT) through transfer learning to address challenges in ophthalmic diagnostics. Using a pre-trained ViT-Base-Patch16-224 model, we fine-tune it for diabetic retinopathy (DR) and age-related macular degeneration (AMD) classification tasks. To adapt the model for retinal fundus images, we implement a streamlined preprocessing pipeline that converts the images into PyTorch tensors and standardizes them, ensuring compatibility with the ViT architecture and improving model performance. We validated our model, OcuViT, on two datasets. We used the APTOS dataset to perform binary and five-level severity classification and the IChallenge-AMD dataset for grading age-related macular degeneration (AMD). In the five-class DR and AMD grading tasks, OcuViT outperforms all existing CNN- and ViT-based methods across multiple metrics, achieving superior accuracy and robustness. For the binary DR task, it delivers highly competitive performance. These results demonstrate that OcuViT effectively leverages ViT-based transfer learning with an efficient preprocessing pipeline, significantly improving the precision and reliability of automated ophthalmic diagnosis.

摘要

早期检测和准确分类视网膜疾病,如糖尿病性视网膜病变(DR)和年龄相关性黄斑变性(AMD),对于预防视力丧失和改善患者预后至关重要。传统的分析视网膜眼底图像的方法通常是人工的、耗时的,并且依赖临床医生的专业知识,这导致诊断和治疗的延迟。机器学习,特别是深度学习的最新进展,引入了自动化系统来辅助视网膜疾病检测;然而,诸如计算效率低下和鲁棒性等挑战仍然存在。本文提出了一种新颖的方法,即通过迁移学习利用视觉Transformer(ViT)来解决眼科诊断中的挑战。我们使用预训练的ViT-Base-Patch16-224模型,针对糖尿病性视网膜病变(DR)和年龄相关性黄斑变性(AMD)分类任务对其进行微调。为了使模型适用于视网膜眼底图像,我们实施了一个简化的预处理管道,将图像转换为PyTorch张量并进行标准化,确保与ViT架构兼容并提高模型性能。我们在两个数据集上验证了我们的模型OcuViT。我们使用APTOS数据集进行二元和五级严重程度分类,并使用IChallenge-AMD数据集对年龄相关性黄斑变性(AMD)进行分级。在五类DR和AMD分级任务中,OcuViT在多个指标上优于所有现有的基于CNN和ViT的方法,实现了更高的准确性和鲁棒性。对于二元DR任务,它提供了极具竞争力的性能。这些结果表明,OcuViT通过高效的预处理管道有效地利用了基于ViT 的迁移学习,显著提高了自动化眼科诊断的精度和可靠性。

相似文献

1
OcuViT: A Vision Transformer-Based Approach for Automated Diabetic Retinopathy and AMD Classification.OcuViT:一种基于视觉Transformer的自动糖尿病视网膜病变和年龄相关性黄斑变性分类方法。
J Imaging Inform Med. 2025 Sep 19. doi: 10.1007/s10278-025-01676-3.
2
Artificial intelligence for diagnosing exudative age-related macular degeneration.人工智能在渗出性年龄相关性黄斑变性诊断中的应用。
Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.
3
Diabetic retinopathy screening through artificial intelligence algorithms: A systematic review.基于人工智能算法的糖尿病视网膜病变筛查:系统综述。
Surv Ophthalmol. 2024 Sep-Oct;69(5):707-721. doi: 10.1016/j.survophthal.2024.05.008. Epub 2024 Jun 15.
4
A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。
Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.
5
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
6
Diabetic retinopathy detection from fundus images: A wide survey from grading to segmentation of lesions.基于眼底图像的糖尿病视网膜病变检测:从病变分级到分割的全面综述。
Comput Biol Med. 2025 Sep;196(Pt B):110715. doi: 10.1016/j.compbiomed.2025.110715. Epub 2025 Jul 18.
7
Optical coherence tomography (OCT) for detection of macular oedema in patients with diabetic retinopathy.光学相干断层扫描(OCT)用于检测糖尿病视网膜病变患者的黄斑水肿。
Cochrane Database Syst Rev. 2015 Jan 7;1(1):CD008081. doi: 10.1002/14651858.CD008081.pub3.
8
Advancing respiratory disease diagnosis: A deep learning and vision transformer-based approach with a novel X-ray dataset.推进呼吸系统疾病诊断:一种基于深度学习和视觉Transformer的方法及新型X射线数据集
Comput Biol Med. 2025 Aug;194:110501. doi: 10.1016/j.compbiomed.2025.110501. Epub 2025 Jun 9.
9
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
10
Detection and classification of diabetic retinopathy in retinal fundus images using deep spiking Q Network optimized with partial reinforcement optimizer.使用部分强化优化器优化的深度脉冲Q网络对视网膜眼底图像中的糖尿病视网膜病变进行检测和分类。
Comput Biol Med. 2025 Sep;196(Pt C):110863. doi: 10.1016/j.compbiomed.2025.110863. Epub 2025 Aug 13.

本文引用的文献

1
Ensemble deep learning and EfficientNet for accurate diagnosis of diabetic retinopathy.集成深度学习与高效神经网络用于糖尿病视网膜病变的准确诊断。
Sci Rep. 2024 Dec 18;14(1):30554. doi: 10.1038/s41598-024-81132-4.
2
Novel artificial intelligence for diabetic retinopathy and diabetic macular edema: what is new in 2024?新型人工智能在糖尿病视网膜病变和糖尿病黄斑水肿中的应用:2024 年有哪些新进展?
Curr Opin Ophthalmol. 2024 Nov 1;35(6):472-479. doi: 10.1097/ICU.0000000000001084. Epub 2024 Sep 9.
3
Optimized deep CNN for detection and classification of diabetic retinopathy and diabetic macular edema.
优化的深度卷积神经网络用于糖尿病性视网膜病变和糖尿病性黄斑水肿的检测和分类。
BMC Med Imaging. 2024 Aug 28;24(1):227. doi: 10.1186/s12880-024-01406-1.
4
Advantages of transformer and its application for medical image segmentation: a survey.Transformer 的优势及其在医学图像分割中的应用:综述。
Biomed Eng Online. 2024 Feb 3;23(1):14. doi: 10.1186/s12938-024-01212-4.
5
Transformers in medical imaging: A survey.医学成像中的变压器:综述。
Med Image Anal. 2023 Aug;88:102802. doi: 10.1016/j.media.2023.102802. Epub 2023 Apr 5.
6
An interpretable transformer network for the retinal disease classification using optical coherence tomography.基于光相干断层扫描的视网膜疾病分类的可解释性变换网络
Sci Rep. 2023 Mar 3;13(1):3637. doi: 10.1038/s41598-023-30853-z.
7
Applying supervised contrastive learning for the detection of diabetic retinopathy and its severity levels from fundus images.应用监督对比学习从眼底图像中检测糖尿病性视网膜病变及其严重程度。
Comput Biol Med. 2022 Jul;146:105602. doi: 10.1016/j.compbiomed.2022.105602. Epub 2022 May 10.
8
Vision Transformer-based recognition of diabetic retinopathy grade.基于 Vision Transformer 的糖尿病视网膜病变分级识别。
Med Phys. 2021 Dec;48(12):7850-7863. doi: 10.1002/mp.15312. Epub 2021 Nov 16.
9
Understanding inherent image features in CNN-based assessment of diabetic retinopathy.基于卷积神经网络的糖尿病视网膜病变评估中内在图像特征的理解。
Sci Rep. 2021 May 6;11(1):9704. doi: 10.1038/s41598-021-89225-0.
10
Global Prevalence of Diabetic Retinopathy and Projection of Burden through 2045: Systematic Review and Meta-analysis.全球糖尿病视网膜病变的患病率及 2045 年预期负担的系统评价和荟萃分析。
Ophthalmology. 2021 Nov;128(11):1580-1591. doi: 10.1016/j.ophtha.2021.04.027. Epub 2021 May 1.