
Multi-class Classification of Retinal Eye Diseases from Ophthalmoscopy Images Using Transfer Learning-Based Vision Transformers.

Authors

Cutur Elif Setenay, Inan Neslihan Gokmen

Affiliations

Graduate School of Sciences and Engineering, Data Science, Koç University, Istanbul, Turkey.

College of Engineering, Department of Computer Engineering, Koç University, Rumelifeneri Yolu, 34450, Sarıyer, Istanbul, Turkey.

Publication information

J Imaging Inform Med. 2025 Jan 27. doi: 10.1007/s10278-025-01416-7.

DOI: 10.1007/s10278-025-01416-7
PMID: 39871038

Abstract

This study explores a transfer learning approach with vision transformers (ViTs) and convolutional neural networks (CNNs) for classifying retinal diseases, specifically diabetic retinopathy, glaucoma, and cataracts, from ophthalmoscopy images. Using a balanced subset of 4217 images and ophthalmology-specific pretrained ViT backbones, this method demonstrates significant improvements in classification accuracy, offering potential for broader applications in medical imaging. Glaucoma, diabetic retinopathy, and cataracts are common eye diseases that can cause vision loss if not treated. These diseases must be identified in the early stages to prevent eye damage progression. This paper focuses on the accurate identification and analysis of disparate eye diseases, including glaucoma, diabetic retinopathy, and cataracts, using ophthalmoscopy images. Deep learning (DL) has been widely used in image recognition for the early detection and treatment of eye diseases. In this study, ResNet50, DenseNet121, Inception-ResNetV2, and six variations of ViT are employed, and their performance in diagnosing diseases such as glaucoma, cataracts, and diabetic retinopathy is evaluated. In particular, the article uses the vision transformer model as an automated method to diagnose retinal eye diseases, highlighting the accuracy of pre-trained deep transfer learning (DTL) structures. The updated ViT#5 model with the augmented-regularized pre-trained model (AugReg ViT-L/16_224) and learning rate of 0.00002 outperforms the state-of-the-art techniques, obtaining a data-based accuracy score of 98.1% on a publicly accessible retinal ophthalmoscopy image dataset, which includes 4217 images. In most categories, the model outperforms other convolutional-based and ViT models in terms of accuracy, precision, recall, and F1 score. 
This research contributes significantly to medical image analysis, demonstrating the potential of AI in enhancing the precision of eye disease diagnoses and advocating for the integration of artificial intelligence in medical diagnostics.
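
The abstract compares models by accuracy, precision, recall, and F1 score. As a minimal sketch of how those four figures relate to a multi-class confusion matrix, the Python below computes them per class. The function names, the class labels, and the counts in `CONFUSION` are illustrative assumptions; they are not data or code from the paper.

```python
from typing import Dict, List


def accuracy(confusion: List[List[int]]) -> float:
    """Overall accuracy: correctly classified samples divided by all samples."""
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    return correct / total if total else 0.0


def per_class_metrics(
    confusion: List[List[int]], labels: List[str]
) -> Dict[str, Dict[str, float]]:
    """Per-class precision, recall, and F1.

    confusion[i][j] is the number of samples whose true class is i
    and whose predicted class is j.
    """
    n = len(labels)
    metrics: Dict[str, Dict[str, float]] = {}
    for k, name in enumerate(labels):
        tp = confusion[k][k]
        fp = sum(confusion[i][k] for i in range(n)) - tp  # predicted k, true class differs
        fn = sum(confusion[k]) - tp                       # true k, predicted class differs
        precision = tp / (tp + fp) if (tp + fp) else 0.0
        recall = tp / (tp + fn) if (tp + fn) else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        metrics[name] = {"precision": precision, "recall": recall, "f1": f1}
    return metrics


# Hypothetical labels and counts, chosen only to illustrate the arithmetic.
LABELS = ["cataract", "diabetic_retinopathy", "glaucoma"]
CONFUSION = [
    [48, 1, 1],  # true cataract
    [2, 46, 2],  # true diabetic retinopathy
    [0, 3, 47],  # true glaucoma
]

if __name__ == "__main__":
    print(f"accuracy: {accuracy(CONFUSION):.3f}")
    for name, m in per_class_metrics(CONFUSION, LABELS).items():
        print(f"{name}: P={m['precision']:.3f} R={m['recall']:.3f} F1={m['f1']:.3f}")
```

Reporting the per-class values alongside overall accuracy is what makes a statement such as "outperforms in most categories" checkable, since a single accuracy figure can hide a weak class.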

