College of Science, China Jiliang University, Hangzhou, Zhejiang, China.
Key Laboratory of Intelligent Manufacturing Quality Big Data Tracing and Analysis of Zhejiang Province, Hangzhou, Zhejiang, China.
PLoS One. 2024 Mar 6;19(3):e0299265. doi: 10.1371/journal.pone.0299265. eCollection 2024.
Computer-aided diagnosis systems based on deep learning algorithms have shown potential applications in the rapid diagnosis of diabetic retinopathy (DR). Because Transformers outperform convolutional neural networks (CNNs) on natural images, we attempted to develop a new model that classifies referable DR from a limited number of large retinal images using a Transformer. In this study, a Vision Transformer (ViT) with Masked Autoencoders (MAE) was applied to improve the classification performance of referable DR. We collected more than 100,000 publicly available fundus retinal images larger than 224×224 pixels and pre-trained a ViT on these images using MAE. The pre-trained ViT was then applied to classify referable DR, and its performance was compared with that of a ViT pre-trained on ImageNet. Pre-training on over 100,000 retinal images with MAE improved classification performance more than pre-training on ImageNet. The accuracy, area under the curve (AUC), highest sensitivity, and highest specificity of the proposed model are 93.42%, 0.9853, 0.973, and 0.9539, respectively. This study shows that MAE provides more flexibility in the input images and substantially reduces the number of images required. Moreover, the pre-training dataset in this study is much smaller than ImageNet, and pre-trained ImageNet weights are not required.
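Below is a minimal, self-contained sketch of the MAE-style pre-training step described in the abstract, written in plain PyTorch. The abstract does not give the exact encoder/decoder sizes, masking ratio, or data pipeline, so the model dimensions, the 75% masking ratio, and the random tensor standing in for fundus images are illustrative assumptions rather than the authors' implementation.

```python
# Illustrative MAE pre-training sketch: patchify an image, mask most patches,
# encode only the visible ones, and reconstruct the masked pixel patches.
# Hyperparameters and model sizes are assumptions, not the paper's settings.
import torch
import torch.nn as nn


def patchify(imgs, patch=16):
    # (B, 3, 224, 224) -> (B, 196, 16*16*3) non-overlapping patches
    B, C, H, W = imgs.shape
    p = patch
    x = imgs.reshape(B, C, H // p, p, W // p, p)
    x = x.permute(0, 2, 4, 3, 5, 1).reshape(B, (H // p) * (W // p), p * p * C)
    return x


def random_masking(x, mask_ratio=0.75):
    # Keep a random subset of patches; return kept tokens, binary mask, restore ids.
    B, N, D = x.shape
    keep = int(N * (1 - mask_ratio))
    noise = torch.rand(B, N, device=x.device)
    ids_shuffle = noise.argsort(dim=1)
    ids_restore = ids_shuffle.argsort(dim=1)
    ids_keep = ids_shuffle[:, :keep]
    x_kept = torch.gather(x, 1, ids_keep.unsqueeze(-1).expand(-1, -1, D))
    mask = torch.ones(B, N, device=x.device)
    mask[:, :keep] = 0
    mask = torch.gather(mask, 1, ids_restore)  # 1 = masked (to be reconstructed)
    return x_kept, mask, ids_restore


class TinyMAE(nn.Module):
    """Greatly shrunk stand-in for a ViT encoder plus MAE decoder (illustrative sizes)."""

    def __init__(self, patch_dim=16 * 16 * 3, dim=256, depth=4, heads=8):
        super().__init__()
        self.embed = nn.Linear(patch_dim, dim)
        self.pos = nn.Parameter(torch.zeros(1, 196, dim))
        enc_layer = nn.TransformerEncoderLayer(dim, heads, dim * 4, batch_first=True)
        self.encoder = nn.TransformerEncoder(enc_layer, depth)
        self.mask_token = nn.Parameter(torch.zeros(1, 1, dim))
        dec_layer = nn.TransformerEncoderLayer(dim, heads, dim * 4, batch_first=True)
        self.decoder = nn.TransformerEncoder(dec_layer, 2)
        self.head = nn.Linear(dim, patch_dim)  # reconstruct raw pixel patches

    def forward(self, imgs, mask_ratio=0.75):
        patches = patchify(imgs)
        tokens = self.embed(patches) + self.pos
        kept, mask, ids_restore = random_masking(tokens, mask_ratio)
        latent = self.encoder(kept)  # the encoder only sees visible patches
        # Append mask tokens and unshuffle back to the original patch order.
        B, N, D = tokens.shape
        mask_tokens = self.mask_token.expand(B, N - latent.shape[1], -1)
        full = torch.cat([latent, mask_tokens], dim=1)
        full = torch.gather(full, 1, ids_restore.unsqueeze(-1).expand(-1, -1, D))
        pred = self.head(self.decoder(full + self.pos))
        # Reconstruction loss is computed only on the masked patches, as in MAE.
        loss = ((pred - patches) ** 2).mean(dim=-1)
        return (loss * mask).sum() / mask.sum()


if __name__ == "__main__":
    model = TinyMAE()
    fake_batch = torch.randn(2, 3, 224, 224)  # stand-in for a batch of fundus images
    loss = model(fake_batch)
    loss.backward()
    print(f"reconstruction loss: {loss.item():.4f}")
```

After pre-training in this self-supervised way, the encoder weights would be reused and a classification head fine-tuned on labeled referable-DR images; that fine-tuning step is omitted here for brevity.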