在一项全国性筛查项目中，深度学习与人工分级在糖尿病视网膜病变严重程度分类方面的比较

Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program.

作者信息

Raumviboonsuk Paisan, Krause Jonathan, Chotcomwongse Peranut, Sayres Rory, Raman Rajiv, Widner Kasumi, Campana Bilson J L, Phene Sonia, Hemarat Kornwipa, Tadarati Mongkol, Silpa-Archa Sukhum, Limwattanayingyong Jirawut, Rao Chetan, Kuruvilla Oscar, Jung Jesse, Tan Jeffrey, Orprayoon Surapong, Kangwanwongpaisan Chawawat, Sukumalpaiboon Ramase, Luengchaichawang Chainarong, Fuangkaew Jitumporn, Kongsap Pipat, Chualinpha Lamyong, Saree Sarawuth, Kawinpanitan Srirut, Mitvongsa Korntip, Lawanasakol Siriporn, Thepchatri Chaiyasit, Wongpichedchai Lalita, Corrado Greg S, Peng Lily, Webster Dale R

机构信息

1Department of Ophthalmology, Rajavithi Hospital, Bangkok, Thailand.

2Google AI, Google, Mountain View, CA USA.

出版信息

NPJ Digit Med. 2019 Apr 10;2:25. doi: 10.1038/s41746-019-0099-8. eCollection 2019.

DOI:10.1038/s41746-019-0099-8

PMID:31304372

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6550283/

Abstract

Deep learning algorithms have been used to detect diabetic retinopathy (DR) with specialist-level accuracy. This study aims to validate one such algorithm on a large-scale clinical population, and compare the algorithm performance with that of human graders. A total of 25,326 gradable retinal images of patients with diabetes from the community-based, nationwide screening program of DR in Thailand were analyzed for DR severity and referable diabetic macular edema (DME). Grades adjudicated by a panel of international retinal specialists served as the reference standard. Relative to human graders, for detecting referable DR (moderate NPDR or worse), the deep learning algorithm had significantly higher sensitivity (0.97 vs. 0.74, < 0.001), and a slightly lower specificity (0.96 vs. 0.98, < 0.001). Higher sensitivity of the algorithm was also observed for each of the categories of severe or worse NPDR, PDR, and DME ( < 0.001 for all comparisons). The quadratic-weighted kappa for determination of DR severity levels by the algorithm and human graders was 0.85 and 0.78 respectively ( < 0.001 for the difference). Across different severity levels of DR for determining referable disease, deep learning significantly reduced the false negative rate (by 23%) at the cost of slightly higher false positive rates (2%). Deep learning algorithms may serve as a valuable tool for DR screening.

摘要

深度学习算法已被用于检测糖尿病视网膜病变（DR），其准确性达到了专家水平。本研究旨在在大规模临床人群中验证一种此类算法，并将该算法的性能与人工分级者的性能进行比较。对来自泰国全国社区DR筛查项目的25326张可分级糖尿病患者视网膜图像进行了分析，以确定DR严重程度和可转诊的糖尿病性黄斑水肿（DME）。由一组国际视网膜专家判定的分级用作参考标准。相对于人工分级者，对于检测可转诊的DR（中度非增殖性糖尿病视网膜病变或更严重），深度学习算法具有显著更高的灵敏度（0.97对0.74，<0.001），以及略低的特异性（0.96对0.98，<0.001）。对于严重或更严重的非增殖性糖尿病视网膜病变、增殖性糖尿病视网膜病变和糖尿病性黄斑水肿的每一类，该算法也观察到了更高的灵敏度（所有比较均<0.001）。算法和人工分级者用于确定DR严重程度水平的二次加权kappa分别为0.85和0.78（差异<0.001）。在确定可转诊疾病的不同DR严重程度水平上，深度学习显著降低了假阴性率（降低了23%），代价是假阳性率略有升高（2%）。深度学习算法可能是DR筛查的一种有价值的工具。

相似文献

Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program.

NPJ Digit Med. 2019 Apr 10;2:25. doi: 10.1038/s41746-019-0099-8. eCollection 2019.

Grader Variability and the Importance of Reference Standards for Evaluating Machine Learning Models for Diabetic Retinopathy.

Ophthalmology. 2018 Aug;125(8):1264-1272. doi: 10.1016/j.ophtha.2018.01.034. Epub 2018 Mar 13.

Impact of Gold-Standard Label Errors on Evaluating Performance of Deep Learning Models in Diabetic Retinopathy Screening: Nationwide Real-World Validation Study.

J Med Internet Res. 2024 Aug 14;26:e52506. doi: 10.2196/52506.

Accuracy of Integrated Artificial Intelligence Grading Using Handheld Retinal Imaging in a Community Diabetic Eye Screening Program.

Ophthalmol Sci. 2023 Dec 15;4(3):100457. doi: 10.1016/j.xops.2023.100457. eCollection 2024 May-Jun.

Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs.

JAMA. 2016 Dec 13;316(22):2402-2410. doi: 10.1001/jama.2016.17216.

Automated multidimensional deep learning platform for referable diabetic retinopathy detection: a multicentre, retrospective study.

BMJ Open. 2022 Jul 28;12(7):e060155. doi: 10.1136/bmjopen-2021-060155.

Validation of Deep Convolutional Neural Network-based algorithm for detection of diabetic retinopathy - Artificial intelligence versus clinician for screening.

Indian J Ophthalmol. 2020 Feb;68(2):398-405. doi: 10.4103/ijo.IJO_966_19.

An Automated Grading System for Detection of Vision-Threatening Referable Diabetic Retinopathy on the Basis of Color Fundus Photographs.

Diabetes Care. 2018 Dec;41(12):2509-2516. doi: 10.2337/dc18-0147. Epub 2018 Oct 1.

Using a Deep Learning Algorithm and Integrated Gradients Explanation to Assist Grading for Diabetic Retinopathy.

Ophthalmology. 2019 Apr;126(4):552-564. doi: 10.1016/j.ophtha.2018.11.016. Epub 2018 Dec 13.

Improved Automated Detection of Diabetic Retinopathy on a Publicly Available Dataset Through Integration of Deep Learning.

Invest Ophthalmol Vis Sci. 2016 Oct 1;57(13):5200-5206. doi: 10.1167/iovs.16-19964.

引用本文的文献

Improving diabetic retinopathy screening using artificial intelligence: design, evaluation and before-and-after study of a custom development.

Front Digit Health. 2025 Jun 19;7:1547045. doi: 10.3389/fdgth.2025.1547045. eCollection 2025.

CT-based artificial intelligence system complementing deep learning model and radiologist for liver fibrosis staging.

iScience. 2025 Mar 17;28(4):112224. doi: 10.1016/j.isci.2025.112224. eCollection 2025 Apr 18.

Artificial intelligence for early detection of diabetes mellitus complications via retinal imaging.

J Diabetes Metab Disord. 2025 Apr 12;24(1):104. doi: 10.1007/s40200-025-01596-7. eCollection 2025 Jun.

How Foundational Is the Retina Foundation Model? Estimating RETFound's Label Efficiency on Binary Classification of Normal versus Abnormal OCT Images.

Ophthalmol Sci. 2025 Jan 11;5(3):100707. doi: 10.1016/j.xops.2025.100707. eCollection 2025 May-Jun.

A Deep Learning Segmentation Model for Detection of Active Proliferative Diabetic Retinopathy.

Ophthalmol Ther. 2025 May;14(5):1053-1063. doi: 10.1007/s40123-025-01127-w. Epub 2025 Mar 27.

Performance and limitation of machine learning algorithms for diabetic retinopathy screening and its application in health management: a meta-analysis.

Biomed Eng Online. 2025 Mar 14;24(1):34. doi: 10.1186/s12938-025-01336-1.

Discriminative, generative artificial intelligence, and foundation models in retina imaging.

Taiwan J Ophthalmol. 2024 Nov 28;14(4):473-485. doi: 10.4103/tjo.TJO-D-24-00064. eCollection 2024 Oct-Dec.

Transforming Non-Digital, Clinical Workflows to Detect and Track Vision-Threatening Diabetic Retinopathy via a Digital Platform Integrating Artificial Intelligence: Implementation Research.

Ophthalmol Ther. 2025 Feb;14(2):447-460. doi: 10.1007/s40123-024-01086-8. Epub 2025 Jan 10.

Trends and hotspots in the field of diabetic retinopathy imaging research from 2000-2023.

Front Med (Lausanne). 2024 Oct 9;11:1481088. doi: 10.3389/fmed.2024.1481088. eCollection 2024.

Big data to guide glaucoma treatment.

Taiwan J Ophthalmol. 2023 Jul 28;14(3):333-339. doi: 10.4103/tjo.TJO-D-23-00068. eCollection 2024 Jul-Sep.

本文引用的文献

Saving sight in China and beyond: the Lifeline Express model.

BMJ Glob Health. 2018 Aug 16;3(4):e000766. doi: 10.1136/bmjgh-2018-000766. eCollection 2018.

Guidelines on Diabetic Eye Care: The International Council of Ophthalmology Recommendations for Screening, Follow-up, Referral, and Treatment Based on Resource Settings.

Ophthalmology. 2018 Oct;125(10):1608-1622. doi: 10.1016/j.ophtha.2018.04.007. Epub 2018 May 24.

Grader Variability and the Importance of Reference Standards for Evaluating Machine Learning Models for Diabetic Retinopathy.

Ophthalmology. 2018 Aug;125(8):1264-1272. doi: 10.1016/j.ophtha.2018.01.034. Epub 2018 Mar 13.

Development and Validation of a Deep Learning System for Diabetic Retinopathy and Related Eye Diseases Using Retinal Images From Multiethnic Populations With Diabetes.

JAMA. 2017 Dec 12;318(22):2211-2223. doi: 10.1001/jama.2017.18152.

Screening Intervals for Diabetic Retinopathy and Implications for Care.

Curr Diab Rep. 2017 Sep 5;17(10):96. doi: 10.1007/s11892-017-0928-6.

Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs.

JAMA. 2016 Dec 13;316(22):2402-2410. doi: 10.1001/jama.2016.17216.

Cost-effectiveness of a National Telemedicine Diabetic Retinopathy Screening Program in Singapore.

Ophthalmology. 2016 Dec;123(12):2571-2580. doi: 10.1016/j.ophtha.2016.08.021. Epub 2016 Oct 7.

Comparison of Prevalence of Diabetic Macular Edema Based on Monocular Fundus Photography vs Optical Coherence Tomography.

JAMA Ophthalmol. 2016 Feb;134(2):222-8. doi: 10.1001/jamaophthalmol.2015.5332.

Assessment of diabetic teleretinal imaging program at the Portland Department of Veterans Affairs Medical Center.

J Rehabil Res Dev. 2015;52(2):193-200. doi: 10.1682/JRRD.2014.03.0077.

The first rapid assessment of avoidable blindness (RAAB) in Thailand.

PLoS One. 2014 Dec 11;9(12):e114245. doi: 10.1371/journal.pone.0114245. eCollection 2014.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

在一项全国性筛查项目中，深度学习与人工分级在糖尿病视网膜病变严重程度分类方面的比较

Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program.

作者信息

机构信息

1Department of Ophthalmology, Rajavithi Hospital, Bangkok, Thailand.

2Google AI, Google, Mountain View, CA USA.

出版信息

NPJ Digit Med. 2019 Apr 10;2:25. doi: 10.1038/s41746-019-0099-8. eCollection 2019.

DOI:10.1038/s41746-019-0099-8

PMID:31304372

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6550283/

Abstract

摘要

在一项全国性筛查项目中，深度学习与人工分级在糖尿病视网膜病变严重程度分类方面的比较

Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

在一项全国性筛查项目中，深度学习与人工分级在糖尿病视网膜病变严重程度分类方面的比较

Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献