From the Centre for Eye Research Australia (S.J, X.S, M.H), Royal Victorian Eye and Ear Hospital, East Melbourne, Australia; Department of Surgery (Ophthalmology) (S.J, X.S, M.H), The University of Melbourne, Melbourne, Australia; Lions Aravind Institute of Community Ophthalmology (S.J, J.S, T.R), Aravind Eye Care System. Madurai, India.
Lions Aravind Institute of Community Ophthalmology (S.J, J.S, T.R), Aravind Eye Care System. Madurai, India.
Am J Ophthalmol. 2024 Jul;263:214-230. doi: 10.1016/j.ajo.2024.02.012. Epub 2024 Mar 2.
To evaluate the diagnostic accuracy of artificial intelligence (AI)-based automated diabetic retinopathy (DR) screening in real-world settings.
Systematic review and meta-analysis METHODS: We conducted a systematic review of relevant literature from January 2012 to August 2022 using databases including PubMed, Scopus and Web of Science. The quality of studies was evaluated using Quality Assessment for Diagnostic Accuracy Studies 2 (QUADAS-2) checklist. We calculated pooled accuracy, sensitivity, specificity, and diagnostic odds ratio (DOR) as summary measures. The study protocol was registered with the International Prospective Register of Systematic Reviews (PROSPERO - CRD42022367034).
We included 34 studies which utilized AI algorithms for diagnosing DR based on real-world fundus images. Quality assessment of these studies indicated a low risk of bias and low applicability concern. Among gradable images, the overall pooled accuracy, sensitivity, specificity, and DOR were 81%, 94% (95% CI: 92.0-96.0), 89% (95% CI: 85.0-92.0) and 128 (95% CI: 80-204) respectively. Sub-group analysis showed that, when acceptable quality imaging could be obtained, non-mydriatic fundus images had a better DOR of 143 (95% CI: 82-251) and studies using 2 field images had a better DOR of 161 (95% CI 74-347). Our meta-regression analysis revealed a statistically significant association between DOR and variables such as the income status, and the type of fundus camera.
Our findings indicate that AI algorithms have acceptable performance in screening for DR using fundus images compared to human graders. Implementing a fundus camera with AI-based software has the potential to assist ophthalmologists in reducing their workload and improving the accuracy of DR diagnosis.
评估基于人工智能(AI)的自动糖尿病视网膜病变(DR)筛查在真实环境中的诊断准确性。
系统评价和荟萃分析方法:我们从 2012 年 1 月至 2022 年 8 月使用包括 PubMed、Scopus 和 Web of Science 在内的数据库对相关文献进行了系统回顾。使用 QUADAS-2 清单评估研究质量。我们计算了汇总准确性、敏感度、特异性和诊断比值比(DOR)作为汇总指标。该研究方案已在国际前瞻性系统评价注册库(PROSPERO - CRD42022367034)中注册。
我们纳入了 34 项研究,这些研究基于真实世界的眼底图像利用 AI 算法诊断 DR。这些研究的质量评估表明存在低偏倚风险和低适用性问题。在可分级图像中,总体汇总准确性、敏感度、特异性和 DOR 分别为 81%、94%(95%CI:92.0-96.0)、89%(95%CI:85.0-92.0)和 128(95%CI:80-204)。亚组分析表明,当可以获得可接受质量的图像时,非散瞳眼底图像具有更好的 DOR(143,95%CI:82-251),使用 2 张眼底图像的研究具有更好的 DOR(161,95%CI:74-347)。我们的荟萃回归分析显示,DOR 与收入状况和眼底相机类型等变量之间存在统计学显著关联。
我们的研究结果表明,与人工分级相比,AI 算法在使用眼底图像筛查 DR 方面具有可接受的性能。使用基于 AI 的软件的眼底相机有可能协助眼科医生减轻工作量并提高 DR 诊断的准确性。