Noriega Alejandro, Meizner Daniela, Camacho Dalia, Enciso Jennifer, Quiroz-Mercado Hugo, Morales-Canton Virgilio, Almaatouq Abdullah, Pentland Alex
MIT Media Laboratory, Massachusetts Institute of Technology, Cambridge, MA, United States.
Prosperia Salud, Mexico City, Mexico.
JMIR Form Res. 2021 Aug 26;5(8):e25290. doi: 10.2196/25290.
The automated screening of patients at risk of developing diabetic retinopathy represents an opportunity to improve their midterm outcome and lower the public expenditure associated with direct and indirect costs of common sight-threatening complications of diabetes.
This study aimed to develop and evaluate the performance of an automated deep learning-based system to classify retinal fundus images as referable and nonreferable diabetic retinopathy cases, from international and Mexican patients. In particular, we aimed to evaluate the performance of the automated retina image analysis (ARIA) system under an independent scheme (ie, only ARIA screening) and 2 assistive schemes (ie, hybrid ARIA plus ophthalmologist screening), using a web-based platform for remote image analysis to determine and compare the sensibility and specificity of the 3 schemes.
A randomized controlled experiment was performed where 17 ophthalmologists were asked to classify a series of retinal fundus images under 3 different conditions. The conditions were to (1) screen the fundus image by themselves (solo); (2) screen the fundus image after exposure to the retina image classification of the ARIA system (ARIA answer); and (3) screen the fundus image after exposure to the classification of the ARIA system, as well as its level of confidence and an attention map highlighting the most important areas of interest in the image according to the ARIA system (ARIA explanation). The ophthalmologists' classification in each condition and the result from the ARIA system were compared against a gold standard generated by consulting and aggregating the opinion of 3 retina specialists for each fundus image.
The ARIA system was able to classify referable vs nonreferable cases with an area under the receiver operating characteristic curve of 98%, a sensitivity of 95.1%, and a specificity of 91.5% for international patient cases. There was an area under the receiver operating characteristic curve of 98.3%, a sensitivity of 95.2%, and a specificity of 90% for Mexican patient cases. The ARIA system performance was more successful than the average performance of the 17 ophthalmologists enrolled in the study. Additionally, the results suggest that the ARIA system can be useful as an assistive tool, as sensitivity was significantly higher in the experimental condition where ophthalmologists were exposed to the ARIA system's answer prior to their own classification (93.3%), compared with the sensitivity of the condition where participants assessed the images independently (87.3%; P=.05).
These results demonstrate that both independent and assistive use cases of the ARIA system present, for Latin American countries such as Mexico, a substantial opportunity toward expanding the monitoring capacity for the early detection of diabetes-related blindness.
对有患糖尿病视网膜病变风险的患者进行自动筛查,为改善他们的中期预后以及降低与糖尿病常见致盲并发症的直接和间接成本相关的公共支出提供了契机。
本研究旨在开发并评估一种基于深度学习的自动系统,用于将国际和墨西哥患者的视网膜眼底图像分类为可转诊和不可转诊的糖尿病视网膜病变病例。具体而言,我们旨在评估自动视网膜图像分析(ARIA)系统在独立方案(即仅ARIA筛查)和两种辅助方案(即ARIA与眼科医生联合筛查)下的性能,使用基于网络的远程图像分析平台来确定并比较这三种方案的敏感性和特异性。
进行了一项随机对照实验,邀请17名眼科医生在3种不同条件下对一系列视网膜眼底图像进行分类。这些条件分别是:(1)独自筛查眼底图像(单独筛查);(2)在接触ARIA系统的视网膜图像分类结果后筛查眼底图像(ARIA结果);(3)在接触ARIA系统的分类结果及其置信度以及根据ARIA系统突出显示图像中最重要感兴趣区域的注意力图后筛查眼底图像(ARIA解释)。将每种条件下眼科医生的分类结果与ARIA系统的结果,与通过咨询并汇总3位视网膜专家对每张眼底图像的意见所生成的金标准进行比较。
对于国际患者病例,ARIA系统能够将可转诊与不可转诊病例进行分类,其受试者操作特征曲线下面积为98%,敏感性为95.1%,特异性为91.5%。对于墨西哥患者病例,受试者操作特征曲线下面积为98.3%,敏感性为95.2%,特异性为90%。ARIA系统的性能比参与该研究的17名眼科医生的平均性能更出色。此外,结果表明ARIA系统可作为一种辅助工具,因为在眼科医生在自己分类之前接触ARIA系统结果的实验条件下,敏感性显著更高(93.3%),而在参与者独立评估图像的条件下敏感性为87.3%(P = 0.05)。
这些结果表明,对于墨西哥等拉丁美洲国家而言,ARIA系统无论是独立使用还是辅助使用,都为扩大糖尿病相关失明早期检测的监测能力提供了重要契机。