Burlina Philippe M, Joshi Neil, Pekala Michael, Pacheco Katia D, Freund David E, Bressler Neil M
The Johns Hopkins University Applied Physics Laboratory, Laurel, Maryland.
Retina Division, Brazilian Center of Vision Eye Hospital, Basilia, DF, Brazil.
JAMA Ophthalmol. 2017 Nov 1;135(11):1170-1176. doi: 10.1001/jamaophthalmol.2017.3782.
Age-related macular degeneration (AMD) affects millions of people throughout the world. The intermediate stage may go undetected, as it typically is asymptomatic. However, the preferred practice patterns for AMD recommend identifying individuals with this stage of the disease to educate how to monitor for the early detection of the choroidal neovascular stage before substantial vision loss has occurred and to consider dietary supplements that might reduce the risk of the disease progressing from the intermediate to the advanced stage. Identification, though, can be time-intensive and requires expertly trained individuals.
To develop methods for automatically detecting AMD from fundus images using a novel application of deep learning methods to the automated assessment of these images and to leverage artificial intelligence advances.
DESIGN, SETTING, AND PARTICIPANTS: Deep convolutional neural networks that are explicitly trained for performing automated AMD grading were compared with an alternate deep learning method that used transfer learning and universal features and with a trained clinical grader. Age-related macular degeneration automated detection was applied to a 2-class classification problem in which the task was to distinguish the disease-free/early stages from the referable intermediate/advanced stages. Using several experiments that entailed different data partitioning, the performance of the machine algorithms and human graders in evaluating over 130 000 images that were deidentified with respect to age, sex, and race/ethnicity from 4613 patients against a gold standard included in the National Institutes of Health Age-related Eye Disease Study data set was evaluated.
Accuracy, receiver operating characteristics and area under the curve, and kappa score.
The deep convolutional neural network method yielded accuracy (SD) that ranged between 88.4% (0.5%) and 91.6% (0.1%), the area under the receiver operating characteristic curve was between 0.94 and 0.96, and kappa coefficient (SD) between 0.764 (0.010) and 0.829 (0.003), which indicated a substantial agreement with the gold standard Age-related Eye Disease Study data set.
Applying a deep learning-based automated assessment of AMD from fundus images can produce results that are similar to human performance levels. This study demonstrates that automated algorithms could play a role that is independent of expert human graders in the current management of AMD and could address the costs of screening or monitoring, access to health care, and the assessment of novel treatments that address the development or progression of AMD.
年龄相关性黄斑变性(AMD)影响着全球数百万人。中间阶段可能未被察觉,因为它通常没有症状。然而,AMD的首选实践模式建议识别处于该疾病阶段的个体,以指导如何在视力大幅丧失之前监测脉络膜新生血管阶段的早期发现,并考虑使用可能降低疾病从中期进展到晚期风险的膳食补充剂。然而,识别过程可能耗时且需要经过专业培训的人员。
利用深度学习方法在这些图像自动评估中的新应用,开发从眼底图像自动检测AMD的方法,并利用人工智能的进展。
设计、设置和参与者:将经过明确训练以执行自动AMD分级的深度卷积神经网络与使用迁移学习和通用特征的另一种深度学习方法以及经过训练的临床分级人员进行比较。年龄相关性黄斑变性自动检测应用于一个二分类问题,其任务是将无疾病/早期阶段与可参考的中期/晚期阶段区分开来。通过进行涉及不同数据划分的多个实验,评估了机器算法和人类分级人员在评估来自4613名患者的超过130000张图像(这些图像在年龄、性别和种族/民族方面进行了去识别处理)时相对于美国国立卫生研究院年龄相关性眼病研究数据集所包含的金标准的性能。
准确率、受试者操作特征曲线及曲线下面积和kappa评分。
深度卷积神经网络方法的准确率(标准差)在88.4%(0.5%)至91.6%(0.1%)之间,受试者操作特征曲线下面积在0.94至0.96之间,kappa系数(标准差)在0.764(0.010)至0.829(0.003)之间,这表明与金标准年龄相关性眼病研究数据集有实质性一致性。
应用基于深度学习的眼底图像AMD自动评估可以产生与人类表现水平相似的结果。这项研究表明,自动算法在当前AMD管理中可以发挥独立于专业人类分级人员的作用,并可以解决筛查或监测成本、医疗保健可及性以及评估针对AMD发展或进展的新治疗方法等问题。