Department of Ophthalmology, Taipei Veterans General Hospital, Taipei, Taiwan.
Faculty of Medicine, National Yang-Ming University School of Medicine, Taipei, Taiwan.
PLoS One. 2020 May 14;15(5):e0233079. doi: 10.1371/journal.pone.0233079. eCollection 2020.
To evaluate ways to improve the generalizability of a deep learning algorithm for identifying glaucomatous optic neuropathy (GON) using a limited number of fundus photographs, as well as the key features being used for classification.
A total of 944 fundus images from Taipei Veterans General Hospital (TVGH) were retrospectively collected. Clinical and demographic characteristics, including structural and functional measurements of the images with GON, were recorded. Transfer learning based on VGGNet was used to construct a convolutional neural network (CNN) to identify GON. To avoid missing cases with advanced GON, an ensemble model was adopted in which a support vector machine classifier would make final classification based on cup-to-disc ratio if the CNN classifier had low-confidence score. The CNN classifier was first established using TVGH dataset, and then fine-tuned by combining the training images of TVGH and Drishti-GS datasets. Class activation map (CAM) was used to identify key features used for CNN classification. Performance of each classifier was determined through area under receiver operating characteristic curve (AUC) and compared with the ensemble model by diagnostic accuracy.
In 187 TVGH test images, the accuracy, sensitivity, and specificity of the CNN classifier were 95.0%, 95.7%, and 94.2%, respectively, and the AUC was 0.992 compared to the 92.8% accuracy rate of the ensemble model. For the Drishti-GS test images, the accuracy of the CNN, the fine-tuned CNN and ensemble model was 33.3%, 80.3%, and 80.3%, respectively. The CNN classifier did not misclassify images with moderate to severe diseases. Class-discriminative regions revealed by CAM co-localized with known characteristics of GON.
The ensemble model or a fine-tuned CNN classifier may be potential designs to build a generalizable deep learning model for glaucoma detection when large image databases are not available.
评估使用有限数量的眼底照片提高深度学习算法识别青光眼视神经病变(GON)的泛化能力的方法,以及用于分类的关键特征。
回顾性收集了台北荣民总医院(TVGH)的 944 张眼底图像。记录了临床和人口统计学特征,包括具有 GON 的图像的结构和功能测量。使用基于 VGGNet 的迁移学习构建卷积神经网络(CNN)来识别 GON。为了避免错过晚期 GON 的病例,采用集成模型,如果 CNN 分类器的置信得分较低,则支持向量机分类器将根据杯盘比进行最终分类。首先使用 TVGH 数据集建立 CNN 分类器,然后通过结合 TVGH 和 Drishti-GS 数据集的训练图像进行微调。使用类激活图(CAM)识别用于 CNN 分类的关键特征。通过接收者操作特征曲线下面积(AUC)确定每个分类器的性能,并通过诊断准确性与集成模型进行比较。
在 187 张 TVGH 测试图像中,CNN 分类器的准确率、敏感度和特异性分别为 95.0%、95.7%和 94.2%,AUC 为 0.992,而集成模型的准确率为 92.8%。对于 Drishti-GS 测试图像,CNN、微调 CNN 和集成模型的准确率分别为 33.3%、80.3%和 80.3%。CNN 分类器不会错误分类中度至重度疾病的图像。CAM 揭示的类区分区域与 GON 的已知特征重合。
当没有大型图像数据库时,集成模型或微调 CNN 分类器可能是构建用于青光眼检测的可泛化深度学习模型的潜在设计。