Jiang Hongyang, Xu Jie, Shi Rongjie, Yang Kang, Zhang Dongdong, Gao Mengdi, Ma He, Qian Wei
Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:1560-1563. doi: 10.1109/EMBC44109.2020.9175884.
The characteristics of diabetic retinopathy (DR) fundus images generally consist of multiple types of lesions which provided strong evidence for the ophthalmologists to make diagnosis. It is particularly significant to figure out an efficient method to not only accurately classify DR fundus images but also recognize all kinds of lesions on them. In this paper, a deep learning-based multi-label classification model with Gradient-weighted Class Activation Mapping (Grad-CAM) was proposed, which can both make DR classification and automatically locate the regions of different lesions. To reducing laborious annotation work and improve the efficiency of labeling, this paper innovatively considered different types of lesions as different labels for a fundus image so that this paper changed the task of lesion detection into that of image classification. A total of five labels were pre-defined and 3228 fundus images were collected for developing our model. The architecture of deep learning model was designed by ourselves based on ResNet. Through experiments on the test images, this method acquired a sensitive of 93.9% and a specificity of 94.4% on DR classification. Moreover, the corresponding regions of lesions were reasonably outlined on the DR fundus images.
糖尿病视网膜病变(DR)眼底图像的特征通常由多种类型的病变组成,这些病变为眼科医生进行诊断提供了有力证据。找到一种高效的方法,既能准确分类DR眼底图像,又能识别其上的各种病变,这一点尤为重要。本文提出了一种基于深度学习的多标签分类模型,该模型结合了梯度加权类激活映射(Grad-CAM),既能进行DR分类,又能自动定位不同病变区域。为了减少繁琐的标注工作并提高标注效率,本文创新性地将不同类型的病变视为眼底图像的不同标签,从而将病变检测任务转变为图像分类任务。总共预定义了五个标签,并收集了3228张眼底图像来开发我们的模型。深度学习模型的架构是我们基于ResNet自行设计的。通过对测试图像的实验,该方法在DR分类上的灵敏度为93.9%,特异性为94.4%。此外,在DR眼底图像上合理地勾勒出了病变的相应区域。