Zhou Kang, Gu Zaiwang, Liu Wen, Luo Weixin, Cheng Jun, Gao Shenghua, Liu Jiang
Annu Int Conf IEEE Eng Med Biol Soc. 2018 Jul;2018:2724-2727. doi: 10.1109/EMBC.2018.8512828.
Diabetic Retinopathy (DR) is a non-negligible eye disease among patients with Diabetes Mellitus, and automatic retinal image analysis algorithm for the DR screening is in high demand. Considering the resolution of retinal image is very high, where small pathological tissues can be detected only with large resolution image and large local receptive field are required to identify those late stage disease, but directly training a neural network with very deep architecture and high resolution image is both time computational expensive and difficult because of gradient vanishing/exploding problem, we propose a Multi-Cell architecture which gradually increases the depth of deep neural network and the resolution of input image, which both boosts the training time but also improves the classification accuracy. Further, considering the different stages of DR actually progress gradually, which means the labels of different stages are related. To considering the relationships of images with different stages, we propose a Multi-Task learning strategy which predicts the label with both classification and regression. Experimental results on the Kaggle dataset show that our method achieves a Kappa of 0.841 on test set which is the 4th rank of all state-of-the-arts methods. Further, our Multi-Cell Multi-Task Convolutional Neural Networks (MCNN) solution is a general framework, which can be readily integrated with many other deep neural network architectures.
糖尿病性视网膜病变(DR)是糖尿病患者中一种不可忽视的眼部疾病,因此对用于DR筛查的自动视网膜图像分析算法有很高的需求。考虑到视网膜图像的分辨率非常高,只有大分辨率图像才能检测到小的病理组织,并且需要大的局部感受野来识别那些晚期疾病,但是直接用非常深的架构和高分辨率图像训练神经网络既耗时又计算成本高,而且由于梯度消失/爆炸问题而困难重重,我们提出了一种多单元架构,它逐渐增加深度神经网络的深度和输入图像的分辨率,这既提高了训练时间又提高了分类准确率。此外,考虑到DR的不同阶段实际上是逐渐进展的,这意味着不同阶段的标签是相关的。为了考虑不同阶段图像之间的关系,我们提出了一种多任务学习策略,通过分类和回归来预测标签。在Kaggle数据集上的实验结果表明,我们的方法在测试集上的Kappa值为0.841,在所有最先进的方法中排名第四。此外,我们的多单元多任务卷积神经网络(MCNN)解决方案是一个通用框架,可以很容易地与许多其他深度神经网络架构集成。