Department of Statistics, University of Oxford, Oxford, UK.
VUNO Inc., 6F, 507, Gangnam-daero, Seocho-gu, Seoul, Republic of Korea.
J Digit Imaging. 2018 Dec;31(6):923-928. doi: 10.1007/s10278-018-0099-2.
In this paper, we aimed to understand and analyze the outputs of a convolutional neural network model that classifies the laterality of fundus images. Our model not only automatizes the classification process, which results in reducing the labors of clinicians, but also highlights the key regions in the image and evaluates the uncertainty for the decision with proper analytic tools. Our model was trained and tested with 25,911 fundus images (43.4% of macula-centered images and 28.3% each of superior and nasal retinal fundus images). Also, activation maps were generated to mark important regions in the image for the classification. Then, uncertainties were quantified to support explanations as to why certain images were incorrectly classified under the proposed model. Our model achieved a mean training accuracy of 99%, which is comparable to the performance of clinicians. Strong activations were detected at the location of optic disc and retinal blood vessels around the disc, which matches to the regions that clinicians attend when deciding the laterality. Uncertainty analysis discovered that misclassified images tend to accompany with high prediction uncertainties and are likely ungradable. We believe that visualization of informative regions and the estimation of uncertainty, along with presentation of the prediction result, would enhance the interpretability of neural network models in a way that clinicians can be benefitted from using the automatic classification system.
在本文中,我们旨在理解和分析一个用于对眼底图像进行侧别分类的卷积神经网络模型的输出。我们的模型不仅实现了分类过程的自动化,从而减少了临床医生的工作量,还突出了图像中的关键区域,并通过适当的分析工具评估了决策的不确定性。我们的模型使用了 25911 张眼底图像进行训练和测试(43.4%的图像以黄斑为中心,28.3%的图像分别为视网膜上方和鼻侧)。此外,还生成了激活图以标记图像中用于分类的重要区域。然后,对不确定性进行量化,以支持对为什么某些图像在提出的模型下被错误分类的解释。我们的模型的平均训练准确率为 99%,与临床医生的表现相当。在视盘和视盘周围的视网膜血管的位置检测到了强烈的激活,这与临床医生在决定侧别时关注的区域相吻合。不确定性分析发现,错误分类的图像往往伴随着较高的预测不确定性,并且可能无法分级。我们相信,信息区域的可视化和不确定性的估计,以及预测结果的呈现,将以一种使临床医生受益于使用自动分类系统的方式提高神经网络模型的可解释性。