Yamaguchi Ryohei, Kawazoe Yoshimasa, Shimamoto Kiminori, Shinohara Emiko, Tsukamoto Tatsuo, Shintani-Domoto Yukako, Nagasu Hajime, Uozaki Hiroshi, Ushiku Tetsuo, Nangaku Masaomi, Kashihara Naoki, Shimizu Akira, Nagata Michio, Ohe Kazuhiko
Artificial Intelligence in Healthcare, Graduate School of Medicine, Faculty of Medicine, The University of Tokyo, Tokyo, Japan.
Department of Nephrology and Dialysis, Tazuke Kofukai Medical Research Institute, Kitano Hospital, Osaka, Japan.
Kidney Int Rep. 2020 Dec 13;6(3):716-726. doi: 10.1016/j.ekir.2020.11.037. eCollection 2021 Mar.
Diagnosing renal pathologies is important for performing treatments. However, classifying every glomerulus is difficult for clinicians; thus, a support system, such as a computer, is required. This paper describes the automatic classification of glomerular images using a convolutional neural network (CNN).
To generate appropriate labeled data, annotation criteria including 12 features (e.g., "fibrous crescent") were defined. The concordance among 5 clinicians was evaluated for 100 images using the kappa (κ) coefficient for each feature. Using the annotation criteria, 1 clinician annotated 10,102 images. We trained the CNNs to classify the features with an average κ ≥0.4 and evaluated their performance using the receiver operating characteristic-area under the curve (ROC-AUC). An error analysis was conducted and the gradient-weighted class activation mapping (Grad-CAM) was also applied; it expresses the CNN's focusing point with a heat map when the CNN classifies the glomerular image for a feature.
The average κ coefficient of the features ranged from 0.28 to 0.50. The ROC-AUC of the CNNs for test data varied from 0.65 to 0.98. Among the features, "capillary collapse" and "fibrous crescent" had high ROC-AUC values of 0.98 and 0.91, respectively. The error analysis and the Grad-CAM visually showed that the CNN could not distinguish between 2 different features that had similar visual structures or that occurred simultaneously.
The differences in the texture or frequency of the co-occurrence between the different features affected the CNN performance; thus, to improve the classification accuracy, methods such as segmentation are required.
诊断肾脏病理对于实施治疗很重要。然而,对临床医生来说,对每个肾小球进行分类很困难;因此,需要计算机等支持系统。本文描述了使用卷积神经网络(CNN)对肾小球图像进行自动分类。
为了生成合适的标注数据,定义了包括12个特征(如“纤维性新月体”)的标注标准。使用kappa(κ)系数对100张图像评估了5位临床医生之间的一致性。根据标注标准,1位临床医生标注了10102张图像。我们训练CNN对平均κ≥0.4的特征进行分类,并使用曲线下面积的受试者工作特征(ROC-AUC)评估其性能。进行了误差分析,并应用了梯度加权类激活映射(Grad-CAM);当CNN对肾小球图像的一个特征进行分类时,它用热图表示CNN的聚焦点。
特征的平均κ系数范围为0.28至0.50。CNN对测试数据的ROC-AUC在0.65至0.98之间变化。在这些特征中,“毛细血管塌陷”和“纤维性新月体”的ROC-AUC值较高,分别为0.98和0.91。误差分析和Grad-CAM直观地表明,CNN无法区分具有相似视觉结构或同时出现的两种不同特征。
不同特征之间纹理或共现频率的差异影响了CNN的性能;因此,为了提高分类准确率,需要分割等方法。