Suppr超能文献

基于明确标注标准和临床医生间一致性评估的卷积神经网络肾小球分类

Glomerular Classification Using Convolutional Neural Networks Based on Defined Annotation Criteria and Concordance Evaluation Among Clinicians.

作者信息

Yamaguchi Ryohei, Kawazoe Yoshimasa, Shimamoto Kiminori, Shinohara Emiko, Tsukamoto Tatsuo, Shintani-Domoto Yukako, Nagasu Hajime, Uozaki Hiroshi, Ushiku Tetsuo, Nangaku Masaomi, Kashihara Naoki, Shimizu Akira, Nagata Michio, Ohe Kazuhiko

机构信息

Artificial Intelligence in Healthcare, Graduate School of Medicine, Faculty of Medicine, The University of Tokyo, Tokyo, Japan.

Department of Nephrology and Dialysis, Tazuke Kofukai Medical Research Institute, Kitano Hospital, Osaka, Japan.

出版信息

Kidney Int Rep. 2020 Dec 13;6(3):716-726. doi: 10.1016/j.ekir.2020.11.037. eCollection 2021 Mar.

Abstract

INTRODUCTION

Diagnosing renal pathologies is important for performing treatments. However, classifying every glomerulus is difficult for clinicians; thus, a support system, such as a computer, is required. This paper describes the automatic classification of glomerular images using a convolutional neural network (CNN).

METHOD

To generate appropriate labeled data, annotation criteria including 12 features (e.g., "fibrous crescent") were defined. The concordance among 5 clinicians was evaluated for 100 images using the kappa (κ) coefficient for each feature. Using the annotation criteria, 1 clinician annotated 10,102 images. We trained the CNNs to classify the features with an average κ ≥0.4 and evaluated their performance using the receiver operating characteristic-area under the curve (ROC-AUC). An error analysis was conducted and the gradient-weighted class activation mapping (Grad-CAM) was also applied; it expresses the CNN's focusing point with a heat map when the CNN classifies the glomerular image for a feature.

RESULTS

The average κ coefficient of the features ranged from 0.28 to 0.50. The ROC-AUC of the CNNs for test data varied from 0.65 to 0.98. Among the features, "capillary collapse" and "fibrous crescent" had high ROC-AUC values of 0.98 and 0.91, respectively. The error analysis and the Grad-CAM visually showed that the CNN could not distinguish between 2 different features that had similar visual structures or that occurred simultaneously.

CONCLUSION

The differences in the texture or frequency of the co-occurrence between the different features affected the CNN performance; thus, to improve the classification accuracy, methods such as segmentation are required.

摘要

引言

诊断肾脏病理对于实施治疗很重要。然而,对临床医生来说,对每个肾小球进行分类很困难;因此,需要计算机等支持系统。本文描述了使用卷积神经网络(CNN)对肾小球图像进行自动分类。

方法

为了生成合适的标注数据,定义了包括12个特征(如“纤维性新月体”)的标注标准。使用kappa(κ)系数对100张图像评估了5位临床医生之间的一致性。根据标注标准,1位临床医生标注了10102张图像。我们训练CNN对平均κ≥0.4的特征进行分类,并使用曲线下面积的受试者工作特征(ROC-AUC)评估其性能。进行了误差分析,并应用了梯度加权类激活映射(Grad-CAM);当CNN对肾小球图像的一个特征进行分类时,它用热图表示CNN的聚焦点。

结果

特征的平均κ系数范围为0.28至0.50。CNN对测试数据的ROC-AUC在0.65至0.98之间变化。在这些特征中,“毛细血管塌陷”和“纤维性新月体”的ROC-AUC值较高,分别为0.98和0.91。误差分析和Grad-CAM直观地表明,CNN无法区分具有相似视觉结构或同时出现的两种不同特征。

结论

不同特征之间纹理或共现频率的差异影响了CNN的性能;因此,为了提高分类准确率,需要分割等方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d8fd/7938073/14b66fb76cd1/gr1a.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验