Shashiprakash Avinash Kammardi, Lutnick Brendon, Ginley Brandon, Govind Darshana, Lucarelli Nicholas, Jen Kuang-Yu, Rosenberg Avi Z, Urisman Anatoly, Walavalkar Vighnesh, Zuckerman Jonathan E, Delsante Marco, Bissonnette Mei Lin Z, Tomaszewski John E, Manthey David, Sarder Pinaki
Department of Biomedical Engineering, University at Buffalo - The State University of New York.
Department of Pathology and Anatomical Sciences, University at Buffalo - The State University of New York.
Proc SPIE Int Soc Opt Eng. 2021 Feb;11603. doi: 10.1117/12.2581789. Epub 2021 Feb 15.
Histologic examination of interstitial fibrosis and tubular atrophy (IFTA) is critical to determine the extent of irreversible kidney injury in renal disease. The current clinical standard involves pathologist's visual assessment of IFTA, which is prone to inter-observer variability. To address this diagnostic variability, we designed two case studies (CSs), including seven pathologists, using HistomicsTK- a distributed system developed by Kitware Inc. (Clifton Park, NY). Twenty-five whole slide images (WSIs) were classified into a training set of 21 and a validation set of four. The training set was composed of seven unique subsets, each provided to an individual pathologist along with four common WSIs from the validation set. In CS 1, all pathologists individually annotated IFTA in their respective slides. These annotations were then used to train a deep learning algorithm to computationally segment IFTA. In CS 2, manual and computational annotations from CS 1 were first reviewed by the annotators to improve concordance of IFTA annotation. Both the manual and computational annotation processes were then repeated as in CS1. The inter-observer concordance in the validation set was measured by Krippendorff's alpha (KA). The KA for the seven pathologists in CS1 was 0.62 with CI [0.57, 0.67], and after reviewing each other's annotations in CS2, 0.66 with CI [0.60, 0.72]. The respective CS1 and CS2 KA were 0.58 with CI [0.52, 0.64] and 0.63 with CI [0.56, 0.69] when including the deep learner as an eighth annotator. These results suggest that our designed annotation framework refines agreement of spatial annotation of IFTA and demonstrates a human-AI approach to significantly improve the development of computational models.
间质纤维化和肾小管萎缩(IFTA)的组织学检查对于确定肾脏疾病中不可逆肾损伤的程度至关重要。当前的临床标准涉及病理学家对IFTA的视觉评估,这容易出现观察者间的差异。为了解决这种诊断差异,我们设计了两个案例研究(CSs),包括七位病理学家,使用了由Kitware公司(纽约州克利夫顿帕克)开发的分布式系统HistomicsTK。二十五张全切片图像(WSIs)被分为一个包含21张图像的训练集和一个包含4张图像的验证集。训练集由七个独特的子集组成,每个子集与来自验证集的四张常见WSIs一起提供给一位病理学家。在案例研究1中,所有病理学家分别在各自的切片上标注IFTA。然后,这些标注被用于训练一种深度学习算法,以通过计算分割IFTA。在案例研究2中,案例研究1中的手动和计算标注首先由标注者进行审查,以提高IFTA标注的一致性。然后,手动和计算标注过程都像在案例研究1中一样重复进行。验证集中观察者间的一致性通过Krippendorff's alpha(KA)进行测量。案例研究1中七位病理学家的KA为0.62,置信区间为[0.57, 0.67],在案例研究2中相互审查彼此的标注后,KA为0.66,置信区间为[0.60, 0.72]。当将深度学习器作为第八位标注者纳入时,案例研究1和案例研究2各自的KA分别为0.58,置信区间为[0.52, 0.64]和0.63,置信区间为[0.56, 0.69]。这些结果表明,我们设计的标注框架优化了IFTA空间标注的一致性,并展示了一种人机结合的方法,可显著改善计算模型的开发。