Department of Pathology and Immunology, Washington University School of Medicine in St Louis, St Louis, Missouri.
Institute for Informatics (I 2 ), Washington University School of Medicine in St Louis, St Louis, Missouri.
JAMA Netw Open. 2021 Jan 4;4(1):e2030939. doi: 10.1001/jamanetworkopen.2020.30939.
A chronic shortage of donor kidneys is compounded by a high discard rate, and this rate is directly associated with biopsy specimen evaluation, which shows poor reproducibility among pathologists. A deep learning algorithm for measuring percent global glomerulosclerosis (an important predictor of outcome) on images of kidney biopsy specimens could enable pathologists to more reproducibly and accurately quantify percent global glomerulosclerosis, potentially saving organs that would have been discarded.
To compare the performances of pathologists with a deep learning model on quantification of percent global glomerulosclerosis in whole-slide images of donor kidney biopsy specimens, and to determine the potential benefit of a deep learning model on organ discard rates.
DESIGN, SETTING, AND PARTICIPANTS: This prognostic study used whole-slide images acquired from 98 hematoxylin-eosin-stained frozen and 51 permanent donor biopsy specimen sections retrieved from 83 kidneys. Serial annotation by 3 board-certified pathologists served as ground truth for model training and for evaluation. Images of kidney biopsy specimens were obtained from the Washington University database (retrieved between June 2015 and June 2017). Cases were selected randomly from a database of more than 1000 cases to include biopsy specimens representing an equitable distribution within 0% to 5%, 6% to 10%, 11% to 15%, 16% to 20%, and more than 20% global glomerulosclerosis.
Correlation coefficient (r) and root-mean-square error (RMSE) with respect to annotations were computed for cross-validated model predictions and on-call pathologists' estimates of percent global glomerulosclerosis when using individual and pooled slide results. Data were analyzed from March 2018 to August 2020.
The cross-validated model results of section images retrieved from 83 donor kidneys showed higher correlation with annotations (r = 0.916; 95% CI, 0.886-0.939) than on-call pathologists (r = 0.884; 95% CI, 0.825-0.923) that was enhanced when pooling glomeruli counts from multiple levels (r = 0.933; 95% CI, 0.898-0.956). Model prediction error for single levels (RMSE, 5.631; 95% CI, 4.735-6.517) was 14% lower than on-call pathologists (RMSE, 6.523; 95% CI, 5.191-7.783), improving to 22% with multiple levels (RMSE, 5.094; 95% CI, 3.972-6.301). The model decreased the likelihood of unnecessary organ discard by 37% compared with pathologists.
The findings of this prognostic study suggest that this deep learning model provided a scalable and robust method to quantify percent global glomerulosclerosis in whole-slide images of donor kidneys. The model performance improved by analyzing multiple levels of a section, surpassing the capacity of pathologists in the time-sensitive setting of examining donor biopsy specimens. The results indicate the potential of a deep learning model to prevent erroneous donor organ discard.
供体肾脏的慢性短缺加上高淘汰率,而淘汰率与活检标本评估直接相关,活检标本评估显示病理学家之间的评估结果重复性差。一种用于测量肾脏活检标本图像中肾小球全球硬化程度(预后的重要预测指标)的深度学习算法,可以使病理学家更准确地、可重复地定量肾小球全球硬化程度,从而可能挽救原本会被丢弃的器官。
比较病理学家与深度学习模型在全切片图像中定量供体肾脏活检标本肾小球全球硬化程度的表现,并确定深度学习模型对器官淘汰率的潜在益处。
设计、设置和参与者:这项预后研究使用了从华盛顿大学数据库(2015 年 6 月至 2017 年 6 月期间检索)中获取的 98 张苏木精-伊红染色冷冻和 51 张永久性供体活检标本切片的全幻灯片图像。3 位经过董事会认证的病理学家的连续注释作为模型训练和评估的真实数据。从一个超过 1000 例的数据库中随机选择病例,以包括在 0%至 5%、6%至 10%、11%至 15%、16%至 20%和超过 20%的肾小球全球硬化程度内具有公平分布的活检标本。
对于使用单个和多个幻灯片结果的模型预测和调用病理学家对肾小球全球硬化程度的估计,计算交叉验证模型预测与注释之间的相关系数(r)和均方根误差(RMSE)。数据分析于 2018 年 3 月至 2020 年 8 月进行。
从 83 个供体肾脏中检索的切片图像的交叉验证模型结果与注释的相关性更高(r=0.916;95%置信区间,0.886-0.939),而调用病理学家的相关性较低(r=0.884;95%置信区间,0.825-0.923),当从多个水平汇总肾小球计数时,相关性增强(r=0.933;95%置信区间,0.898-0.956)。单个水平的模型预测误差(RMSE,5.631;95%置信区间,4.735-6.517)比调用病理学家低 14%(RMSE,6.523;95%置信区间,5.191-7.783),多个水平的预测误差(RMSE,5.094;95%置信区间,3.972-6.301)则降低了 22%。与病理学家相比,该模型使不必要的器官淘汰的可能性降低了 37%。
这项预后研究的结果表明,这种深度学习模型为定量供体肾脏全切片图像中的肾小球全球硬化程度提供了一种可扩展且强大的方法。通过分析切片的多个水平,模型的性能得到了提高,超过了病理学家在检查供体活检标本时的时间敏感性设置。结果表明,深度学习模型有可能防止错误地丢弃供体器官。