Tam Ka Ho, Soares Maria F, Kers Jesper, Sharples Edward J, Ploeg Rutger J, Kaisar Maria, Rittscher Jens
Institute of Biomedical Engineering, University of Oxford, Oxford, United Kingdom.
Department of Cellular Pathology, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford, United Kingdom.
Front Transplant. 2024 Apr 3;3:1305468. doi: 10.3389/frtra.2024.1305468. eCollection 2024.
Two common obstacles limiting the performance of data-driven algorithms in digital histopathology classification tasks are the lack of expert annotations and the narrow diversity of datasets. Multi-instance learning (MIL) can address the former challenge for the analysis of whole slide images (WSI), but performance is often inferior to full supervision. We show that the inclusion of weak annotations can significantly enhance the effectiveness of MIL while keeping the approach scalable. An analysis framework was developed to process periodic acid-Schiff (PAS) and Sirius Red (SR) slides of renal biopsies. The workflow segments tissues into coarse tissue classes. Handcrafted and deep features were extracted from these tissues and combined using a soft attention model to predict several slide-level labels: delayed graft function (DGF), acute tubular injury (ATI), and Remuzzi grade components. A tissue segmentation quality metric was also developed to reduce the adverse impact of poorly segmented instances. The soft attention model was trained using 5-fold cross-validation on a mixed dataset and tested on the QUOD dataset containing PAS and SR biopsies. The average ROC-AUC over different prediction tasks was found to be , significantly higher than using only ResNet50 ( ), only handcrafted features ( ), and the baseline ( ) of state-of-the-art performance. In conjunction with soft attention, weighting tissues by segmentation quality has led to further improvement . Using an intuitive visualisation scheme, we show that our approach may also be used to support clinical decision making as it allows pinpointing individual tissues relevant to the predictions.
限制数据驱动算法在数字组织病理学分类任务中性能的两个常见障碍是缺乏专家注释和数据集多样性狭窄。多实例学习(MIL)可以解决全切片图像(WSI)分析中的前一个挑战,但性能通常不如完全监督。我们表明,包含弱注释可以显著提高MIL的有效性,同时保持该方法的可扩展性。开发了一个分析框架来处理肾活检的过碘酸希夫(PAS)和天狼星红(SR)切片。该工作流程将组织分割为粗略的组织类别。从这些组织中提取手工制作的特征和深度特征,并使用软注意力模型进行组合,以预测几个切片级标签:移植肾功能延迟(DGF)、急性肾小管损伤(ATI)和雷穆齐分级成分。还开发了一种组织分割质量度量,以减少分割不佳实例的不利影响。软注意力模型在混合数据集上使用5折交叉验证进行训练,并在包含PAS和SR活检的QUOD数据集上进行测试。发现不同预测任务的平均ROC-AUC为 ,显著高于仅使用ResNet50( )、仅使用手工制作的特征( )和最先进性能的基线( )。结合软注意力,按分割质量对组织进行加权可进一步提高 。使用直观的可视化方案,我们表明我们的方法还可用于支持临床决策,因为它允许确定与预测相关的单个组织。