基于计算病理学的全切片分类的弱监督深度学习管道的基准测试。

Benchmarking weakly-supervised deep learning pipelines for whole slide classification in computational pathology.

机构信息

Department of Medicine III, University Hospital RWTH Aachen, Aachen, Germany.

Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.

出版信息

Med Image Anal. 2022 Jul;79:102474. doi: 10.1016/j.media.2022.102474. Epub 2022 May 4.

DOI:10.1016/j.media.2022.102474

PMID:35588568

Abstract

Artificial intelligence (AI) can extract visual information from histopathological slides and yield biological insight and clinical biomarkers. Whole slide images are cut into thousands of tiles and classification problems are often weakly-supervised: the ground truth is only known for the slide, not for every single tile. In classical weakly-supervised analysis pipelines, all tiles inherit the slide label while in multiple-instance learning (MIL), only bags of tiles inherit the label. However, it is still unclear how these widely used but markedly different approaches perform relative to each other. We implemented and systematically compared six methods in six clinically relevant end-to-end prediction tasks using data from N=2980 patients for training with rigorous external validation. We tested three classical weakly-supervised approaches with convolutional neural networks and vision transformers (ViT) and three MIL-based approaches with and without an additional attention module. Our results empirically demonstrate that histological tumor subtyping of renal cell carcinoma is an easy task in which all approaches achieve an area under the receiver operating curve (AUROC) of above 0.9. In contrast, we report significant performance differences for clinically relevant tasks of mutation prediction in colorectal, gastric, and bladder cancer. In these mutation prediction tasks, classical weakly-supervised workflows outperformed MIL-based weakly-supervised methods for mutation prediction, which is surprising given their simplicity. This shows that new end-to-end image analysis pipelines in computational pathology should be compared to classical weakly-supervised methods. Also, these findings motivate the development of new methods which combine the elegant assumptions of MIL with the empirically observed higher performance of classical weakly-supervised approaches. We make all source codes publicly available at https://github.com/KatherLab/HIA, allowing easy application of all methods to any similar task.

摘要

人工智能 (AI) 可以从组织病理学幻灯片中提取视觉信息，并提供生物学见解和临床生物标志物。全切片图像被切成数千个瓦片，分类问题通常是弱监督的：只有幻灯片有真实标签，而不是每个瓦片都有。在经典的弱监督分析管道中，所有瓦片都继承幻灯片标签，而在多实例学习 (MIL) 中，只有瓦片袋继承标签。然而，目前尚不清楚这些广泛使用但明显不同的方法彼此之间的表现如何。我们使用来自 2980 名患者的数据实现并系统比较了六种方法在六个临床相关的端到端预测任务中的表现，使用严格的外部验证进行训练。我们测试了三种基于卷积神经网络和视觉转换器 (ViT) 的经典弱监督方法以及三种基于 MIL 的方法，其中包括和不包括额外的注意力模块。我们的结果从经验上证明，肾细胞癌的组织学肿瘤亚型分类是一项简单的任务，所有方法的接收者操作特征曲线 (AUROC) 都在 0.9 以上。相比之下，我们报告了在结直肠癌、胃癌和膀胱癌的突变预测等临床相关任务中存在显著的性能差异。在这些突变预测任务中，经典的弱监督工作流程优于基于 MIL 的弱监督方法，这令人惊讶，因为它们很简单。这表明计算病理学中的新端到端图像分析管道应该与经典的弱监督方法进行比较。此外，这些发现促使开发新的方法，这些方法将 MIL 的优雅假设与经典弱监督方法观察到的更高性能相结合。我们在 https://github.com/KatherLab/HIA 上公开了所有的源代码，允许将所有方法轻松应用于任何类似的任务。

相似文献

Benchmarking weakly-supervised deep learning pipelines for whole slide classification in computational pathology.基于计算病理学的全切片分类的弱监督深度学习管道的基准测试。

Med Image Anal. 2022 Jul;79:102474. doi: 10.1016/j.media.2022.102474. Epub 2022 May 4.

E2EFP-MIL: End-to-end and high-generalizability weakly supervised deep convolutional network for lung cancer classification from whole slide image.端到端和高泛化能力的弱监督深度卷积网络用于从全幻灯片图像分类肺癌。

Med Image Anal. 2023 Aug;88:102837. doi: 10.1016/j.media.2023.102837. Epub 2023 May 13.

Development and validation of a weakly supervised deep learning framework to predict the status of molecular pathways and key mutations in colorectal cancer from routine histology images: a retrospective study.开发和验证一种弱监督深度学习框架，以从常规组织学图像预测结直肠癌中分子通路和关键突变的状态：一项回顾性研究。

Lancet Digit Health. 2021 Dec;3(12):e763-e772. doi: 10.1016/S2589-7500(21)00180-1. Epub 2021 Oct 19.

Development and validation of artificial intelligence-based prescreening of large-bowel biopsies taken in the UK and Portugal: a retrospective cohort study.基于人工智能的英国和葡萄牙大结肠活检预筛查的开发和验证：一项回顾性队列研究。

Lancet Digit Health. 2023 Nov;5(11):e786-e797. doi: 10.1016/S2589-7500(23)00148-6.

Weakly-supervised learning for lung carcinoma classification using deep learning.基于深度学习的肺癌分类弱监督学习。

Sci Rep. 2020 Jun 9;10(1):9297. doi: 10.1038/s41598-020-66333-x.

Weakly supervised histopathology image segmentation with self-attention.基于自注意力机制的弱监督组织病理学图像分割

Med Image Anal. 2023 May;86:102791. doi: 10.1016/j.media.2023.102791. Epub 2023 Mar 11.

Weakly supervised learning for classification of lung cytological images using attention-based multiple instance learning.基于注意力的多实例学习在肺部细胞学图像分类中的弱监督学习。

Sci Rep. 2021 Oct 13;11(1):20317. doi: 10.1038/s41598-021-99246-4.

Weakly supervised annotation-free cancer detection and prediction of genotype in routine histopathology.无监督标注的癌症检测和常规组织病理学中基因型预测

J Pathol. 2022 Jan;256(1):50-60. doi: 10.1002/path.5800. Epub 2021 Oct 22.

Multi-scale representation attention based deep multiple instance learning for gigapixel whole slide image analysis.基于多尺度表示注意力的深度多重实例学习在千兆像素全幻灯片图像分析中的应用。

Med Image Anal. 2023 Oct;89:102890. doi: 10.1016/j.media.2023.102890. Epub 2023 Jul 8.

Masked hypergraph learning for weakly supervised histopathology whole slide image classification.基于掩蔽超图学习的弱监督病理切片图像分类。

Comput Methods Programs Biomed. 2024 Aug;253:108237. doi: 10.1016/j.cmpb.2024.108237. Epub 2024 May 23.

引用本文的文献

Vision transformer network discovers the prognostic value of pancreatic cancer pathology sections via interpretable risk scores.视觉Transformer网络通过可解释的风险评分发现胰腺癌病理切片的预后价值。

Discov Oncol. 2025 Sep 3;16(1):1679. doi: 10.1007/s12672-025-03547-3.

Applications of artificial intelligence in the analysis of histopathology images of gliomas: a review.人工智能在胶质瘤组织病理学图像分析中的应用：综述

Npj Imaging. 2024 Jul 1;2(1):16. doi: 10.1038/s44303-024-00020-8.

2.5D deep learning radiomics and clinical data for predicting occult lymph node metastasis in lung adenocarcinoma.用于预测肺腺癌隐匿性淋巴结转移的2.5D深度学习影像组学和临床数据

BMC Med Imaging. 2025 Jul 1;25(1):225. doi: 10.1186/s12880-025-01759-1.

Deep Gaussian process with uncertainty estimation for microsatellite instability and immunotherapy response prediction from histology.用于基于组织学的微卫星不稳定性和免疫治疗反应预测、带有不确定性估计的深度高斯过程

NPJ Digit Med. 2025 May 19;8(1):294. doi: 10.1038/s41746-025-01580-8.

Deep learning for fetal inflammatory response diagnosis in the umbilical cord.用于脐带中胎儿炎症反应诊断的深度学习

Placenta. 2025 Jun 26;167:1-10. doi: 10.1016/j.placenta.2025.04.013. Epub 2025 Apr 24.

Multiple instance learning-based prediction of programmed death-ligand 1 (PD-L1) expression from hematoxylin and eosin (H&E)-stained histopathological images in breast cancer.基于多实例学习从乳腺癌苏木精和伊红（H&E）染色的组织病理学图像预测程序性死亡配体1（PD-L1）表达

PeerJ. 2025 Apr 15;13:e19201. doi: 10.7717/peerj.19201. eCollection 2025.

Tumor Bud Classification in Colorectal Cancer Using Attention-Based Deep Multiple Instance Learning and Domain-Specific Foundation Models.基于注意力的深度多实例学习和特定领域基础模型在结直肠癌肿瘤芽分类中的应用

Cancers (Basel). 2025 Apr 7;17(7):1245. doi: 10.3390/cancers17071245.

Computational pathology for breast cancer: Where do we stand for prognostic applications?乳腺癌的计算病理学：在预后应用方面我们处于什么水平？

Breast. 2025 Jun;81:104464. doi: 10.1016/j.breast.2025.104464. Epub 2025 Mar 26.

Deep learning-driven survival prediction in pan-cancer studies by integrating multimodal histology-genomic data.通过整合多模态组织学-基因组数据，在泛癌研究中进行深度学习驱动的生存预测。

Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf121.

Multimodal deep learning: tumor and visceral fat impact on colorectal cancer occult peritoneal metastasis.多模态深度学习：肿瘤和内脏脂肪对结直肠癌隐匿性腹膜转移的影响

Eur Radiol. 2025 Feb 17. doi: 10.1007/s00330-025-11450-2.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于计算病理学的全切片分类的弱监督深度学习管道的基准测试。

Benchmarking weakly-supervised deep learning pipelines for whole slide classification in computational pathology.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献