When multiple instance learning meets foundation models: Advancing histological whole slide image analysis.

Authors

Xu Hongming, Wang Mingkang, Shi Duanbo, Qin Huamin, Zhang Yunpeng, Liu Zaiyi, Madabhushi Anant, Gao Peng, Cong Fengyu, Lu Cheng

Affiliations

Cancer Hospital of Dalian University of Technology, Dalian, China; School of Biomedical Engineering, Faculty of Medicine, Dalian University of Technology, Dalian, China; Key Laboratory of Integrated Circuit and Biomedical Electronic System, Liaoning Province, Dalian University of Technology, Dalian, China; Dalian Key Laboratory of Digital Medicine for Critical Diseases, Dalian University of Technology, Dalian, China.

School of Biomedical Engineering, Faculty of Medicine, Dalian University of Technology, Dalian, China.

Publication information

Med Image Anal. 2025 Apr;101:103456. doi: 10.1016/j.media.2025.103456. Epub 2025 Jan 14.

DOI:10.1016/j.media.2025.103456

PMID:39842326

Abstract

Deep multiple instance learning (MIL) pipelines are the mainstream weakly supervised learning methodologies for whole slide image (WSI) classification. However, it remains unclear how these widely used approaches compare to each other, given the recent proliferation of foundation models (FMs) for patch-level embedding and the diversity of slide-level aggregations. This paper implemented and systematically compared six FMs and six recent MIL methods by organizing different feature extractions and aggregations across seven clinically relevant end-to-end prediction tasks using WSIs from 4044 patients with four different cancer types. We tested state-of-the-art (SOTA) FMs in computational pathology, including CTransPath, PathoDuet, PLIP, CONCH, and UNI, as patch-level feature extractors. Feature aggregators, such as attention-based pooling, transformers, and dynamic graphs were thoroughly tested. Our experiments on cancer grading, biomarker status prediction, and microsatellite instability (MSI) prediction suggest that (1) FMs like UNI, trained with more diverse histological images, outperform generic models with smaller training datasets in patch embeddings, significantly enhancing downstream MIL classification accuracy and model training convergence speed, (2) instance feature fine-tuning, known as online feature re-embedding, to capture both fine-grained details and spatial interactions can often further improve WSI classification performance, (3) FMs advance MIL models by enabling promising grading classifications, biomarker status, and MSI predictions without requiring pixel- or patch-level annotations. These findings encourage the development of advanced, domain-specific FMs, aimed at more universally applicable diagnostic tasks, aligning with the evolving needs of clinical AI in pathology.
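To make the two-stage pipeline in the abstract concrete, the following is a minimal sketch of attention-based MIL pooling, one of the aggregator families the abstract names. It is not the authors' implementation. The function name `attention_mil_pool`, the bag and embedding sizes, and the randomly drawn parameters `V` and `w` are all illustrative assumptions; in practice the patch embeddings would come from a frozen foundation model and the attention parameters would be learned from slide-level labels.

```python
import math
import random


def attention_mil_pool(bag, V, w):
    """Aggregate one bag of patch embeddings into a slide-level embedding.

    bag: list of n patch embeddings, each a list of d floats.
    V:   k x d projection matrix (list of k rows); learned in practice.
    w:   list of k floats; learned in practice.
    Returns (slide_embedding, attention_weights).
    """
    # One scalar attention score per patch: w^T tanh(V h).
    scores = []
    for h in bag:
        hidden = [math.tanh(sum(v * x for v, x in zip(row, h))) for row in V]
        scores.append(sum(wi * ui for wi, ui in zip(w, hidden)))

    # Softmax over patches, shifted by the max score for numerical stability.
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]

    # Slide embedding is the attention-weighted average of patch embeddings.
    d = len(bag[0])
    slide = [sum(a * h[j] for a, h in zip(weights, bag)) for j in range(d)]
    return slide, weights


# Hypothetical sizes and random stand-ins for real features and parameters.
random.seed(0)
n_patches, d, k = 50, 16, 8
bag = [[random.gauss(0, 1) for _ in range(d)] for _ in range(n_patches)]
V = [[random.gauss(0, 0.1) for _ in range(d)] for _ in range(k)]
w = [random.gauss(0, 1) for _ in range(k)]

slide_embedding, attention = attention_mil_pool(bag, V, w)
print(len(slide_embedding), len(attention))
```

Because the weights are a softmax over patches, the result does not depend on patch order, which is why this kind of aggregator needs only slide-level labels and no pixel- or patch-level annotations. A classifier head on `slide_embedding` would produce the slide-level prediction.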
