Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.
Department of Pathology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
Nat Med. 2024 Mar;30(3):850-862. doi: 10.1038/s41591-024-02857-3. Epub 2024 Mar 19.
Quantitative evaluation of tissue images is crucial for computational pathology (CPath) tasks, requiring the objective characterization of histopathological entities from whole-slide images (WSIs). The high resolution of WSIs and the variability of morphological features present significant challenges, complicating the large-scale annotation of data for high-performance applications. To address this challenge, current efforts have proposed the use of pretrained image encoders through transfer learning from natural image datasets or self-supervised learning on publicly available histopathology datasets, but have not been extensively developed and evaluated across diverse tissue types at scale. We introduce UNI, a general-purpose self-supervised model for pathology, pretrained using more than 100 million images from over 100,000 diagnostic H&E-stained WSIs (>77 TB of data) across 20 major tissue types. The model was evaluated on 34 representative CPath tasks of varying diagnostic difficulty. In addition to outperforming previous state-of-the-art models, we demonstrate new modeling capabilities in CPath such as resolution-agnostic tissue classification, slide classification using few-shot class prototypes, and disease subtyping generalization in classifying up to 108 cancer types in the OncoTree classification system. UNI advances unsupervised representation learning at scale in CPath in terms of both pretraining data and downstream evaluation, enabling data-efficient artificial intelligence models that can generalize and transfer to a wide range of diagnostically challenging tasks and clinical workflows in anatomic pathology.
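The abstract mentions slide classification using few-shot class prototypes. The general idea behind prototype-based few-shot classification — not the paper's exact method — is to average the encoder's embeddings for the few labeled examples of each class into a "prototype," then assign new samples to the nearest prototype by cosine similarity. A minimal sketch, with hypothetical embedding arrays standing in for UNI features:

```python
import numpy as np

def build_prototypes(embeddings: np.ndarray, labels: np.ndarray):
    """Average the embeddings of each class into one prototype per class.

    embeddings: (n_samples, dim) feature vectors from a pretrained encoder
    labels:     (n_samples,) integer class labels
    """
    classes = np.unique(labels)
    protos = np.stack([embeddings[labels == c].mean(axis=0) for c in classes])
    # L2-normalize so prototype matching reduces to cosine similarity
    protos /= np.linalg.norm(protos, axis=1, keepdims=True)
    return classes, protos

def classify(queries: np.ndarray, classes: np.ndarray, protos: np.ndarray):
    """Assign each query embedding to the class of its nearest prototype."""
    q = queries / np.linalg.norm(queries, axis=1, keepdims=True)
    return classes[np.argmax(q @ protos.T, axis=1)]
```

Because no gradient updates are involved, this kind of classifier can be built from only a handful of labeled slides per class, which is what makes strong pretrained representations attractive for data-scarce pathology tasks.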