DeepSMILE：从结直肠癌和乳腺癌的 H&E 全切片图像中直接进行对比自监督预训练，有利于 MSI 和 HRD 分类。

DeepSMILE: Contrastive self-supervised pre-training benefits MSI and HRD classification directly from H&E whole-slide images in colorectal and breast cancer.

机构信息

Netherlands Cancer Institute, Plesmanlaan 121, Amsterdam, CX 1066, the Netherlands; University of Amsterdam, Science Park 402, Amsterdam, XH 1098, the Netherlands.

University of Amsterdam, Science Park 402, Amsterdam, XH 1098, the Netherlands; Ellogon AI B.V., the Netherlands.

出版信息

Med Image Anal. 2022 Jul;79:102464. doi: 10.1016/j.media.2022.102464. Epub 2022 Apr 29.

DOI:10.1016/j.media.2022.102464

PMID:35596966

Abstract

We propose a Deep learning-based weak label learning method for analyzing whole slide images (WSIs) of Hematoxylin and Eosin (H&E) stained tumor tissue not requiring pixel-level or tile-level annotations using Self-supervised pre-training and heterogeneity-aware deep Multiple Instance LEarning (DeepSMILE). We apply DeepSMILE to the task of Homologous recombination deficiency (HRD) and microsatellite instability (MSI) prediction. We utilize contrastive self-supervised learning to pre-train a feature extractor on histopathology tiles of cancer tissue. Additionally, we use variability-aware deep multiple instance learning to learn the tile feature aggregation function while modeling tumor heterogeneity. For MSI prediction in a tumor-annotated and color normalized subset of TCGA-CRC (n=360 patients), contrastive self-supervised learning improves the tile supervision baseline from 0.77 to 0.87 AUROC, on par with our proposed DeepSMILE method. On TCGA-BC (n=1041 patients) without any manual annotations, DeepSMILE improves HRD classification performance from 0.77 to 0.81 AUROC compared to tile supervision with either a self-supervised or ImageNet pre-trained feature extractor. Our proposed methods reach the baseline performance using only 40% of the labeled data on both datasets. These improvements suggest we can use standard self-supervised learning techniques combined with multiple instance learning in the histopathology domain to improve genomic label classification performance with fewer labeled data.

摘要

我们提出了一种基于深度学习的弱标签学习方法，用于分析苏木精和伊红（H&E）染色的肿瘤组织的全幻灯片图像（WSIs），而不需要像素级或瓦片级注释，使用自监督预训练和异质性感知深度多实例学习（DeepSMILE）。我们将 DeepSMILE 应用于同源重组缺陷（HRD）和微卫星不稳定性（MSI）预测任务。我们利用对比自监督学习在癌症组织的组织病理学瓦片上预训练特征提取器。此外，我们使用变异性感知的深度多实例学习来学习瓦片特征聚合函数，同时模拟肿瘤异质性。对于 TCGA-CRC（n=360 名患者）中具有肿瘤注释和颜色归一化的肿瘤子集的 MSI 预测，对比自监督学习将瓦片监督基线从 0.77 提高到 0.87 AUROC，与我们提出的 DeepSMILE 方法相当。在没有任何手动注释的 TCGA-BC（n=1041 名患者）上，与使用自监督或 ImageNet 预训练的特征提取器进行瓦片监督相比，DeepSMILE 将 HRD 分类性能从 0.77 提高到 0.81 AUROC。我们提出的方法在两个数据集上仅使用 40%的标记数据即可达到基线性能。这些改进表明，我们可以在组织病理学领域中使用标准的自监督学习技术结合多实例学习来提高基因组标签分类性能，同时使用更少的标记数据。

相似文献

DeepSMILE: Contrastive self-supervised pre-training benefits MSI and HRD classification directly from H&E whole-slide images in colorectal and breast cancer.

Med Image Anal. 2022 Jul;79:102464. doi: 10.1016/j.media.2022.102464. Epub 2022 Apr 29.

PPsNet: An improved deep learning model for microsatellite instability high prediction in colorectal cancer from whole slide images.

Comput Methods Programs Biomed. 2022 Oct;225:107095. doi: 10.1016/j.cmpb.2022.107095. Epub 2022 Aug 28.

Deep learning model for the prediction of microsatellite instability in colorectal cancer: a diagnostic study.

Lancet Oncol. 2021 Jan;22(1):132-141. doi: 10.1016/S1470-2045(20)30535-0.

Attention-based multiple instance learning with self-supervision to predict microsatellite instability in colorectal cancer from histology whole-slide images.

Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:3068-3071. doi: 10.1109/EMBC48229.2022.9871553.

Development and validation of a weakly supervised deep learning framework to predict the status of molecular pathways and key mutations in colorectal cancer from routine histology images: a retrospective study.

Lancet Digit Health. 2021 Dec;3(12):e763-e772. doi: 10.1016/S2589-7500(21)00180-1. Epub 2021 Oct 19.

Contrastive Multiple Instance Learning: An Unsupervised Framework for Learning Slide-Level Representations of Whole Slide Histopathology Images without Labels.

Cancers (Basel). 2022 Nov 24;14(23):5778. doi: 10.3390/cancers14235778.

SAMPLER: unsupervised representations for rapid analysis of whole slide tissue images.

EBioMedicine. 2024 Jan;99:104908. doi: 10.1016/j.ebiom.2023.104908. Epub 2023 Dec 14.

Lung Cancer Diagnosis on Virtual Histologically Stained Tissue Using Weakly Supervised Learning.

Mod Pathol. 2024 Jun;37(6):100487. doi: 10.1016/j.modpat.2024.100487. Epub 2024 Apr 7.

Iterative multiple instance learning for weakly annotated whole slide image classification.

Phys Med Biol. 2023 Jul 19;68(15). doi: 10.1088/1361-6560/acde3f.

Transformer-based unsupervised contrastive learning for histopathological image classification.

Med Image Anal. 2022 Oct;81:102559. doi: 10.1016/j.media.2022.102559. Epub 2022 Jul 30.

引用本文的文献

MorphoITH: a framework for deconvolving intra-tumor heterogeneity using tissue morphology.

Genome Med. 2025 Sep 19;17(1):101. doi: 10.1186/s13073-025-01504-x.

Multimodal integration strategies for clinical application in oncology.

Front Pharmacol. 2025 Aug 20;16:1609079. doi: 10.3389/fphar.2025.1609079. eCollection 2025.

A generalizable pathology foundation model using a unified knowledge distillation pretraining framework.

Nat Biomed Eng. 2025 Sep 2. doi: 10.1038/s41551-025-01488-4.

Synergistic H&E and IHC image analysis by AI predicts cancer biomarkers and survival outcomes in colorectal and breast cancer.

Commun Med (Lond). 2025 Aug 1;5(1):328. doi: 10.1038/s43856-025-01045-9.

Breast cancer homologous recombination deficiency prediction from pathological images with a sufficient and representative Transformer.

NPJ Precis Oncol. 2025 May 30;9(1):160. doi: 10.1038/s41698-025-00950-5.

Deep Gaussian process with uncertainty estimation for microsatellite instability and immunotherapy response prediction from histology.

NPJ Digit Med. 2025 May 19;8(1):294. doi: 10.1038/s41746-025-01580-8.

Understanding the Impact of Deep Learning Model Parameters on Breast Cancer Histopathological Classification Using ANOVA.

Cancers (Basel). 2025 Apr 24;17(9):1425. doi: 10.3390/cancers17091425.

Benchmarking histopathology foundation models for ovarian cancer bevacizumab treatment response prediction from whole slide images.

Discov Oncol. 2025 Feb 17;16(1):196. doi: 10.1007/s12672-025-01973-x.

Weakly supervised pathological differentiation of primary central nervous system lymphoma and glioblastoma on multi-site whole slide images.

J Med Imaging (Bellingham). 2025 Jan;12(1):017502. doi: 10.1117/1.JMI.12.1.017502. Epub 2025 Jan 11.

The development of an efficient artificial intelligence-based classification approach for colorectal cancer response to radiochemotherapy: deep learning vs. machine learning.

Sci Rep. 2025 Jan 2;15(1):62. doi: 10.1038/s41598-024-84023-w.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

DeepSMILE：从结直肠癌和乳腺癌的 H&E 全切片图像中直接进行对比自监督预训练，有利于 MSI 和 HRD 分类。

DeepSMILE: Contrastive self-supervised pre-training benefits MSI and HRD classification directly from H&E whole-slide images in colorectal and breast cancer.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献