Pisula Juan I, Bozek Katarzyna
Institute for Biomedical Informatics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany.
Center for Molecular Medicine Cologne (CMMC), Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany.
Sci Rep. 2025 Feb 15;15(1):5612. doi: 10.1038/s41598-025-88139-5.
From computer vision to protein fold prediction, Language Models (LMs) have proven successful in transferring their representation of sequential data to a broad spectrum of tasks beyond the domain of natural language processing. Whole Slide Image (WSI) analysis in digital pathology lends itself naturally to transformer-based architectures. In a pre-processing step analogous to text tokenization, large microscopy images are tessellated into smaller image patches. However, because a WSI comprises thousands of such patches, WSI classification has not been addressed with deep transformer architectures, let alone with available text-pre-trained deep transformer language models. We introduce SeqShort, a multi-head attention-based sequence shortening layer that summarizes a large WSI into a short, fixed-length sequence of feature vectors by removing redundant visual information. Our sequence shortening mechanism not only reduces the computational cost of self-attention on large inputs, but also makes it possible to apply standard positional encodings to the previously unordered bag of patches that composes a WSI. We use SeqShort to effectively classify WSIs in several digital pathology tasks with a deep, text-pre-trained transformer model while fine-tuning less than 0.1% of its parameters, demonstrating that its knowledge of natural language transfers well to this domain.
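To illustrate the idea of an attention-based sequence shortening layer, the sketch below shows how a variable-length bag of patch feature vectors can be summarized into a short, fixed-length sequence via cross-attention with learned query tokens. This is a minimal PyTorch sketch under our own assumptions, not the authors' released implementation; names such as `SeqShortLayer`, `num_queries`, and `feat_dim` are illustrative.

```python
# Minimal sketch of a multi-head attention-based sequence shortening layer.
# Assumption: learned queries cross-attend over the bag of patch features;
# the real SeqShort layer may differ in details.
import torch
import torch.nn as nn

class SeqShortLayer(nn.Module):
    """Summarize a variable-length bag of patch features into a short,
    fixed-length sequence via cross-attention with learned queries."""
    def __init__(self, feat_dim: int = 768, num_queries: int = 128, num_heads: int = 8):
        super().__init__()
        # Learned query tokens define the length of the shortened sequence.
        self.queries = nn.Parameter(torch.randn(1, num_queries, feat_dim) * 0.02)
        self.attn = nn.MultiheadAttention(feat_dim, num_heads, batch_first=True)
        # Standard positional encodings become meaningful once the previously
        # unordered bag is mapped to a fixed-length sequence.
        self.pos_emb = nn.Parameter(torch.zeros(1, num_queries, feat_dim))

    def forward(self, patch_feats: torch.Tensor) -> torch.Tensor:
        # patch_feats: (batch, n_patches, feat_dim); n_patches may be thousands.
        q = self.queries.expand(patch_feats.size(0), -1, -1)
        shortened, _ = self.attn(q, patch_feats, patch_feats)
        return shortened + self.pos_emb  # (batch, num_queries, feat_dim)

# Usage: 10,000 patch embeddings reduced to a 128-token sequence that a
# text-pre-trained transformer can process with standard self-attention.
layer = SeqShortLayer()
wsi_bag = torch.randn(1, 10_000, 768)
print(layer(wsi_bag).shape)  # torch.Size([1, 128, 768])
```

With such a layer in front of a frozen, text-pre-trained transformer, only a small fraction of parameters (the shortening layer and task head) needs to be fine-tuned, consistent with the under 0.1% figure reported in the abstract.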