A survey of Transformer applications for histopathological image analysis: New developments and future directions.

Affiliation

School of Microelectronics and Communication Engineering, Chongqing University, Chongqing, 400044, China.

Publication information

Biomed Eng Online. 2023 Sep 25;22(1):96. doi: 10.1186/s12938-023-01157-0.

DOI:10.1186/s12938-023-01157-0

PMID:37749595

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC10518923/

Abstract

Transformers have been widely used in many computer vision challenges and have shown the capability of producing better results than convolutional neural networks (CNNs). Taking advantage of capturing long-range contextual information and learning more complex relations in the image data, Transformers have been used and applied to histopathological image processing tasks. In this survey, we make an effort to present a thorough analysis of the uses of Transformers in histopathological image analysis, covering several topics, from the newly built Transformer models to unresolved challenges. To be more precise, we first begin by outlining the fundamental principles of the attention mechanism included in Transformer models and other key frameworks. Second, we analyze Transformer-based applications in the histopathological imaging domain and provide a thorough evaluation of more than 100 research publications across different downstream tasks to cover the most recent innovations, including survival analysis and prediction, segmentation, classification, detection, and representation. Within this survey work, we also compare the performance of CNN-based techniques to Transformers based on recently published papers, highlight major challenges, and provide interesting future research directions. Despite the outstanding performance of the Transformer-based architectures in a number of papers reviewed in this survey, we anticipate that further improvements and exploration of Transformers in the histopathological imaging domain are still required in the future. We hope that this survey paper will give readers in this field of study a thorough understanding of Transformer-based techniques in histopathological image analysis, and an up-to-date paper list summary will be provided at https://github.com/S-domain/Survey-Paper .
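The attention mechanism that the abstract refers to can be illustrated with a minimal sketch of scaled dot-product self-attention. This is not code from the survey. It assumes toy two-dimensional "patch embeddings" and omits the learned query, key, and value projections that a real Transformer applies; the names `softmax`, `scaled_dot_product_attention`, and `patches` are hypothetical and chosen only for illustration.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(queries, keys, values):
    """Return (outputs, weights) for softmax(Q K^T / sqrt(d_k)) V.

    Each argument is a list of equal-length float vectors. Every query
    attends to every key, which is how a Transformer can relate distant
    image patches to one another.
    """
    d_k = len(keys[0])
    outputs, weights = [], []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
            for k in keys
        ]
        w = softmax(scores)
        weights.append(w)
        # Weighted sum of the value vectors, one output channel at a time.
        out = [
            sum(wj * v[c] for wj, v in zip(w, values))
            for c in range(len(values[0]))
        ]
        outputs.append(out)
    return outputs, weights


# Assumed toy embeddings for three image patches (illustrative values only).
patches = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

# Self-attention: queries, keys, and values all come from the same patches.
outputs, weights = scaled_dot_product_attention(patches, patches, patches)
```

Each row of `weights` sums to 1 and describes how strongly one patch attends to every other patch, regardless of how far apart they are in the image. In practice the inputs would first pass through learned linear projections and multiple attention heads, which this sketch leaves out.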

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e4f6/10518923/74138a43ff58/12938_2023_1157_Fig1_HTML.jpg

Similar articles

Advances in medical image analysis with vision Transformers: A comprehensive review.

Med Image Anal. 2024 Jan;91:103000. doi: 10.1016/j.media.2023.103000. Epub 2023 Oct 19.

Transformers in medical imaging: A survey.

Med Image Anal. 2023 Aug;88:102802. doi: 10.1016/j.media.2023.102802. Epub 2023 Apr 5.

Do it the transformer way: A comprehensive review of brain and vision transformers for autism spectrum disorder diagnosis and classification.

Comput Biol Med. 2023 Dec;167:107667. doi: 10.1016/j.compbiomed.2023.107667. Epub 2023 Nov 3.

Advantages of transformer and its application for medical image segmentation: a survey.

Biomed Eng Online. 2024 Feb 3;23(1):14. doi: 10.1186/s12938-024-01212-4.

Vision Transformers for Computational Histopathology.

IEEE Rev Biomed Eng. 2024;17:63-79. doi: 10.1109/RBME.2023.3297604. Epub 2024 Jan 12.

MSCT-UNET: multi-scale contrastive transformer within U-shaped network for medical image segmentation.

Phys Med Biol. 2023 Dec 28;69(1). doi: 10.1088/1361-6560/ad135d.

Convolutional Networks and Transformers for Mammography Classification: An Experimental Study.

Sensors (Basel). 2023 Jan 20;23(3):1229. doi: 10.3390/s23031229.

CAEVT: Convolutional Autoencoder Meets Lightweight Vision Transformer for Hyperspectral Image Classification.

Sensors (Basel). 2022 May 20;22(10):3902. doi: 10.3390/s22103902.

VSmTrans: A hybrid paradigm integrating self-attention and convolution for 3D medical image segmentation.

Med Image Anal. 2024 Dec;98:103295. doi: 10.1016/j.media.2024.103295. Epub 2024 Aug 24.

Cited by

Accelerating biomedical discoveries in brain health through transformative neuropathology of aging and neurodegeneration.

Neuron. 2025 Jul 15. doi: 10.1016/j.neuron.2025.06.014.

Enhanced nuclear information fusion and visual transformer for pathological breast cancer image classification.

Sci Rep. 2025 Jun 3;15(1):19490. doi: 10.1038/s41598-025-04344-2.

Enhancing basal cell carcinoma classification in preoperative biopsies via transfer learning with weakly supervised graph transformers.

BMC Med Imaging. 2025 May 16;25(1):166. doi: 10.1186/s12880-025-01710-4.

Advanced hybrid deep learning model for enhanced evaluation of osteosarcoma histopathology images.

Front Med (Lausanne). 2025 Apr 16;12:1555907. doi: 10.3389/fmed.2025.1555907. eCollection 2025.

Vision Transformers for Low-Quality Histopathological Images: A Case Study on Squamous Cell Carcinoma Margin Classification.

Diagnostics (Basel). 2025 Jan 23;15(3):260. doi: 10.3390/diagnostics15030260.

Instance-level semantic segmentation of nuclei based on multimodal structure encoding.

BMC Bioinformatics. 2025 Feb 6;26(1):42. doi: 10.1186/s12859-025-06066-8.

Equipping computational pathology systems with artifact processing pipelines: a showcase for computation and performance trade-offs.

BMC Med Inform Decis Mak. 2024 Oct 7;24(1):288. doi: 10.1186/s12911-024-02676-z.

Automated quantification of SARS-CoV-2 pneumonia with large vision model knowledge adaptation.

New Microbes New Infect. 2024 Aug 15;62:101457. doi: 10.1016/j.nmni.2024.101457. eCollection 2024 Dec.

Advantages of transformer and its application for medical image segmentation: a survey.

Biomed Eng Online. 2024 Feb 3;23(1):14. doi: 10.1186/s12938-024-01212-4.
