School of Microelectronics and Communication Engineering, Chongqing University, Chongqing, 400044, China.
Biomed Eng Online. 2023 Sep 25;22(1):96. doi: 10.1186/s12938-023-01157-0.
Transformers have been widely used in many computer vision challenges and have shown the capability of producing better results than convolutional neural networks (CNNs). Taking advantage of capturing long-range contextual information and learning more complex relations in the image data, Transformers have been used and applied to histopathological image processing tasks. In this survey, we make an effort to present a thorough analysis of the uses of Transformers in histopathological image analysis, covering several topics, from the newly built Transformer models to unresolved challenges. To be more precise, we first begin by outlining the fundamental principles of the attention mechanism included in Transformer models and other key frameworks. Second, we analyze Transformer-based applications in the histopathological imaging domain and provide a thorough evaluation of more than 100 research publications across different downstream tasks to cover the most recent innovations, including survival analysis and prediction, segmentation, classification, detection, and representation. Within this survey work, we also compare the performance of CNN-based techniques to Transformers based on recently published papers, highlight major challenges, and provide interesting future research directions. Despite the outstanding performance of the Transformer-based architectures in a number of papers reviewed in this survey, we anticipate that further improvements and exploration of Transformers in the histopathological imaging domain are still required in the future. We hope that this survey paper will give readers in this field of study a thorough understanding of Transformer-based techniques in histopathological image analysis, and an up-to-date paper list summary will be provided at https://github.com/S-domain/Survey-Paper .
Transformers 在许多计算机视觉挑战中得到了广泛应用,并展示了比卷积神经网络 (CNN) 产生更好结果的能力。利用捕捉远程上下文信息和学习图像数据中更复杂的关系,Transformer 已被用于和应用于组织病理学图像处理任务。在本调查中,我们努力对 Transformer 在组织病理学图像分析中的应用进行全面分析,涵盖了从新构建的 Transformer 模型到未解决的挑战等几个主题。更准确地说,我们首先概述了 Transformer 模型中包含的注意力机制的基本原理和其他关键框架。其次,我们分析了 Transformer 在组织病理学成像领域的应用,并对超过 100 篇不同下游任务的研究论文进行了全面评估,以涵盖最新的创新,包括生存分析和预测、分割、分类、检测和表示。在这项调查工作中,我们还根据最近发表的论文将基于 CNN 的技术与基于 Transformer 的技术的性能进行了比较,突出了主要挑战,并提供了有趣的未来研究方向。尽管在本调查中审查的许多论文中基于 Transformer 的架构表现出色,但我们预计未来在组织病理学成像领域仍需要对 Transformer 进行进一步的改进和探索。我们希望本调查论文能够让该领域的读者对基于 Transformer 的技术在组织病理学图像分析中有一个全面的了解,并将在 https://github.com/S-domain/Survey-Paper 提供最新的论文列表摘要。