Department of Mechanical Engineering, National University of Singapore, 9 Engineering Drive 1, Singapore, 117575, Singapore.
Department of Mechanical Engineering, National University of Singapore, 9 Engineering Drive 1, Singapore, 117575, Singapore; School of Intelligent Systems Engineering, Sun Yat-sen University, No. 66, Gongchang Road, Guangming District, 518107, China.
Comput Biol Med. 2023 Sep;164:107268. doi: 10.1016/j.compbiomed.2023.107268. Epub 2023 Jul 20.
The transformer is primarily used in the field of natural language processing. Recently, it has been adopted and shows promise in the computer vision (CV) field. Medical image analysis (MIA), as a critical branch of CV, also greatly benefits from this state-of-the-art technique. In this review, we first recap the core component of the transformer, the attention mechanism, and the detailed structures of the transformer. After that, we depict the recent progress of the transformer in the field of MIA. We organize the applications in a sequence of different tasks, including classification, segmentation, captioning, registration, detection, enhancement, localization, and synthesis. The mainstream classification and segmentation tasks are further divided into eleven medical image modalities. A large number of experiments studied in this review illustrate that the transformer-based method outperforms existing methods through comparisons with multiple evaluation metrics. Finally, we discuss the open challenges and future opportunities in this field. This task-modality review with the latest contents, detailed information, and comprehensive comparison may greatly benefit the broad MIA community.
变压器主要用于自然语言处理领域。最近,它在计算机视觉(CV)领域得到了应用,并显示出了前景。医学图像分析(MIA)作为 CV 的一个重要分支,也极大地受益于这项最先进的技术。在这篇综述中,我们首先回顾了变压器的核心组成部分,即注意力机制,以及变压器的详细结构。之后,我们描述了变压器在 MIA 领域的最新进展。我们按照不同的任务序列组织应用,包括分类、分割、字幕生成、配准、检测、增强、定位和合成。主流的分类和分割任务进一步分为十一种医学图像模态。本综述中的大量实验研究表明,通过与多种评估指标的比较,基于变压器的方法优于现有方法。最后,我们讨论了该领域的开放挑战和未来机遇。这个具有最新内容、详细信息和全面比较的任务-模态综述可能会使广大 MIA 社区受益匪浅。