Li Xinchen, Hong Yuan, Xu Yang, Hu Mu
Department of Orthopedics, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China.
Diagnostics (Basel). 2024 Aug 25;14(17):1859. doi: 10.3390/diagnostics14171859.
The accurate and efficient segmentation of the spine is important in the diagnosis and treatment of spinal disorders and fractures. However, this task remains challenging because of large inter-vertebral variations in shape and the varying location of the spine across images. In previous methods, convolutional neural networks (CNNs) have been widely applied as the vision backbone for this task. However, because of the inherent locality of the convolution operation, these methods struggle to exploit global contextual information across the whole image for accurate spine segmentation. Compared with CNNs, the Vision Transformer (ViT) offers an alternative vision backbone with a strong capacity to capture global contextual information. However, when the ViT is employed for spine segmentation, it treats all input tokens equally, whether or not they relate to vertebrae, and it lacks the capability to locate regions of interest, which lowers segmentation accuracy. To address these limitations, we propose a novel Vertebrae-aware Vision Transformer (VerFormer) for automatic spine segmentation from CT images. VerFormer incorporates a novel Vertebrae-aware Global (VG) block into the ViT backbone. In the VG block, vertebrae-related global contextual information is extracted by a Vertebrae-aware Global Query (VGQ) module and then injected into the query tokens to highlight vertebrae-related tokens in the multi-head self-attention module. The VG block can therefore leverage global contextual information to locate the spine effectively and efficiently across the whole input, improving segmentation accuracy. Driven by this design, VerFormer captures more discriminative dependencies and vertebrae-related context in automatic spine segmentation. Experimental results on two spine CT segmentation tasks demonstrate the effectiveness of the VG block and the superiority of VerFormer: compared with other popular CNN- or ViT-based segmentation models, VerFormer achieves higher segmentation accuracy and better generalization.
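The abstract does not include implementation details, but the following PyTorch sketch illustrates one plausible reading of the VG/VGQ idea: a global descriptor of the token sequence is extracted, gated toward vertebrae-relevant channels, and added to the query tokens before standard multi-head self-attention. The module names, the pooling-plus-MLP form of the VGQ, and all hyperparameters are assumptions for illustration, not the authors' published code.

```python
import torch
import torch.nn as nn


class VertebraeAwareAttention(nn.Module):
    """Hypothetical sketch of a vertebrae-aware multi-head self-attention block.

    Per the abstract, a Vertebrae-aware Global Query (VGQ) module extracts
    vertebrae-related global context that is then incorporated into the query
    tokens. The concrete design below (global average pooling over tokens plus
    a gating MLP) is an assumption, not the paper's implementation.
    """

    def __init__(self, dim: int, num_heads: int = 8):
        super().__init__()
        assert dim % num_heads == 0
        self.num_heads = num_heads
        self.scale = (dim // num_heads) ** -0.5
        self.qkv = nn.Linear(dim, dim * 3)
        self.proj = nn.Linear(dim, dim)
        # Assumed VGQ: gate a pooled global descriptor through an MLP so that
        # vertebrae-related channels dominate the query bias.
        self.vgq = nn.Sequential(
            nn.Linear(dim, dim),
            nn.GELU(),
            nn.Linear(dim, dim),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, num_tokens, dim)
        B, N, C = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)

        # Global context vector shared by all tokens (assumed design),
        # added to the queries to highlight vertebrae-related tokens.
        g = x.mean(dim=1, keepdim=True)   # (B, 1, C)
        q = q + self.vgq(g) * g           # broadcast over the token axis

        # Standard multi-head scaled dot-product attention.
        def split(t: torch.Tensor) -> torch.Tensor:
            return t.view(B, N, self.num_heads, C // self.num_heads).transpose(1, 2)

        q, k, v = split(q), split(k), split(v)
        attn = (q @ k.transpose(-2, -1)) * self.scale
        out = (attn.softmax(dim=-1) @ v).transpose(1, 2).reshape(B, N, C)
        return self.proj(out)


if __name__ == "__main__":
    # Toy usage: 196 patch tokens of width 256, as in a small ViT stage.
    tokens = torch.randn(2, 196, 256)
    block = VertebraeAwareAttention(dim=256)
    print(block(tokens).shape)  # torch.Size([2, 196, 256])
```

The design choice sketched here keeps the attention itself unchanged and only biases the queries, which matches the abstract's claim that the VG block highlights vertebrae-related tokens inside an otherwise standard multi-head self-attention module.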