LumVertCancNet: A novel 3D lumbar vertebral body cancellous bone location and segmentation method based on hybrid Swin-transformer.

Author information

Key Laboratory of Opto-Electronic Information Processing, Chinese Academy of Sciences, Shenyang 110016, China; Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, China; Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang 110169, China; University of Chinese Academy of Sciences, Beijing 100049, China.

Publication information

Comput Biol Med. 2024 Mar;171:108237. doi: 10.1016/j.compbiomed.2024.108237. Epub 2024 Feb 28.

DOI:10.1016/j.compbiomed.2024.108237

PMID:38422966

Abstract

Locating and segmenting the cancellous bone of the lumbar vertebral bodies is a crucial step in an automated lumbar spine processing pipeline. Accurate and reliable analysis of lumbar spine images is expected to benefit practical medical diagnosis and population-based analysis of bone strength. However, designing automated algorithms for lumbar spine processing is demanding because of significant anatomical variation and the scarcity of publicly available data. In recent years, convolutional neural networks (CNNs) and vision transformers (ViTs) have become the de facto standard in medical image segmentation. Although CNNs are adept at capturing local features, their inherent bias toward locality and weight sharing constrains their capacity to model long-range dependencies. In contrast, ViTs excel at long-range dependency modeling, but they may not generalize well on limited datasets because they lack the inductive biases inherent to CNNs. In this paper, we propose a deep learning-based, two-stage, coarse-to-fine solution for automatic location and segmentation of lumbar vertebral body cancellous bone. Specifically, in the first stage, a Swin-transformer-based model predicts a heatmap of the lumbar vertebral body centroids. Considering the characteristic anatomical structure of the lumbar spine, we propose a novel loss function, LumAnatomy loss, which constrains the order and bend of the predicted vertebral body centroids. To inherit the strengths of CNNs and ViTs while avoiding their respective limitations, in the second stage we propose an encoder-decoder network to segment the identified lumbar vertebral body cancellous bone; it consists of two parallel encoders, a Swin-transformer encoder and a CNN encoder. To improve the combination of CNN and ViT features, we propose a novel multi-scale attention feature fusion module (MSA-FFM), which addresses issues that arise when fusing features from different encoders.
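The abstract does not describe the internals of MSA-FFM, so the following is only a minimal sketch of the general idea of attention-weighted fusion of two encoder streams at several scales. The function names (`gated_fusion`, `fuse_multiscale`), the parameter-free gate, and the `[channels][voxels]` layout are all assumptions made for illustration; the published module uses learned attention and will differ.

```python
import math


def sigmoid(x):
    """Logistic function mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def channel_means(feat):
    """Global average pooling: one mean per channel.

    `feat` is a list of channels, each a flat list of voxel activations
    (an assumed layout, chosen to keep the sketch dependency-free).
    """
    return [sum(channel) / len(channel) for channel in feat]


def gated_fusion(cnn_feat, swin_feat):
    """Fuse two same-shaped feature maps with a per-channel gate.

    Hypothetical rule, not the paper's: the gate for channel c is
    sigmoid(mean(cnn_c) - mean(swin_c)), and the fused channel is
    gate * cnn_c + (1 - gate) * swin_c. A real module would learn the gate.
    """
    if len(cnn_feat) != len(swin_feat):
        raise ValueError("both encoders must provide the same number of channels")
    gates = [
        sigmoid(c_mean - s_mean)
        for c_mean, s_mean in zip(channel_means(cnn_feat), channel_means(swin_feat))
    ]
    fused = [
        [g * c + (1.0 - g) * s for c, s in zip(c_channel, s_channel)]
        for g, c_channel, s_channel in zip(gates, cnn_feat, swin_feat)
    ]
    return fused, gates


def fuse_multiscale(cnn_pyramid, swin_pyramid):
    """Apply the gated fusion independently at every scale of two feature pyramids."""
    return [gated_fusion(c, s)[0] for c, s in zip(cnn_pyramid, swin_pyramid)]
```

Because the gate is a convex weight, each fused activation always lies between the two encoder activations it combines; a decoder would then consume one fused map per scale.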
To address the lack of data, we present the first large-scale lumbar vertebral body cancellous bone segmentation dataset, LumVBCanSeg, which contains 185 CT scans annotated at the voxel level by 3 physicians. Extensive experiments on the LumVBCanSeg dataset demonstrate that the proposed algorithm outperforms other state-of-the-art medical image segmentation methods. The data are publicly available at https://zenodo.org/record/8181250. The implementation of the proposed method is available at https://github.com/sia405yd/LumVertCancNet.
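The abstract states only that the LumAnatomy loss enforces the order and bend of the predicted centroids; it gives no formula. As a rough sketch of how such constraints could be written, the example below penalizes out-of-order centroids with a hinge term and sharp bends with a cosine term. The function names, the `(x, y, z)` tuple layout, the assumption that z decreases from L1 to L5, and the `bend_weight` parameter are hypothetical and are not taken from the paper or its repository.

```python
import math


def order_penalty(centroids, margin=0.0):
    """Hinge penalty for each vertebra that is not below its predecessor.

    `centroids` lists (x, y, z) tuples from L1 to L5; z is assumed to
    decrease from the top of the lumbar spine to the bottom.
    """
    total = 0.0
    for (_, _, z_upper), (_, _, z_lower) in zip(centroids, centroids[1:]):
        total += max(0.0, z_lower - z_upper + margin)
    return total


def bend_penalty(centroids):
    """Sum of (1 - cos(angle)) between consecutive inter-centroid segments.

    A straight chain scores 0; a full reversal of direction scores 2 per joint.
    """
    segments = [
        tuple(b_i - a_i for a_i, b_i in zip(a, b))
        for a, b in zip(centroids, centroids[1:])
    ]
    total = 0.0
    for u, v in zip(segments, segments[1:]):
        norm_u = math.sqrt(sum(c * c for c in u))
        norm_v = math.sqrt(sum(c * c for c in v))
        if norm_u == 0.0 or norm_v == 0.0:
            continue  # coincident centroids define no direction; skip the joint
        cosine = sum(a * b for a, b in zip(u, v)) / (norm_u * norm_v)
        total += 1.0 - cosine
    return total


def anatomy_penalty(centroids, bend_weight=1.0):
    """Combine both terms; `bend_weight` is an illustrative hyperparameter."""
    return order_penalty(centroids) + bend_weight * bend_penalty(centroids)
```

For a straight, correctly ordered chain such as `[(0, 0, 100), (0, 0, 80), (0, 0, 60)]` both terms are zero, while swapping the last two centroids makes both terms positive. In training, a term like this would be added to the heatmap regression loss and computed on differentiable centroid estimates rather than on plain Python tuples.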

Similar articles

Dual encoder network with transformer-CNN for multi-organ segmentation.

Med Biol Eng Comput. 2023 Mar;61(3):661-671. doi: 10.1007/s11517-022-02723-9. Epub 2022 Dec 29.

CoTrFuse: a novel framework by fusing CNN and transformer for medical image segmentation.

Phys Med Biol. 2023 Aug 22;68(17). doi: 10.1088/1361-6560/acede8.

SwinCross: Cross-modal Swin transformer for head-and-neck tumor segmentation in PET/CT images.

Med Phys. 2024 Mar;51(3):2096-2107. doi: 10.1002/mp.16703. Epub 2023 Sep 30.

Efficient brain tumor segmentation using Swin transformer and enhanced local self-attention.

Int J Comput Assist Radiol Surg. 2024 Feb;19(2):273-281. doi: 10.1007/s11548-023-03024-8. Epub 2023 Oct 5.

MedFuseNet: fusing local and global deep feature representations with hybrid attention mechanisms for medical image segmentation.

Sci Rep. 2025 Feb 11;15(1):5093. doi: 10.1038/s41598-025-89096-9.

VSmTrans: A hybrid paradigm integrating self-attention and convolution for 3D medical image segmentation.

Med Image Anal. 2024 Dec;98:103295. doi: 10.1016/j.media.2024.103295. Epub 2024 Aug 24.

SymTC: A symbiotic Transformer-CNN net for instance segmentation of lumbar spine MRI.

Comput Biol Med. 2024 Sep;179:108795. doi: 10.1016/j.compbiomed.2024.108795. Epub 2024 Jul 1.

TAC-UNet: transformer-assisted convolutional neural network for medical image segmentation.

Quant Imaging Med Surg. 2024 Dec 5;14(12):8824-8839. doi: 10.21037/qims-24-1229. Epub 2024 Nov 5.

Multi-scale dual-channel feature embedding decoder for biomedical image segmentation.

Comput Methods Programs Biomed. 2024 Dec;257:108464. doi: 10.1016/j.cmpb.2024.108464. Epub 2024 Oct 18.

Cited by

Lumbar and pelvic CT image segmentation based on cross-scale feature fusion and linear self-attention mechanism.

Sci Rep. 2025 Aug 1;15(1):28131. doi: 10.1038/s41598-025-13569-0.

Artificial Intelligence in Orthopedic Surgery: Current Applications, Challenges, and Future Directions.

MedComm (2020). 2025 Jun 25;6(7):e70260. doi: 10.1002/mco2.70260. eCollection 2025 Jul.

Development of a deep learning model for measuring sagittal parameters on cervical spine X-ray.

Eur Spine J. 2025 Jun 5. doi: 10.1007/s00586-025-08946-2.

The Application of Artificial Intelligence in Spine Surgery: A Scoping Review.

J Am Acad Orthop Surg Glob Res Rev. 2025 Apr 10;9(4). doi: 10.5435/JAAOSGlobal-D-24-00405. eCollection 2025 Apr 1.

WGAN-based multi-structure segmentation of vertebral cross-section MRI using ResU-Net and clustered transformer.

Sci Rep. 2024 Nov 11;14(1):27474. doi: 10.1038/s41598-024-79244-y.

Pediatric Solid-State 3D Models of Lumbar Vertebrae and Spine.

Cureus. 2024 Apr 24;16(4):e58938. doi: 10.7759/cureus.58938. eCollection 2024 Apr.
