Vision transformer-based diagnosis of lumbar disc herniation with Grad-CAM interpretability in CT imaging.

Authors

Chu Qingsong, Wang Xingyu, Lv Hao, Zhou Yao, Jiang Ting

Affiliations

The First Affiliated Hospital of Anhui University of Chinese Medicine, Hefei, China.

Anhui University of Chinese Medicine, Hefei, China.

Publication information

BMC Musculoskelet Disord. 2025 Apr 29;26(1):419. doi: 10.1186/s12891-025-08602-2.

DOI:10.1186/s12891-025-08602-2

PMID:40301802

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12039304/

Abstract

BACKGROUND

This study proposes, for the first time, a computed tomography (CT)-vision transformer (ViT) framework for diagnosing lumbar disc herniation (LDH) that draws on the multidirectional advantages of both CT and a ViT.

METHODS

The proposed ViT model was trained and validated on a dataset consisting of 983 patients, including 2100 CT images. We compared the performance of the ViT model with that of several convolutional neural networks (CNNs), including ResNet18, ResNet50, LeNet, AlexNet, and VGG16, across two primary tasks: vertebra localization and disc abnormality classification.
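The abstract does not describe the ViT configuration, so the following is only a general sketch of how a ViT turns an image into a sequence of patch tokens before self-attention relates them to one another. The function name `split_into_patches`, the patch size, and the toy 4x4 "image" are hypothetical and are not taken from the study.

```python
def split_into_patches(image, patch_size):
    """Split a 2D image (list of rows) into flattened, non-overlapping patches.

    Patches are returned in row-major order, which is the usual token order
    for a ViT. This is an illustrative sketch, not the authors' preprocessing.
    """
    height, width = len(image), len(image[0])
    if height % patch_size or width % patch_size:
        raise ValueError("image dimensions must be divisible by patch_size")

    patches = []
    for top in range(0, height, patch_size):
        for left in range(0, width, patch_size):
            # Flatten one patch_size x patch_size block into a single token vector
            patch = [
                image[r][c]
                for r in range(top, top + patch_size)
                for c in range(left, left + patch_size)
            ]
            patches.append(patch)
    return patches


# Toy 4x4 image with pixel values 0..15 (hypothetical data)
toy_image = [[row * 4 + col for col in range(4)] for row in range(4)]
tokens = split_into_patches(toy_image, patch_size=2)
print(len(tokens), tokens[0])
```

Because every token can attend to every other token, a patch covering one vertebra can be related directly to a distant patch, which is the kind of global dependency the abstract refers to.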

RESULTS

The integration of a ViT with CT imaging allowed the constructed model to capture the complex spatial relationships and global dependencies within scans, outperforming CNN models and achieving accuracies of 97.13% and 93.63% in terms of vertebra localization and disc abnormality classification, respectively. The performance of the model was further validated via gradient-weighted class activation mapping (Grad-CAM), providing interpretable insights into the regions of the CT scans that contributed to the model predictions.
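To illustrate the Grad-CAM step mentioned above, here is a minimal sketch in plain Python. It is not the authors' implementation. It assumes the feature maps, and the gradients of the class score with respect to them, have already been extracted from the model. For a ViT this typically means reshaping patch-token activations into a 2D grid, which is an assumption here because the abstract does not say which layer was used. The function name `grad_cam` and the toy inputs are hypothetical.

```python
def grad_cam(activations, gradients):
    """Compute a Grad-CAM heatmap from K feature maps and their gradients.

    activations, gradients: lists of K maps, each an H x W nested list.
    Returns an H x W heatmap scaled to [0, 1].
    """
    num_maps = len(activations)
    height = len(activations[0])
    width = len(activations[0][0])

    # One weight per map: the spatial average of that map's gradients
    weights = [
        sum(sum(row) for row in grad_map) / (height * width)
        for grad_map in gradients
    ]

    # Weighted sum of the maps, then ReLU to keep only positive evidence
    cam = [
        [
            max(0.0, sum(weights[k] * activations[k][i][j] for k in range(num_maps)))
            for j in range(width)
        ]
        for i in range(height)
    ]

    # Scale to [0, 1] so the map can be overlaid on the image
    peak = max(max(row) for row in cam)
    if peak > 0:
        cam = [[value / peak for value in row] for row in cam]
    return cam


# Toy example with two 2x2 feature maps (hypothetical values)
toy_activations = [[[1.0, 0.0], [0.0, 2.0]], [[0.0, 3.0], [1.0, 0.0]]]
toy_gradients = [[[1.0, 1.0], [1.0, 1.0]], [[-1.0, -1.0], [-1.0, -1.0]]]
heatmap = grad_cam(toy_activations, toy_gradients)
print(heatmap)  # [[0.5, 0.0], [0.0, 1.0]]
```

In the toy example the second map has a negative weight, so the locations it highlights are suppressed by the ReLU and only regions supporting the predicted class remain in the heatmap.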

CONCLUSION

This study demonstrated the potential of a ViT for diagnosing LDH using CT imaging. The results highlight the promising clinical applications of this approach, particularly for enhancing the diagnostic efficiency and transparency of medical AI systems.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a00/12039304/90233952e9d3/12891_2025_8602_Fig1_HTML.jpg

Similar articles

Enhanced tuberculosis detection using Vision Transformers and explainable AI with a Grad-CAM approach on chest X-rays.

BMC Med Imaging. 2025 Mar 24;25(1):96. doi: 10.1186/s12880-025-01630-3.

Development and application of AI assisted automatic reconstruction of axial lumbar disc CT images and diagnosis of lumbar disc herniation.

Eur J Radiol. 2025 Apr;185:112003. doi: 10.1016/j.ejrad.2025.112003. Epub 2025 Feb 13.

ResViT FusionNet Model: An explainable AI-driven approach for automated grading of diabetic retinopathy in retinal images.

Comput Biol Med. 2025 Mar;186:109656. doi: 10.1016/j.compbiomed.2025.109656. Epub 2025 Jan 16.

Deep Learning for Lumbar Disc Herniation Diagnosis and Treatment Decision-Making Using Magnetic Resonance Imagings: A Retrospective Study.

World Neurosurg. 2025 Mar;195:123728. doi: 10.1016/j.wneu.2025.123728. Epub 2025 Feb 26.

Vision transformer and deep learning based weighted ensemble model for automated spine fracture type identification with GAN generated CT images.

Sci Rep. 2025 Apr 25;15(1):14408. doi: 10.1038/s41598-025-98518-7.

Parametric modeling of the intervertebral disc space in 3D: application to CT images of the lumbar spine.

Comput Med Imaging Graph. 2014 Oct;38(7):596-605. doi: 10.1016/j.compmedimag.2014.04.008. Epub 2014 May 14.

SymTC: A symbiotic Transformer-CNN net for instance segmentation of lumbar spine MRI.

Comput Biol Med. 2024 Sep;179:108795. doi: 10.1016/j.compbiomed.2024.108795. Epub 2024 Jul 1.

A novel hybrid ViT-LSTM model with explainable AI for brain stroke detection and classification in CT images: A case study of Rajshahi region.

Comput Biol Med. 2025 Mar;186:109711. doi: 10.1016/j.compbiomed.2025.109711. Epub 2025 Jan 22.

Pure Vision Transformer (CT-ViT) with Noise2Neighbors Interpolation for Low-Dose CT Image Denoising.

J Imaging Inform Med. 2024 Oct;37(5):2669-2687. doi: 10.1007/s10278-024-01108-8. Epub 2024 Apr 15.
