基于视觉Transformer的胸部 X 射线图像肺病检测的优化。

Optimization of vision transformer-based detection of lung diseases from chest X-ray images.

机构信息

Department of Physiology, Ajou University School of Medicine, Suwon, Republic of Korea.

Department of Biomedical Science, Graduate School, Ajou University, Suwon, Republic of Korea.

出版信息

BMC Med Inform Decis Mak. 2024 Jul 8;24(1):191. doi: 10.1186/s12911-024-02591-3.

DOI:10.1186/s12911-024-02591-3

PMID:38978027

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11232177/

Abstract

BACKGROUND

Recent advances in Vision Transformer (ViT)-based deep learning have significantly improved the accuracy of lung disease prediction from chest X-ray images. However, limited research exists on comparing the effectiveness of different optimizers for lung disease prediction within ViT models. This study aims to systematically evaluate and compare the performance of various optimization methods for ViT-based models in predicting lung diseases from chest X-ray images.

METHODS

This study utilized a chest X-ray image dataset comprising 19,003 images containing both normal cases and six lung diseases: COVID-19, Viral Pneumonia, Bacterial Pneumonia, Middle East Respiratory Syndrome (MERS), Severe Acute Respiratory Syndrome (SARS), and Tuberculosis. Each ViT model (ViT, FastViT, and CrossViT) was individually trained with each optimization method (Adam, AdamW, NAdam, RAdam, SGDW, and Momentum) to assess their performance in lung disease prediction.

RESULTS

When tested with ViT on the dataset with balanced-sample sized classes, RAdam demonstrated superior accuracy compared to other optimizers, achieving 95.87%. In the dataset with imbalanced sample size, FastViT with NAdam achieved the best performance with an accuracy of 97.63%.

CONCLUSIONS

We provide comprehensive optimization strategies for developing ViT-based model architectures, which can enhance the performance of these models for lung disease prediction from chest X-ray images.

摘要

背景

基于 Vision Transformer（ViT）的深度学习的最新进展极大地提高了从胸部 X 光图像预测肺部疾病的准确性。然而，关于比较不同优化器在 ViT 模型中预测肺部疾病的有效性的研究有限。本研究旨在系统评估和比较各种优化方法在基于 ViT 的模型中预测胸部 X 光图像中肺部疾病的性能。

方法

本研究使用了一个包含 19003 张图像的胸部 X 光图像数据集，其中包含正常病例和六种肺部疾病：COVID-19、病毒性肺炎、细菌性肺炎、中东呼吸综合征（MERS）、严重急性呼吸综合征（SARS）和结核病。每个 ViT 模型（ViT、FastViT 和 CrossViT）都分别使用每个优化方法（Adam、AdamW、NAdam、RAdam、SGDW 和 Momentum）进行训练，以评估它们在肺部疾病预测中的性能。

结果

当在具有平衡样本大小类别的数据集上使用 ViT 进行测试时，RAdam 与其他优化器相比表现出更高的准确性，达到 95.87%。在具有不平衡样本大小的数据集上，使用 NAdam 的 FastViT 实现了最佳性能，准确率为 97.63%。

结论

我们为开发基于 ViT 的模型架构提供了全面的优化策略，这可以提高这些模型从胸部 X 光图像预测肺部疾病的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bafc/11232177/6fa4e1d89218/12911_2024_2591_Fig1_HTML.jpg

相似文献

Optimization of vision transformer-based detection of lung diseases from chest X-ray images.基于视觉Transformer的胸部 X 射线图像肺病检测的优化。

BMC Med Inform Decis Mak. 2024 Jul 8;24(1):191. doi: 10.1186/s12911-024-02591-3.

IEViT: An enhanced vision transformer architecture for chest X-ray image classification.IEViT：一种用于胸部 X 射线图像分类的增强型视觉Transformer 架构。

Comput Methods Programs Biomed. 2022 Nov;226:107141. doi: 10.1016/j.cmpb.2022.107141. Epub 2022 Sep 16.

LDDNet: A Deep Learning Framework for the Diagnosis of Infectious Lung Diseases.LDDNet：一种用于诊断感染性肺部疾病的深度学习框架。

Sensors (Basel). 2023 Jan 2;23(1):480. doi: 10.3390/s23010480.

Lung pneumonia severity scoring in chest X-ray images using transformers.基于变换模型的胸部 X 射线图像中肺炎严重程度评分

Med Biol Eng Comput. 2024 Aug;62(8):2389-2407. doi: 10.1007/s11517-024-03066-3. Epub 2024 Apr 9.

PneuNet: deep learning for COVID-19 pneumonia diagnosis on chest X-ray image analysis using Vision Transformer.PneuNet：使用 Vision Transformer 进行胸部 X 射线图像分析的 COVID-19 肺炎诊断的深度学习。

Med Biol Eng Comput. 2023 Jun;61(6):1395-1408. doi: 10.1007/s11517-022-02746-2. Epub 2023 Jan 31.

XRayWizard: Reconstructing 3-D lung surfaces from a single 2-D chest x-ray image via Vision Transformer.基于 Vision Transformer 从单张二维胸部 X 光图像重建三维肺部表面

Med Phys. 2024 Apr;51(4):2806-2816. doi: 10.1002/mp.16781. Epub 2023 Oct 11.

A novel adaptive cubic quasi-Newton optimizer for deep learning based medical image analysis tasks, validated on detection of COVID-19 and segmentation for COVID-19 lung infection, liver tumor, and optic disc/cup.一种用于深度学习的新型自适应三次拟牛顿优化器，在 COVID-19 检测和 COVID-19 肺部感染、肝脏肿瘤以及视盘/杯分割等医学图像分析任务中得到验证。

Med Phys. 2023 Mar;50(3):1528-1538. doi: 10.1002/mp.15969. Epub 2022 Oct 6.

Identification of Asymptomatic COVID-19 Patients on Chest CT Images Using Transformer-Based or Convolutional Neural Network-Based Deep Learning Models.基于 Transformer 或卷积神经网络的深度学习模型在胸部 CT 图像中识别无症状 COVID-19 患者。

J Digit Imaging. 2023 Jun;36(3):827-836. doi: 10.1007/s10278-022-00754-0. Epub 2023 Jan 3.

Deep Learning Algorithm for COVID-19 Classification Using Chest X-Ray Images.基于胸部 X 光图像的 COVID-19 分类深度学习算法。

Comput Math Methods Med. 2021 Nov 9;2021:9269173. doi: 10.1155/2021/9269173. eCollection 2021.

CovXNet: A multi-dilation convolutional neural network for automatic COVID-19 and other pneumonia detection from chest X-ray images with transferable multi-receptive feature optimization.CovXNet：一种多扩张卷积神经网络，用于从胸部 X 光图像中自动检测 COVID-19 和其他肺炎，具有可转移的多感受野特征优化。

Comput Biol Med. 2020 Jul;122:103869. doi: 10.1016/j.compbiomed.2020.103869. Epub 2020 Jun 20.

引用本文的文献

A Deep Convolutional Neural Network Model for Lung Disease Detection Using Chest X-Ray Imaging.一种使用胸部X光成像进行肺病检测的深度卷积神经网络模型。

Pulm Med. 2025 Jun 24;2025:6614016. doi: 10.1155/pm/6614016. eCollection 2025.

A ubiquitous and interoperable deep learning model for automatic detection of pleomorphic gastroesophageal lesions.一种用于自动检测多形性胃食管病变的通用且可互操作的深度学习模型。

Sci Rep. 2025 Jul 2;15(1):22889. doi: 10.1038/s41598-025-03397-7.

Metaheuristic optimizers integrated with vision transformer model for severity detection and classification via multimodal COVID-19 images.通过多模态新冠肺炎图像，将元启发式优化器与视觉Transformer模型集成用于严重程度检测和分类。

Sci Rep. 2025 Apr 22;15(1):13941. doi: 10.1038/s41598-025-98593-w.

Automated classification of chest X-rays: a deep learning approach with attention mechanisms.胸部X光片的自动分类：一种具有注意力机制的深度学习方法。

BMC Med Imaging. 2025 Mar 4;25(1):71. doi: 10.1186/s12880-025-01604-5.

本文引用的文献

A review of deep learning in medical imaging: Imaging traits, technology trends, case studies with progress highlights, and future promises.医学成像中的深度学习综述：成像特征、技术趋势、具有进展亮点的案例研究及未来展望。

Proc IEEE Inst Electr Electron Eng. 2021 May;109(5):820-838. doi: 10.1109/JPROC.2021.3054390. Epub 2021 Feb 26.

High-precision multiclass classification of lung disease through customized MobileNetV2 from chest X-ray images.通过定制的MobileNetV2从胸部X光图像实现肺部疾病的高精度多类别分类。

Comput Biol Med. 2023 Mar;155:106646. doi: 10.1016/j.compbiomed.2023.106646. Epub 2023 Feb 10.

IEViT: An enhanced vision transformer architecture for chest X-ray image classification.IEViT：一种用于胸部 X 射线图像分类的增强型视觉Transformer 架构。

Comput Methods Programs Biomed. 2022 Nov;226:107141. doi: 10.1016/j.cmpb.2022.107141. Epub 2022 Sep 16.

A deep learning-based COVID-19 classification from chest X-ray image: case study.基于深度学习的胸部X光图像COVID-19分类：案例研究

Eur Phys J Spec Top. 2022;231(18-20):3767-3777. doi: 10.1140/epjs/s11734-022-00647-x. Epub 2022 Aug 18.

Explainable Vision Transformers and Radiomics for COVID-19 Detection in Chest X-rays.用于胸部X光片中COVID-19检测的可解释视觉Transformer与放射组学

J Clin Med. 2022 May 26;11(11):3013. doi: 10.3390/jcm11113013.

COVID-19 Detection in CT/X-ray Imagery Using Vision Transformers.使用视觉Transformer在CT/X光图像中检测新冠病毒

J Pers Med. 2022 Feb 18;12(2):310. doi: 10.3390/jpm12020310.

xViTCOS: Explainable Vision Transformer Based COVID-19 Screening Using Radiography.基于可解释视觉Transformer的 COVID-19 胸片筛查系统（xViTCOS）

IEEE J Transl Eng Health Med. 2021 Dec 8;10:1100110. doi: 10.1109/JTEHM.2021.3134096. eCollection 2022.

COVID-Transformer: Interpretable COVID-19 Detection Using Vision Transformer for Healthcare.COVID-Transformer：用于医疗保健的基于视觉Transformer 的可解释 COVID-19 检测

Int J Environ Res Public Health. 2021 Oct 21;18(21):11086. doi: 10.3390/ijerph182111086.

Exploiting Multiple Optimizers with Transfer Learning Techniques for the Identification of COVID-19 Patients.利用迁移学习技术与多种优化器结合进行 COVID-19 患者的识别。

J Healthc Eng. 2020 Nov 23;2020:8889412. doi: 10.1155/2020/8889412. eCollection 2020.

COVID-Net: a tailored deep convolutional neural network design for detection of COVID-19 cases from chest X-ray images.COVID-Net：一种针对胸部 X 光图像中 COVID-19 病例检测的定制化深度卷积神经网络设计。

Sci Rep. 2020 Nov 11;10(1):19549. doi: 10.1038/s41598-020-76550-z.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于视觉Transformer的胸部 X 射线图像肺病检测的优化。

Optimization of vision transformer-based detection of lung diseases from chest X-ray images.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献