Ensemble of vision transformer architectures for efficient Alzheimer's Disease classification.

Author information

Shaffi Noushath, Viswan Vimbi, Mahmud Mufti

Affiliations

Department of Computer Science, College of Science, Sultan Qaboos University, P.O. Box: 36, Al-Khod, 123, Muscat, Sultanate of Oman.

College of Computing and Information Sciences, University of Technology and Applied Sciences, OM 311, Sohar, Sultanate of Oman.

Publication information

Brain Inform. 2024 Oct 3;11(1):25. doi: 10.1186/s40708-024-00238-7.

DOI:10.1186/s40708-024-00238-7

PMID:39363122

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11450128/

Abstract

Transformers have dominated the landscape of Natural Language Processing (NLP) and revolutionized generative AI applications. Vision Transformers (VTs) have recently become the state of the art for computer vision applications. Motivated by the success of VTs in capturing short- and long-range dependencies and their ability to handle class imbalance, this paper proposes an ensemble framework of VTs for the efficient classification of Alzheimer's Disease (AD). The framework consists of four vanilla VTs and ensembles formed from them using hard- and soft-voting approaches. The proposed model was tested on two popular AD datasets, OASIS and ADNI. The ADNI dataset was used to assess the models' efficacy under imbalanced and data-scarce conditions. The VT ensembles improved on the individual models by around 2%. The results were also compared with state-of-the-art and custom-built Convolutional Neural Network (CNN) architectures and Machine Learning (ML) models under varying data conditions. The experiments showed an overall accuracy gain of 4.14% over the ML algorithms and 4.72% over the CNN algorithms. The study also identifies specific limitations and proposes avenues for future research. The code used in the study is publicly available.
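As an illustration of the two ensembling strategies named in the abstract, the following is a minimal sketch of hard and soft voting over per-model class probabilities. It is not the authors' implementation: the function names, the three-class setup, and the probability values are all hypothetical and chosen only to show how the two rules can disagree.

```python
from collections import Counter


def hard_vote(prob_sets):
    """Each model votes for its most probable class; the majority class wins."""
    votes = [max(range(len(p)), key=lambda c: p[c]) for p in prob_sets]
    # With equal counts, most_common returns the class that was seen first.
    return Counter(votes).most_common(1)[0][0]


def soft_vote(prob_sets):
    """Average the class probabilities across models, then pick the top class."""
    n_models = len(prob_sets)
    n_classes = len(prob_sets[0])
    mean = [sum(p[c] for p in prob_sets) / n_models for c in range(n_classes)]
    return max(range(n_classes), key=lambda c: mean[c])


# Hypothetical softmax outputs of four models for a single scan
# (three illustrative classes, indexed 0, 1, 2).
probs = [
    [0.51, 0.49, 0.00],  # model 1: weak preference for class 0
    [0.52, 0.48, 0.00],  # model 2: weak preference for class 0
    [0.50, 0.45, 0.05],  # model 3: weak preference for class 0
    [0.05, 0.90, 0.05],  # model 4: strong preference for class 1
]

print("hard vote:", hard_vote(probs))  # three of four models pick class 0
print("soft vote:", soft_vote(probs))  # averaged probabilities favor class 1
```

In this toy case hard voting follows the three weakly confident models, while soft voting is swayed by the one highly confident model, which is the practical difference between the two rules.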

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f96/11450128/985fe9b04d9f/40708_2024_238_Fig1_HTML.jpg
