基于相对位置编码和残差 MLP 的 VIT-B/16 脑肿瘤分类。

Brain tumor classification in VIT-B/16 based on relative position encoding and residual MLP.

机构信息

School of Information Science and Engineering, Wuhan University of Science and Technology, Wuhan, Hubei, China.

出版信息

PLoS One. 2024 Jul 2;19(7):e0298102. doi: 10.1371/journal.pone.0298102. eCollection 2024.

DOI:10.1371/journal.pone.0298102

PMID:38954731

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11218980/

Abstract

Brain tumors pose a significant threat to health, and their early detection and classification are crucial. Currently, the diagnosis heavily relies on pathologists conducting time-consuming morphological examinations of brain images, leading to subjective outcomes and potential misdiagnoses. In response to these challenges, this study proposes an improved Vision Transformer-based algorithm for human brain tumor classification. To overcome the limitations of small existing datasets, Homomorphic Filtering, Channels Contrast Limited Adaptive Histogram Equalization, and Unsharp Masking techniques are applied to enrich dataset images, enhancing information and improving model generalization. Addressing the limitation of the Vision Transformer's self-attention structure in capturing input token sequences, a novel relative position encoding method is employed to enhance the overall predictive capabilities of the model. Furthermore, the introduction of residual structures in the Multi-Layer Perceptron tackles convergence degradation during training, leading to faster convergence and enhanced algorithm accuracy. Finally, this study comprehensively analyzes the network model's performance on validation sets in terms of accuracy, precision, and recall. Experimental results demonstrate that the proposed model achieves a classification accuracy of 91.36% on an augmented open-source brain tumor dataset, surpassing the original VIT-B/16 accuracy by 5.54%. This validates the effectiveness of the proposed approach in brain tumor classification, offering potential reference for clinical diagnoses by medical practitioners.

摘要

脑肿瘤对健康构成重大威胁，早期发现和分类至关重要。目前，诊断主要依赖病理学家对脑图像进行耗时的形态学检查，导致结果主观且存在潜在误诊。针对这些挑战，本研究提出了一种改进的基于 Vision Transformer 的人脑肿瘤分类算法。为了克服现有小数据集的局限性，应用同态滤波、通道对比度受限自适应直方图均衡化和非锐化掩模技术来丰富数据集图像，增强信息并提高模型泛化能力。针对 Vision Transformer 自注意力结构在捕获输入令牌序列方面的局限性，采用了一种新颖的相对位置编码方法来增强模型的整体预测能力。此外，在多层感知机中引入残差结构解决了训练过程中的收敛退化问题，实现更快的收敛和更高的算法精度。最后，本研究全面分析了网络模型在验证集上的性能，包括准确性、精度和召回率。实验结果表明，该模型在增强的开源脑肿瘤数据集上的分类准确率达到 91.36%，超过了原始 VIT-B/16 的准确率 5.54%。这验证了该方法在脑肿瘤分类中的有效性，为临床医生的诊断提供了潜在的参考。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/016c/11218980/9d3b0db6f21f/pone.0298102.g001.jpg

相似文献

Brain tumor classification in VIT-B/16 based on relative position encoding and residual MLP.

PLoS One. 2024 Jul 2;19(7):e0298102. doi: 10.1371/journal.pone.0298102. eCollection 2024.

Self-attention-based generative adversarial network optimized with color harmony algorithm for brain tumor classification.

Electromagn Biol Med. 2024 Apr 2;43(1-2):31-45. doi: 10.1080/15368378.2024.2312363. Epub 2024 Feb 18.

Brain tumor detection and classification in MRI using hybrid ViT and GRU model with explainable AI in Southern Bangladesh.

Sci Rep. 2024 Oct 1;14(1):22797. doi: 10.1038/s41598-024-71893-3.

Refining neural network algorithms for accurate brain tumor classification in MRI imagery.

BMC Med Imaging. 2024 May 21;24(1):118. doi: 10.1186/s12880-024-01285-6.

Brain tumor classification for MRI images using dual-discriminator conditional generative adversarial network.

Electromagn Biol Med. 2024 Apr 2;43(1-2):81-94. doi: 10.1080/15368378.2024.2321352. Epub 2024 Mar 10.

Transformer guided self-adaptive network for multi-scale skin lesion image segmentation.

Comput Biol Med. 2024 Feb;169:107846. doi: 10.1016/j.compbiomed.2023.107846. Epub 2023 Dec 23.

Abdomen CT multi-organ segmentation using token-based MLP-Mixer.

Med Phys. 2023 May;50(5):3027-3038. doi: 10.1002/mp.16135. Epub 2022 Dec 20.

Classification of Brain Tumor from Magnetic Resonance Imaging Using Vision Transformers Ensembling.

Curr Oncol. 2022 Oct 7;29(10):7498-7511. doi: 10.3390/curroncol29100590.

LTPLN: Automatic pavement distress detection.

PLoS One. 2024 Oct 10;19(10):e0309172. doi: 10.1371/journal.pone.0309172. eCollection 2024.

Investigating the Role of Image Fusion in Brain Tumor Classification Models Based on Machine Learning Algorithm for Personalized Medicine.

Comput Math Methods Med. 2022 Feb 7;2022:7137524. doi: 10.1155/2022/7137524. eCollection 2022.

引用本文的文献

Classification of fashion e-commerce products using ResNet-BERT multi-modal deep learning and transfer learning optimization.

PLoS One. 2025 May 22;20(5):e0324621. doi: 10.1371/journal.pone.0324621. eCollection 2025.

Deep learning in assisting dermatologists in classifying basal cell carcinoma from seborrheic keratosis.

Front Oncol. 2025 Apr 24;15:1507322. doi: 10.3389/fonc.2025.1507322. eCollection 2025.

本文引用的文献

COVID-19 detection and analysis from lung CT images using novel channel boosted CNNs.

Expert Syst Appl. 2023 Nov 1;229:120477. doi: 10.1016/j.eswa.2023.120477. Epub 2023 May 16.

Attention-guided multi-scale deep object detection framework for lymphocyte analysis in IHC histological images.

Microscopy (Oxf). 2023 Feb 8;72(1):27-42. doi: 10.1093/jmicro/dfac051.

COVID-19 detection in chest X-ray images using deep boosted hybrid learning.

Comput Biol Med. 2021 Oct;137:104816. doi: 10.1016/j.compbiomed.2021.104816. Epub 2021 Aug 29.

Microscopic brain tumor detection and classification using 3D CNN and feature selection architecture.

Microsc Res Tech. 2021 Jan;84(1):133-149. doi: 10.1002/jemt.23597. Epub 2020 Sep 21.

An overview of deep learning in medical imaging focusing on MRI.

Z Med Phys. 2019 May;29(2):102-127. doi: 10.1016/j.zemedi.2018.11.002. Epub 2018 Dec 13.

Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries.

CA Cancer J Clin. 2018 Nov;68(6):394-424. doi: 10.3322/caac.21492. Epub 2018 Sep 12.

Computer-aided diagnosis in medical imaging: historical review, current status and future potential.

Comput Med Imaging Graph. 2007 Jun-Jul;31(4-5):198-211. doi: 10.1016/j.compmedimag.2007.02.002. Epub 2007 Mar 8.

Central nervous system tumors in donors: misdiagnosis carries a high morbidity and mortality.

Transplant Proc. 2005 Mar;37(2):583-4. doi: 10.1016/j.transproceed.2004.12.125.

High resolution quantitative relaxation and diffusion MRI of three different experimental brain tumors in rat.

Magn Reson Med. 1995 Dec;34(6):835-44. doi: 10.1002/mrm.1910340608.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于相对位置编码和残差 MLP 的 VIT-B/16 脑肿瘤分类。

Brain tumor classification in VIT-B/16 based on relative position encoding and residual MLP.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献