Aiya Aditya Jayesh, Wani Nishant, Ramani Mayur, Kumar Anuj, Pant Sangeeta, Kotecha Ketan, Kulkarni Ambarish, Al-Danakh Abdullah
School of Computer Science Engineering and Applications, D Y Patil International University (DYPIU), Akrudi, Pune, 411044, Maharashtra, India.
Department of Applied Sciences, Symbiosis Institute of Technology, Symbiosis International (Deemed University), Lavale, Pune, 412115, Maharashtra, India.
Sci Rep. 2025 Aug 26;15(1):31386. doi: 10.1038/s41598-025-04591-3.
Brain tumor classification (BTC) from Magnetic Resonance Imaging (MRI) is a critical diagnosis task, which is highly important for treatment planning. In this study, we propose a hybrid deep learning (DL) model that integrates VGG16, an attention mechanism, and optimized hyperparameters to classify brain tumors into different categories as glioma, meningioma, pituitary tumor, and no tumor. The approach leverages state-of-the-art preprocessing techniques, transfer learning, and Gradient-weighted Class Activation Mapping (Grad-CAM) visualization on a dataset of 7023 MRI images to enhance both performance and interpretability. The proposed model achieves 99% test accuracy and impressive precision and recall figures and outperforms traditional approaches like Support Vector Machines (SVM) with Histogram of Oriented Gradients (HOG), Local Binary Pattern (LBP) and Principal Component Analysis (PCA) by a significant margin. Moreover, the model eliminates the need for manual labelling-a common challenge in this domain-by employing end-to-end learning, which allows the proposed model to derive meaningful features hence reducing human input. The integration of attention mechanisms further promote feature selection, in turn improving classification accuracy, while Grad-CAM visualizations show which regions of the image had the greatest impact on classification decisions, leading to increased transparency in clinical settings. Overall, the synergy of superior prediction, automatic feature extraction, and improved predictability confirms the model as an important application to neural networks approaches for brain tumor classification with valuable potential for enhancing medical imaging (MI) and clinical decision-making.
基于磁共振成像(MRI)的脑肿瘤分类(BTC)是一项关键的诊断任务,对治疗规划极为重要。在本研究中,我们提出了一种混合深度学习(DL)模型,该模型整合了VGG16、注意力机制和优化的超参数,以将脑肿瘤分为不同类别,如胶质瘤、脑膜瘤、垂体瘤和无肿瘤。该方法利用了先进的预处理技术、迁移学习以及对7023张MRI图像数据集进行梯度加权类激活映射(Grad-CAM)可视化,以提高性能和可解释性。所提出的模型实现了99%的测试准确率以及令人印象深刻的精确率和召回率,并且在很大程度上优于传统方法,如结合方向梯度直方图(HOG)、局部二值模式(LBP)和主成分分析(PCA)的支持向量机(SVM)。此外,该模型通过采用端到端学习消除了手动标注的需求——这是该领域常见的挑战——这使得所提出的模型能够提取有意义的特征,从而减少人工输入。注意力机制的整合进一步促进了特征选择,进而提高了分类准确率,而Grad-CAM可视化展示了图像的哪些区域对分类决策产生了最大影响,从而提高了临床环境中的透明度。总体而言,卓越的预测、自动特征提取和改进的可预测性之间的协同作用证实了该模型作为神经网络方法在脑肿瘤分类中的重要应用,具有增强医学成像(MI)和临床决策的宝贵潜力。