Huang Kian A, Prakash Neelesh
Radiology, University of South Florida (USF) Health Morsani College of Medicine, Tampa, USA.
Cureus. 2025 Mar 20;17(3):e80872. doi: 10.7759/cureus.80872. eCollection 2025 Mar.
Background Magnetic resonance imaging (MRI) is essential for brain tumor diagnosis. Deep learning models, such as Residual Network 50 Version 2 (ResNet50V2), have demonstrated strong performance in tumor classification. However, integrating attention mechanisms may further enhance diagnostic accuracy. This study evaluates the impact of different attention mechanisms on a ResNet50V2-based MRI tumor classification model for distinguishing between meningioma, glioma, pituitary tumors, and cases with no tumor. Methods A ResNet50V2-based model was trained on 3,096 annotated MRI scans from a publicly available dataset on Kaggle. Five model configurations were evaluated: baseline ResNet50V2, Squeeze-and-Excitation (SE), Convolutional Block Attention Module (CBAM), Self-Attention (SA), and Attention Gated Network (AGNet). Performance was assessed using accuracy, area under the receiver operating characteristic curve (AUC), precision, and recall. Two-proportion Z-tests were conducted to compare classification accuracies among models. Results The SE-enhanced model achieved the highest classification performance, with an accuracy of 98.4% and an AUC of 1.00, outperforming the base ResNet50V2 (92.6%) and other attention-based frameworks (CBAM: 93.5%, SA: 91.6%, AGNet: 94.2%). Compared to the baseline model, the SE model also demonstrated improved meningioma and pituitary tumor classification (Z = 2.485, p = 0.013 and Z = 2.423, p = 0.015, respectively). Additionally, the SE model demonstrated superior precision and recall across all tumor classes. Conclusion Incorporating attention mechanisms significantly improves MRI-based tumor classification, with SE proving to be the most effective. These findings suggest that SE-enhanced models may improve diagnostic accuracy in both research and clinical applications. Future research should explore hybrid attention mechanisms, such as transformer-based models, and their broader applications in medical imaging.
磁共振成像(MRI)对于脑肿瘤诊断至关重要。深度学习模型,如残差网络50版本2(ResNet50V2),在肿瘤分类中表现出强大性能。然而,整合注意力机制可能会进一步提高诊断准确性。本研究评估不同注意力机制对基于ResNet50V2的MRI肿瘤分类模型的影响,该模型用于区分脑膜瘤、胶质瘤、垂体瘤和无肿瘤病例。方法:基于ResNet50V2的模型在Kaggle上一个公开可用数据集中的3096份带注释的MRI扫描图像上进行训练。评估了五种模型配置:基线ResNet50V2、挤压激励(SE)、卷积块注意力模块(CBAM)、自注意力(SA)和注意力门控网络(AGNet)。使用准确率、受试者工作特征曲线下面积(AUC)、精确率和召回率评估性能。进行双比例Z检验以比较模型之间的分类准确率。结果:SE增强模型实现了最高的分类性能,准确率为98.4%,AUC为1.00,优于基础ResNet50V2(92.6%)和其他基于注意力的框架(CBAM:93.5%,SA:91.6%,AGNet:94.2%)。与基线模型相比,SE模型在脑膜瘤和垂体瘤分类方面也表现出改善(Z = 2.485,p = 0.013;Z = 2.423,p = 0.015)。此外,SE模型在所有肿瘤类别中都表现出更高的精确率和召回率。结论:纳入注意力机制显著提高了基于MRI的肿瘤分类,其中SE被证明是最有效的。这些发现表明,SE增强模型可能会提高研究和临床应用中的诊断准确性。未来的研究应探索混合注意力机制,如基于Transformer的模型,及其在医学成像中的更广泛应用。