Department of Electronics and Communication Engineering, School of Engineering and Sciences, SRM University-AP, Amaravati 522503, India.
Department of Applied Data Science, Noroff University College, 4612 Kristiansand, Norway.
Curr Oncol. 2022 Oct 7;29(10):7498-7511. doi: 10.3390/curroncol29100590.
The automated classification of brain tumors plays an important role in supporting radiologists in decision making. Recently, vision transformer (ViT)-based deep neural network architectures have gained attention in the computer vision research domain owing to the tremendous success of transformer models in natural language processing. Hence, in this study, the ability of an ensemble of standard ViT models for the diagnosis of brain tumors from T1-weighted (T1w) magnetic resonance imaging (MRI) is investigated. Pretrained and finetuned ViT models (B/16, B/32, L/16, and L/32) on ImageNet were adopted for the classification task. A brain tumor dataset from figshare, consisting of 3064 T1w contrast-enhanced (CE) MRI slices with meningiomas, gliomas, and pituitary tumors, was used for the cross-validation and testing of the ensemble ViT model's ability to perform a three-class classification task. The best individual model was L/32, with an overall test accuracy of 98.2% at 384 × 384 resolution. The ensemble of all four ViT models demonstrated an overall testing accuracy of 98.7% at the same resolution, outperforming individual model's ability at both resolutions and their ensembling at 224 × 224 resolution. In conclusion, an ensemble of ViT models could be deployed for the computer-aided diagnosis of brain tumors based on T1w CE MRI, leading to radiologist relief.
脑肿瘤的自动分类在辅助放射科医生做出决策方面发挥着重要作用。最近,基于视觉转换器(ViT)的深度神经网络架构在计算机视觉研究领域引起了关注,这要归功于转换器模型在自然语言处理方面的巨大成功。因此,在这项研究中,研究了基于集成标准 ViT 模型从 T1 加权(T1w)磁共振成像(MRI)诊断脑肿瘤的能力。采用了在 ImageNet 上预训练和微调的 ViT 模型(B/16、B/32、L/16 和 L/32)进行分类任务。使用来自 figshare 的脑肿瘤数据集,其中包含 3064 个脑膜瘤、胶质瘤和垂体瘤的 T1w 对比增强(CE)MRI 切片,用于交叉验证和测试集成 ViT 模型进行三分类任务的能力。最佳的单个模型是 L/32,在 384×384 分辨率下的总体测试准确率为 98.2%。所有四个 ViT 模型的集成在相同分辨率下的总体测试准确率为 98.7%,在两个分辨率下都优于单个模型的能力,在 224×224 分辨率下的集成表现也更好。总之,基于 T1w CE MRI,可以部署基于 ViT 模型的集成来进行脑肿瘤的计算机辅助诊断,从而减轻放射科医生的负担。