Lazli Lilia
Department of Computer and Software Engineering, Polytechnique Montréal, University of Montreal, 2500 Chem de Polytechnique, Montreal, QC, H3T 1J4, Canada, 1(514) 340-5121 ext 3750.
JMIRx Med. 2025 Apr 21;6:e60866. doi: 10.2196/60866.
Alzheimer disease (AD) is a severe neurological brain disorder. While not curable, earlier detection can help improve symptoms substantially. Machine learning (ML) models are popular and well suited for medical image processing tasks such as computer-aided diagnosis. These techniques can improve the process for an accurate diagnosis of AD.
In this paper, a complete computer-aided diagnosis system for the diagnosis of AD has been presented. We investigate the performance of some of the most used ML techniques for AD detection and classification using neuroimages from the Open Access Series of Imaging Studies (OASIS) and Alzheimer's Disease Neuroimaging Initiative (ADNI) datasets.
The system uses artificial neural networks (ANNs) and support vector machines (SVMs) as classifiers, and dimensionality reduction techniques as feature extractors. To retrieve features from the neuroimages, we used principal component analysis (PCA), linear discriminant analysis, and t-distributed stochastic neighbor embedding. These features are fed into feedforward neural networks (FFNNs) and SVM-based ML classifiers. Furthermore, we applied the vision transformer (ViT)-based ANNs in conjunction with data augmentation to distinguish patients with AD from healthy controls.
Experiments were performed on magnetic resonance imaging and positron emission tomography scans. The OASIS dataset included a total of 300 patients, while the ADNI dataset included 231 patients. For OASIS, 90 (30%) patients were healthy and 210 (70%) were severely impaired by AD. Likewise for the ADNI database, a total of 149 (64.5%) patients with AD were detected and 82 (35.5%) patients were used as healthy controls. An important difference was established between healthy patients and patients with AD (P=.02). We examined the effectiveness of the three feature extractors and classifiers using 5-fold cross-validation and confusion matrix-based standard classification metrics, namely, accuracy, sensitivity, specificity, precision, F1-score, and area under the receiver operating characteristic curve (AUROC). Compared with the state-of-the-art performing methods, the success rate was satisfactory for all the created ML models, but SVM and FFNN performed best with the PCA extractor, while the ViT classifier performed best with more data. The data augmentation/ViT approach worked better overall, achieving accuracies of 93.2% (sensitivity=87.2, specificity=90.5, precision=87.6, F1-score=88.7, and AUROC=92) for OASIS and 90.4% (sensitivity=85.4, specificity=88.6, precision=86.9, F1-score=88, and AUROC=90) for ADNI.
Effective ML models using neuroimaging data could help physicians working on AD diagnosis and will assist them in prescribing timely treatment to patients with AD. Good results were obtained on the OASIS and ADNI datasets with all the proposed classifiers, namely, SVM, FFNN, and ViTs. However, the results show that the ViT model is much better at predicting AD than the other models when a sufficient amount of data are available to perform the training. This highlights that the data augmentation process could impact the overall performance of the ViT model.
阿尔茨海默病(AD)是一种严重的神经脑部疾病。虽然无法治愈,但早期检测有助于显著改善症状。机器学习(ML)模型很受欢迎,非常适合诸如计算机辅助诊断等医学图像处理任务。这些技术可以改进AD的准确诊断过程。
本文提出了一个用于AD诊断的完整计算机辅助诊断系统。我们使用来自开放获取影像研究系列(OASIS)和阿尔茨海默病神经影像倡议(ADNI)数据集的神经影像,研究了一些最常用的ML技术在AD检测和分类方面的性能。
该系统使用人工神经网络(ANN)和支持向量机(SVM)作为分类器,并使用降维技术作为特征提取器。为了从神经影像中检索特征,我们使用了主成分分析(PCA)、线性判别分析和t分布随机邻域嵌入。这些特征被输入到前馈神经网络(FFNN)和基于SVM的ML分类器中。此外,我们将基于视觉Transformer(ViT)的ANN与数据增强相结合,以区分AD患者和健康对照。
对磁共振成像和正电子发射断层扫描进行了实验。OASIS数据集共有300名患者,而ADNI数据集有231名患者。对于OASIS,90名(30%)患者健康,210名(70%)患者因AD严重受损。同样,对于ADNI数据库,共检测到149名(64.5%)AD患者,82名(35.5%)患者作为健康对照。健康患者和AD患者之间存在显著差异(P = 0.02)。我们使用5折交叉验证和基于混淆矩阵的标准分类指标,即准确率、灵敏度、特异性、精确率、F1分数和受试者工作特征曲线下面积(AUROC),检验了三种特征提取器和分类器的有效性。与现有最佳性能方法相比,所有创建的ML模型的成功率都令人满意,但SVM和FFNN与PCA提取器配合表现最佳,而ViT分类器在数据更多时表现最佳。数据增强/ViT方法总体效果更好,OASIS数据集的准确率为93.2%(灵敏度 = 87.2,特异性 = 90.5,精确率 = 87.6,F1分数 = 88.7,AUROC = 92),ADNI数据集的准确率为90.4%(灵敏度 = 85.4,特异性 = 88.6,精确率 = 86.9,F1分数 = 88,AUROC = 90)。
使用神经影像数据的有效ML模型可以帮助从事AD诊断的医生,并协助他们为AD患者及时开出处方治疗。所有提出的分类器,即SVM、FFNN和ViT,在OASIS和ADNI数据集上都取得了良好的结果。然而,结果表明,当有足够的数据进行训练时,ViT模型在预测AD方面比其他模型要好得多。这突出表明数据增强过程可能会影响ViT模型的整体性能。