Hussain Sayyed Shahid, Degang Xu, Shah Pir Masoom, Khan Hikmat, Zeb Adnan
School of Automation, Central South University, Changsha, 410083, China.
School of Automation, Central South University, Changsha, 410083, China.
Comput Med Imaging Graph. 2025 Sep;124:102638. doi: 10.1016/j.compmedimag.2025.102638. Epub 2025 Aug 20.
Alzheimer's disease (AD) is the most common neurodegenerative progressive disorder and the fifth-leading cause of death in older people. The detection of AD is a very challenging task for clinicians and radiologists due to the complex nature of this disease, thus requiring automatic data-driven machine-learning models to enhance diagnostic accuracy and support expert decision-making. However, machine learning models are hindered by three key limitations, in AD classification:(i) diffuse and subtle structural changes in the brain that make it difficult to capture global pathology (ii) non-uniform alterations across MRI planes, which limit single-view learning and (iii) the lack of deep integration of demographic context, which is often ignored despite its clinical importance. To address these challenges in this paper, we propose a novel multi-modal deep learning framework, named AlzFormer, that dynamically integrates 3D MRI with demographic features represented as knowledge graph embeddings for AD classification. Specifically, (i) to capture global and volumetric features, a 3D CNN is employed; (ii) to model plane-specific information, three parallel 2D CNNs are used for tri-planar processing (axial, coronal, sagittal), combined with a Transformer encoder; and (iii) to incorporate demographic context, we integrate demographic features as knowledge graph embeddings through a novel Adaptive Attention Gating mechanism that balances contributions from both modalities (i.e., MRI and demographics). Comprehensive experiments on two real-world datasets, including generalization tests, ablation studies, and robustness evaluation under noisy conditions, demonstrate that the proposed model provides a robust and effective solution for AD diagnosis. These results suggest strong potential for integration into Clinical Decision Support Systems (CDSS), offering a more interpretable and personalized approach to early Alzheimer's detection.
阿尔茨海默病(AD)是最常见的神经退行性进展性疾病,也是老年人死亡的第五大原因。由于这种疾病的复杂性,AD的检测对临床医生和放射科医生来说是一项极具挑战性的任务,因此需要自动的数据驱动机器学习模型来提高诊断准确性并支持专家决策。然而,机器学习模型在AD分类中受到三个关键限制:(i)大脑中弥漫性和细微的结构变化使得难以捕捉整体病理情况;(ii)MRI平面上的不均匀变化限制了单视图学习;(iii)人口统计学背景缺乏深度整合,尽管其具有临床重要性,但往往被忽视。为了解决本文中的这些挑战,我们提出了一种新颖的多模态深度学习框架,名为AlzFormer,它将3D MRI与表示为知识图谱嵌入的人口统计学特征动态集成用于AD分类。具体而言,(i)为了捕捉全局和体积特征,采用了3D卷积神经网络(CNN);(ii)为了对特定平面信息进行建模,使用三个并行的2D CNN进行三平面处理(轴向、冠状、矢状),并结合一个Transformer编码器;(iii)为了纳入人口统计学背景,我们通过一种新颖的自适应注意力门控机制将人口统计学特征作为知识图谱嵌入进行整合,该机制平衡了两种模态(即MRI和人口统计学)的贡献。在两个真实世界数据集上进行的综合实验,包括泛化测试、消融研究以及噪声条件下的鲁棒性评估,表明所提出的模型为AD诊断提供了一种强大而有效的解决方案。这些结果表明该模型在集成到临床决策支持系统(CDSS)方面具有巨大潜力,为早期阿尔茨海默病检测提供了一种更具可解释性和个性化的方法。