Genentech, Inc, 1 DNA Way, South San Francisco, CA, 94080, USA.
Sci Rep. 2024 Nov 4;14(1):26668. doi: 10.1038/s41598-024-77829-1.
Early detection of Alzheimer's disease (AD) is crucial to maximize clinical outcomes. Most disease progression analyses include people with diagnoses of cognitive impairment, limiting understanding of AD risk among those with normal cognition. The objective was to establish AD progression models through a deep learning approach to analyze heterogeneous, multi-modal datasets, including clustering analyses of population subsets. A multi-head deep-learning architecture was built to process and learn from biomedical and imaging data from the National Alzheimer's Coordinating Center. Shapley additive explanation algorithms for feature importance ranking and pairwise correlation analysis were used to identify predictors of disease progression. Four primary disease progression clusters (slow, moderate and rapid converters or non-converters) were subdivided into groups by race and sex, yielding 16 sub-clusters of participants with distinct progression patterns. A multi-head and early-fusion convolutional neural network achieved the most competitive performance and demonstrated superiority over a single-head deep learning architecture and conventional tree-based machine-learning methods, with 97% test accuracy, 96% F1 score and 0.19 root mean square error. From 447 features, 2 sets of 100 predictors of disease progression were extracted. Feature importance ranking, correlation analysis and descriptive statistics further enriched cluster analysis and validation of the heterogeneity of risk factors.
早期发现阿尔茨海默病(AD)对于最大限度地提高临床效果至关重要。大多数疾病进展分析都包括认知障碍诊断的人群,这限制了对认知正常人群 AD 风险的了解。本研究的目的是通过深度学习方法建立 AD 进展模型,以分析包括人群亚组聚类分析在内的异质多模态数据集。构建了一个多头深度学习架构,用于处理和学习来自国家阿尔茨海默病协调中心的生物医学和成像数据。使用 Shapley 可加性解释算法进行特征重要性排名和成对相关性分析,以确定疾病进展的预测因子。将四个主要的疾病进展簇(缓慢、中度和快速转化者或非转化者)按种族和性别进一步细分为亚组,从而产生了具有不同进展模式的参与者的 16 个亚组。多头和早期融合卷积神经网络的表现最为出色,优于单头深度学习架构和传统的基于树的机器学习方法,测试准确率为 97%,F1 得分为 96%,均方根误差为 0.19。从 447 个特征中提取了两组 100 个疾病进展预测因子。特征重要性排名、相关性分析和描述性统计进一步丰富了聚类分析和风险因素异质性的验证。