Center for Advanced Technology and Education, Department of Electrical and Computer Engineering, Florida International University, Miami, FL, USA.
Department of Neurology, University of Florida, Gainesville, FL, USA.
J Alzheimers Dis. 2021;84(4):1497-1514. doi: 10.3233/JAD-210064.
Machine learning is a promising tool for biomarker-based diagnosis of Alzheimer's disease (AD). Performing multimodal feature selection and studying the interaction between biological and clinical AD can help to improve the performance of the diagnosis models.
This study aims to formulate a feature ranking metric based on the mutual information index to assess the relevance and redundancy of regional biomarkers and improve the AD classification accuracy.
From the Alzheimer's Disease Neuroimaging Initiative (ADNI), 722 participants with three modalities, including florbetapir-PET, flortaucipir-PET, and MRI, were studied. The multivariate mutual information metric was utilized to capture the redundancy and complementarity of the predictors and develop a feature ranking approach. This was followed by evaluating the capability of single-modal and multimodal biomarkers in predicting the cognitive stage.
Although amyloid-β deposition is an earlier event in the disease trajectory, tau PET with feature selection yielded a higher early-stage classification F1-score (65.4%) compared to amyloid-β PET (63.3%) and MRI (63.2%). The SVC multimodal scenario with feature selection improved the F1-score to 70.0% and 71.8% for the early and late-stage, respectively. When age and risk factors were included, the scores improved by 2 to 4%. The Amyloid-Tau-Neurodegeneration [AT(N)] framework helped to interpret the classification results for different biomarker categories.
The results underscore the utility of a novel feature selection approach to reduce the dimensionality of multimodal datasets and enhance model performance. The AT(N) biomarker framework can help to explore the misclassified cases by revealing the relationship between neuropathological biomarkers and cognition.
机器学习是一种很有前途的工具,可以用于基于生物标志物的阿尔茨海默病(AD)诊断。进行多模态特征选择并研究生物和临床 AD 之间的相互作用,可以帮助提高诊断模型的性能。
本研究旨在制定一种基于互信息指数的特征排名指标,以评估区域生物标志物的相关性和冗余性,并提高 AD 的分类准确性。
从阿尔茨海默病神经影像学倡议(ADNI)中,研究了 722 名具有三种模态的参与者,包括 florbetapir-PET、flortaucipir-PET 和 MRI。利用多元互信息度量来捕捉预测器的冗余性和互补性,并开发特征排名方法。然后评估单模态和多模态生物标志物预测认知阶段的能力。
尽管淀粉样蛋白-β沉积是疾病轨迹中的早期事件,但经过特征选择的 tau PET 比淀粉样蛋白-β PET(63.3%)和 MRI(63.2%)产生了更高的早期分类 F1 分数(65.4%)。具有特征选择的 SVC 多模态方案将 F1 分数分别提高到 70.0%和 71.8%,用于早期和晚期阶段。当纳入年龄和危险因素时,分数提高了 2 到 4 个百分点。淀粉样蛋白-tau-神经退行性变[AT(N)]框架有助于解释不同生物标志物类别下的分类结果。
研究结果强调了一种新的特征选择方法的实用性,该方法可以减少多模态数据集的维度并提高模型性能。AT(N)生物标志物框架可以帮助通过揭示神经病理生物标志物与认知之间的关系,来探索分类错误的病例。