Saeed Amna, Waris Asim, Fuwad Ahmed, Iqbal Javaid, Khan Jawad, AlQahtani Dokhyl, Gilani Omer, Shah Umer Hameed
Department of Biomedical Engineering and Sciences, School of Mechanical and Manufacturing Engineering, National University of Sciences and Technology (NUST), Islamabad, Pakistan.
Department of Electrical Engineering, School of Engineering, Prince Sattam Bin Abdul Aziz University, Al-Kharj, Saudi Arabia.
PLoS One. 2024 Dec 13;19(12):e0314725. doi: 10.1371/journal.pone.0314725. eCollection 2024.
With a clinical trial failure rate of 99.6% for Alzheimer's Disease (AD), early diagnosis is critical. Machine learning (ML) models have shown promising results in early AD prediction, with survival ML models outperforming typical classifiers by providing probabilities of disease progression over time. This study utilized various ML survival models to predict the time-to-conversion to AD for early (eMCI) and late (lMCI) Mild Cognitive Impairment stages, considering their different progression rates. ADNI data, consisting of 291 eMCI and 546 lMCI cases, was preprocessed to handle missing values and data imbalance. The models used included Random Survival Forest (RSF), Extra Survival Trees (XST), Gradient Boosting (GB), Survival Tree (ST), Cox-net, and Cox Proportional Hazard (CoxPH). We evaluated cognitive, cerebrospinal fluid (CSF) biomarkers, and neuroimaging modalities, both individually and combined, to identify the most influential features. Our results indicate that RSF outperformed traditional CoxPH and other ML models. For eMCI, RSF trained on multimodal data achieved a C-Index of 0.90 and an IBS of 0.10. For lMCI, the C-Index was 0.82 and the IBS was 0.16. Cognitive tests showed a statistically significant improvement over other modalities, underscoring their reliability in early prediction. Furthermore, RSF-generated individual survival curves from baseline data facilitate clinical decision-making, aiding clinicians in developing personalized treatment plans and implementing preventive measures to slow or prevent AD progression in prodromal stages.
阿尔茨海默病(AD)临床试验的失败率为99.6%,因此早期诊断至关重要。机器学习(ML)模型在AD早期预测方面已显示出有前景的结果,生存ML模型通过提供疾病随时间进展的概率,其表现优于典型分类器。本研究利用各种ML生存模型来预测早期(eMCI)和晚期(lMCI)轻度认知障碍阶段转化为AD的时间,同时考虑到它们不同的进展速度。对包含291例eMCI和546例lMCI病例的ADNI数据进行预处理,以处理缺失值和数据不平衡问题。所使用的模型包括随机生存森林(RSF)、额外生存树(XST)、梯度提升(GB)、生存树(ST)、Cox-net和Cox比例风险模型(CoxPH)。我们分别评估了认知、脑脊液(CSF)生物标志物和神经影像学模态,以及它们的组合,以确定最具影响力的特征。我们的结果表明,RSF的表现优于传统的CoxPH和其他ML模型。对于eMCI,基于多模态数据训练的RSF的C指数为0.90,IBS为0.10。对于lMCI,C指数为0.82,IBS为0.16。认知测试显示在统计学上比其他模态有显著改善,突出了它们在早期预测中的可靠性。此外,RSF从基线数据生成的个体生存曲线有助于临床决策,帮助临床医生制定个性化治疗方案,并实施预防措施,以减缓或预防前驱期AD的进展。