Aragon Institute on Engineering Research, University of Zaragoza, Zaragoza, Spain.
PLoS One. 2022 May 6;17(5):e0264695. doi: 10.1371/journal.pone.0264695. eCollection 2022.
The Alzheimer's Disease Prediction Of Longitudinal Evolution (TADPOLE) Challenge is the most comprehensive challenge to date with regard to the number of subjects, considered features, and challenge participants. The initial objective of TADPOLE was the identification of the most predictive data, features, and methods for the progression of subjects at risk of developing Alzheimer's. The challenge was successful in recognizing tree-based ensemble methods such as gradient boosting and random forest as the best methods for the prognosis of the clinical status in Alzheimer's disease (AD). However, the challenge outcome was limited to which combination of data processing and methods exhibits the best accuracy; hence, it is difficult to determine the contribution of the methods to the accuracy. The quantification of feature importance was globally approached by all the challenge participant methods. In addition, TADPOLE provided general answers that focused on improving performance while ignoring important issues such as interpretability. The purpose of this study is to intensively explore the models of the top three TADPOLE Challenge methods in a common framework for fair comparison. In addition, for these models, the most meaningful features for the prognosis of the clinical status of AD are studied and the contribution of each feature to the accuracy of the methods is quantified. We provide plausible explanations as to why the methods achieve such accuracy, and we investigate whether the methods use information coherent with clinical knowledge. Finally, we approach these issues through the analysis of SHapley Additive exPlanations (SHAP) values, a technique that has recently attracted increasing attention in the field of explainable artificial intelligence (XAI).
阿尔茨海默病纵向演变预测(TADPOLE)挑战赛是迄今为止在参与者数量、考虑的特征和挑战方面最全面的挑战。TADPOLE 的最初目标是确定最具预测性的数据、特征和方法,以预测有患阿尔茨海默病风险的受试者的进展。该挑战赛成功地识别了基于树的集成方法,如梯度提升和随机森林,作为阿尔茨海默病(AD)临床状态预测的最佳方法。然而,挑战赛的结果仅限于哪种数据处理和方法的组合表现出最佳的准确性;因此,很难确定方法对准确性的贡献。所有挑战赛参与者的方法都从全局上对特征重要性进行了量化。此外,TADPOLE 提供了一般的答案,侧重于提高性能,而忽略了可解释性等重要问题。本研究的目的是在一个通用框架中深入探讨 TADPOLE 挑战赛前三名方法的模型,以进行公平比较。此外,对于这些模型,我们研究了对 AD 临床状态预后最有意义的特征,并量化了每个特征对方法准确性的贡献。我们提供了合理的解释,说明为什么这些方法能够达到如此高的准确性,并研究了这些方法是否使用了与临床知识一致的信息。最后,我们通过分析最近在可解释人工智能(XAI)领域引起越来越多关注的 SHapley Additive exPlanations(SHAP)值来解决这些问题。