Development and validation of a novel predictive model for dementia risk in middle-aged and elderly depression individuals: a large and longitudinal machine learning cohort study.

BACKGROUND: Depression serves as a prodromal symptom of dementia, and individuals with depression have a significantly higher risk of developing dementia. This study aimed to develop and validate a novel machine learning-based dementia risk prediction tool for middle-aged and elderly individuals with depression.
METHODS: This study included 31,587 middle-aged and elderly individuals with depression who had no diagnosis of dementia at baseline, drawn from a large UK population-based prospective cohort. A rigorous variable selection strategy was used to identify risk and protective factors for dementia from an initial pool of 190 candidate variables, ultimately retaining 27 variables. Eight distinct data analysis strategies were used to develop and validate the dementia risk prediction model. DeLong's test was applied to compare AUCs between models.
RESULTS: During a median follow-up of 7.98 years, 896 incident dementia cases were identified among study participants. In model development using an 8:2 data split (with fivefold cross-validation for training), the AdaBoost classifier achieved the best performance (AUC 0.861 ± 0.003), followed by the XGBoost (AUC 0.839 ± 0.005) and CatBoost (AUC 0.828 ± 0.007) classifiers. To facilitate community generalization and clinical applicability, we developed a simplified model through a forward feature subset selection algorithm, retaining 12 variables. The simplified model maintained robust performance, with AdaBoost achieving the highest discriminative ability (AUC 0.859 ± 0.002), followed by XGBoost (AUC 0.835 ± 0.001) and CatBoost (AUC 0.821 ± 0.005). DeLong's test revealed no statistically significant difference in AUC between the models using 12 and 27 variables (p = 0.278). For practical implementation, we deployed the optimal model as a web application for visualization and dementia risk assessment, named DRP-Depression.
CONCLUSIONS: We developed a practical, easy-to-promote risk prediction model based on machine learning algorithms and deployed it as a web application, providing a new and convenient tool for dementia risk prediction in middle-aged and elderly individuals with depression.
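
The abstract compares the 12-variable and 27-variable models with DeLong's test but does not show how the test works. The following is a minimal sketch of the standard DeLong procedure for two correlated AUCs evaluated on the same individuals, not the authors' code. The function name `delong_test` and its arguments are hypothetical, and the sketch uses only the Python standard library.

```python
import math


def _components(pos, neg):
    """Structural components V10 (one per case) and V01 (one per non-case)."""
    def psi(x, y):
        # 1 if the case outranks the non-case, 0.5 on a tie, 0 otherwise.
        return 1.0 if x > y else 0.5 if x == y else 0.0

    v10 = [sum(psi(x, y) for y in neg) / len(neg) for x in pos]
    v01 = [sum(psi(x, y) for x in pos) / len(pos) for y in neg]
    return v10, v01


def _sample_var(values):
    """Unbiased sample variance (divisor len - 1)."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / (len(values) - 1)


def delong_test(y_true, scores_a, scores_b):
    """Two-sided DeLong test for two models scored on the same subjects.

    y_true holds 1 for cases and 0 for non-cases (at least two of each).
    Returns (auc_a, auc_b, z, p_value).
    """
    pos_idx = [i for i, y in enumerate(y_true) if y == 1]
    neg_idx = [i for i, y in enumerate(y_true) if y == 0]
    m, n = len(pos_idx), len(neg_idx)

    v10_a, v01_a = _components([scores_a[i] for i in pos_idx],
                               [scores_a[i] for i in neg_idx])
    v10_b, v01_b = _components([scores_b[i] for i in pos_idx],
                               [scores_b[i] for i in neg_idx])

    auc_a = sum(v10_a) / m
    auc_b = sum(v10_b) / m

    # Variance of the AUC difference, computed from the per-subject
    # differences of the components; this equals
    # Var(A) + Var(B) - 2 Cov(A, B) in DeLong's formulation.
    d10 = [a - b for a, b in zip(v10_a, v10_b)]
    d01 = [a - b for a, b in zip(v01_a, v01_b)]
    var = _sample_var(d10) / m + _sample_var(d01) / n

    if var == 0.0:
        # Degenerate case (convention chosen for this sketch): the models
        # rank every pair identically, or differ with no estimated variance.
        same = auc_a == auc_b
        return auc_a, auc_b, (0.0 if same else math.inf), (1.0 if same else 0.0)

    z = (auc_a - auc_b) / math.sqrt(var)
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal p-value
    return auc_a, auc_b, z, p_value
```

Called as `delong_test(labels, full_model_scores, simplified_model_scores)` on a held-out set, a large p-value (such as the p = 0.278 reported above) indicates that the data do not show a difference in discrimination between the two models. The pairwise comparison here is quadratic in sample size, so a cohort of this size would call for a rank-based implementation.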
