Xiao Xuan, Li Yihui, Wu Qiaoboyang, Liu Xinting, Cao Xu, Li Maiping, Liu Jianjing, Gong Lianggeng, Dai Xi-Jian
Department of Radiology, The Second Affiliated Hospital, Jiangxi Medical College, Nanchang University, Minde Road No. 1, Nanchang, Jiangxi Province, 330006, China.
Jiangxi Provincial Key Laboratory of Intelligent Medical Imaging, Nanchang, 330006, China.
Alzheimers Res Ther. 2025 May 13;17(1):103. doi: 10.1186/s13195-025-01750-6.
BACKGROUND: Depression serves as a prodromal symptom of dementia, and individuals with depression exhibit a significantly higher risk of developing dementia. The aim of this study is to develop and validate a novel dementia risk prediction tool among middle-aged and elderly individuals with depression based on machine learning algorithms. METHODS: This study included 31,587 middle-aged and elderly individuals with depression who did not have a diagnosis of dementia at baseline from a large UK population-based prospective cohort. A rigorous variable selection strategy was employed to identify risk and protective factors of dementia from an initial pool of 190 candidate variables, ultimately retaining 27 variables. Eight distinct data analysis strategies were utilized to develop and validate the dementia risk prediction model. The DeLong's test was applied to compare the statistical differences between different models. RESULTS: During a median follow-up of 7.98 years, 896 incident dementia cases were identified among study participants. In model development employing an 8:2 data split (fivefold cross-validation for training), the Adaboost classifier achieved the optimal performance (AUC 0.861 ± 0.003), followed by XGBoost (AUC 0.839 ± 0.005) and CatBoost (AUC 0.828 ± 0.007) classifiers. To facilitate community generalization and clinical applicability, we develop a simplified model through a forward feature subset selection algorithm, retaining 12 variables. The simplified model maintained robust performance, with AdaBoost achieving the highest discriminative ability (AUC 0.859 ± 0.002), followed by XGBoost (AUC 0.835 ± 0.001) and CatBoost (AUC 0.821 ± 0.005). The DeLong's test revealed no statistically significant difference in AUC values between models using 12 and 27 variables (p = 0.278). For practical implementation, we deployed the optimal model to a web application for visualization and dementia risk assessment, named DRP-Depression. CONCLUSIONS: We developed a practical and easy-to-promote risk prediction model based on machine learning algorithms, and deployed it to a web application to provide a new and convenient tool for dementia risk prediction in the middle-aged and elderly individuals with depression.
JAMA Netw Open. 2022-11-1
Nat Rev Neurol. 2022-6