Xiao Yue, Zhao Zejin, Su Chen-Guang, Li Jian, Liu Jinlong
Department of Hepatobiliary Surgery, The Affiliated Hospital of Chengde Medical University, Chengde, 067000, Hebei Province, China.
Hebei Key Laboratory of Panvascular Diseases, Chengde, 067000, Hebei Province, China.
BMC Psychiatry. 2025 Jul 1;25(1):610. doi: 10.1186/s12888-025-07074-x.
BACKGROUND: Depression is very common in middle-aged and elderly cancer patients, which will seriously damage the quality of life and treatment effect of patients. This study aims to use machine learning methods to develop a predictive model to identify depression risk. However, since the traditional machine learning models have 'black box nature', Shapley Additive exPlanation is used to determine the key risk factors. METHODS: This study included 743 cancer patients aged 45 and above from the 2011-2020 China Health and Retirement Longitudinal Study (CHARLS), and analyzed a total of 19 variables including demographic factors, economic factors, health factors, family factors, and personal factors. After screening the predictive features by LASSO regression, in order to determine the best model for prediction, six machine learning models-Support Vector Machine, K-Nearest Neighbors, Multi-layer Perceptron, Decision Tree, XGBoost and Random Forest were trained. RESULTS: After training, the random forest model showed the best decision performance, AUC (95% CI): 0.774 (0.740-0.809). Subsequently, the model was interpreted by Shapley Additive exPlanation, and five key characteristics affecting the risk of depression were found. The feature importance plot intuitively shows that the predicted depression risk is significantly increased for patients with poor life satisfaction. CONCLUSIONS: We developed a clinical visualization model for health care providers to estimate the risk of depression in middle-aged and elderly cancer patients. As a powerful tool for early identification of depressive symptoms in middle-aged and elderly cancer patients, this model enables medical workers to implement clinical interventions earlier to obtain better clinical benefits.
背景:抑郁症在中老年癌症患者中非常常见,这将严重损害患者的生活质量和治疗效果。本研究旨在使用机器学习方法开发一个预测模型来识别抑郁风险。然而,由于传统机器学习模型具有“黑箱性质”,因此使用夏普利值加法解释(Shapley Additive exPlanation)来确定关键风险因素。 方法:本研究纳入了2011 - 2020年中国健康与养老追踪调查(CHARLS)中743名年龄在45岁及以上的癌症患者,共分析了包括人口统计学因素、经济因素、健康因素、家庭因素和个人因素在内的19个变量。通过LASSO回归筛选预测特征后,为了确定最佳预测模型,对六个机器学习模型——支持向量机、K近邻、多层感知器、决策树、XGBoost和随机森林进行了训练。 结果:训练后,随机森林模型显示出最佳决策性能,AUC(95%置信区间):0.774(0.740 - 0.809)。随后,通过夏普利值加法解释对该模型进行解释,发现了影响抑郁风险的五个关键特征。特征重要性图直观地显示,生活满意度差的患者预测抑郁风险显著增加。 结论:我们为医疗保健提供者开发了一种临床可视化模型,以估计中老年癌症患者的抑郁风险。作为早期识别中老年癌症患者抑郁症状的有力工具,该模型使医务人员能够更早地实施临床干预,以获得更好的临床效益。
Health Technol Assess. 2006-9
Clin Orthop Relat Res. 2024-9-1
BMC Psychiatry. 2025-2-19
PeerJ Comput Sci. 2023-11-29
Neurosci Res. 2025-2
J Clin Oncol. 2023-6-20