Suppr超能文献

中国中老年癌症患者抑郁预测的可解释机器学习模型:基于中国健康与养老追踪调查(CHARLS)队列的研究

An interpretable machine learning model for predicting depression in middle-aged and elderly cancer patients in China: a study based on the CHARLS cohort.

作者信息

Xiao Yue, Zhao Zejin, Su Chen-Guang, Li Jian, Liu Jinlong

机构信息

Department of Hepatobiliary Surgery, The Affiliated Hospital of Chengde Medical University, Chengde, 067000, Hebei Province, China.

Hebei Key Laboratory of Panvascular Diseases, Chengde, 067000, Hebei Province, China.

出版信息

BMC Psychiatry. 2025 Jul 1;25(1):610. doi: 10.1186/s12888-025-07074-x.

Abstract

BACKGROUND

Depression is very common in middle-aged and elderly cancer patients, which will seriously damage the quality of life and treatment effect of patients. This study aims to use machine learning methods to develop a predictive model to identify depression risk. However, since the traditional machine learning models have 'black box nature', Shapley Additive exPlanation is used to determine the key risk factors.

METHODS

This study included 743 cancer patients aged 45 and above from the 2011-2020 China Health and Retirement Longitudinal Study (CHARLS), and analyzed a total of 19 variables including demographic factors, economic factors, health factors, family factors, and personal factors. After screening the predictive features by LASSO regression, in order to determine the best model for prediction, six machine learning models-Support Vector Machine, K-Nearest Neighbors, Multi-layer Perceptron, Decision Tree, XGBoost and Random Forest were trained.

RESULTS

After training, the random forest model showed the best decision performance, AUC (95% CI): 0.774 (0.740-0.809). Subsequently, the model was interpreted by Shapley Additive exPlanation, and five key characteristics affecting the risk of depression were found. The feature importance plot intuitively shows that the predicted depression risk is significantly increased for patients with poor life satisfaction.

CONCLUSIONS

We developed a clinical visualization model for health care providers to estimate the risk of depression in middle-aged and elderly cancer patients. As a powerful tool for early identification of depressive symptoms in middle-aged and elderly cancer patients, this model enables medical workers to implement clinical interventions earlier to obtain better clinical benefits.

摘要

背景

抑郁症在中老年癌症患者中非常常见,这将严重损害患者的生活质量和治疗效果。本研究旨在使用机器学习方法开发一个预测模型来识别抑郁风险。然而,由于传统机器学习模型具有“黑箱性质”,因此使用夏普利值加法解释(Shapley Additive exPlanation)来确定关键风险因素。

方法

本研究纳入了2011 - 2020年中国健康与养老追踪调查(CHARLS)中743名年龄在45岁及以上的癌症患者,共分析了包括人口统计学因素、经济因素、健康因素、家庭因素和个人因素在内的19个变量。通过LASSO回归筛选预测特征后,为了确定最佳预测模型,对六个机器学习模型——支持向量机、K近邻、多层感知器、决策树、XGBoost和随机森林进行了训练。

结果

训练后,随机森林模型显示出最佳决策性能,AUC(95%置信区间):0.774(0.740 - 0.809)。随后,通过夏普利值加法解释对该模型进行解释,发现了影响抑郁风险的五个关键特征。特征重要性图直观地显示,生活满意度差的患者预测抑郁风险显著增加。

结论

我们为医疗保健提供者开发了一种临床可视化模型,以估计中老年癌症患者的抑郁风险。作为早期识别中老年癌症患者抑郁症状的有力工具,该模型使医务人员能够更早地实施临床干预,以获得更好的临床效益。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ecc/12210965/12ac09c347f8/12888_2025_7074_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验