Ke Haopeng, Xu Anning, Zhou Haofeng, Chen Junnian, Wu Wenjing, He Qian, Cao Huanyi
Department of Endocrinology, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China; Guangdong Cardiovascular Institute, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou 510000, China.
Guangdong Cardiovascular Institute, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou 510000, China.
J Affect Disord. 2025 Oct 15;387:119494. doi: 10.1016/j.jad.2025.119494. Epub 2025 May 27.
The incidence of cardiovascular metabolic diseases (CMD) is increasing, and depression in CMD patients significantly impacts prognosis. Therefore, this study aimed to develop and validate a predictive model for depression in CMD patients using machine learning methods.
The study utilized data from the Survey of Health, Ageing, and Retirement in Europe (SHARE) for model derivation and internal validation, and data from the China Health and Retirement Longitudinal Study (CHARLS) for external validation. Logistic Regression, K-nearest neighbors, Support Vector Machine, Random Forest, Gradient Boosting Machine (GBM), and Light Gradient Boosting Machine were used to construct depression prediction models. The model performance was assessed mainly using area under the receiver operator characteristic curve (AUC), brier score, calibration plots and decision curve analysis (DCA). Model interpretations were generated using the Shapley additive explanations (SHAP) method.
Among the 14,884 participants in SHARE and 1128 in CHARLS, 5456 and 474 had depression, respectively. The Gradient Boosting Machine (GBM) model demonstrated the best performance in terms of discrimination and calibration, with an AUC of 0.823 in the external validation cohort, and the DCA also verified that the GBM model had the best clinical practicality. The SHAP method revealed that trouble sleep, life satisfaction and loneliness were the top 3 predictors of depression. For the convenience of clinicians, we developed a clinical support system based on GBM model.
We integrated the GBM model into a clinical support system which could assist clinicians in early identifying CMD patients at high risk for depression.
心血管代谢疾病(CMD)的发病率正在上升,CMD患者的抑郁显著影响预后。因此,本研究旨在使用机器学习方法开发并验证CMD患者抑郁的预测模型。
本研究利用欧洲健康、老龄化和退休调查(SHARE)的数据进行模型推导和内部验证,并利用中国健康与养老追踪调查(CHARLS)的数据进行外部验证。使用逻辑回归、K近邻、支持向量机、随机森林、梯度提升机(GBM)和轻量级梯度提升机构建抑郁预测模型。主要使用受试者工作特征曲线下面积(AUC)、布里尔评分、校准图和决策曲线分析(DCA)评估模型性能。使用夏普利值加法解释(SHAP)方法生成模型解释。
在SHARE的14884名参与者和CHARLS的1128名参与者中,分别有5456名和474名患有抑郁症。梯度提升机(GBM)模型在区分度和校准方面表现最佳,在外部验证队列中的AUC为0.823,DCA也证实GBM模型具有最佳的临床实用性。SHAP方法显示,睡眠障碍、生活满意度和孤独感是抑郁症的前三大预测因素。为方便临床医生,我们基于GBM模型开发了一个临床支持系统。
我们将GBM模型整合到一个临床支持系统中,该系统可以帮助临床医生早期识别有抑郁高风险的CMD患者。