Zuo Wenwei, Yang Xuelian
University of Shanghai for Science and Technology, No.516, Jungong Road, Yangpu Area, Shanghai, 200093, China.
Department of Neurology, Gongli Hospital of Shanghai Pudong New Area, No. 219 Miaopu Road, Pudong New Area, Shanghai, 200135, China.
BMC Geriatr. 2025 Mar 22;25(1):193. doi: 10.1186/s12877-025-05837-5.
BACKGROUND: Depression is a common complication after a stroke that may lead to increased disability and decreased quality of life. The objective of this study was to develop and validate an interpretable predictive model to assess the risk of depression in stroke patients using machine learning (ML) methods. METHODS: This study included 1143 stroke patients from the NHANES database between 2005 and 2020. First, risk factors for depression in stroke patients were determined by univariate and multivariate logistic regression analysis. Next, five machine learning algorithms were used to construct predictive models, and several evaluation metrics (including area under the curve (AUC)) were used to compare the predictive performance of the models. In addition, the SHAP (Shapley Additive Explanations) method was used to rank the importance of features and to interpret the final model. RESULTS: We screened seven features to construct a predictive model. Among the 5 machine learning models, the XGBoost (extreme gradient boosting) model showed the best discriminative ability, with an AUC of the ROC (receiver operating characteristic curve) in the test set of 0.746 and an accuracy of 0.834. In addition, the prediction results of the XGBoost model were interpreted in detail using the SHAP algorithm. We also developed a web-based calculator that provides a convenient tool for predicting the risk of depression in stroke patients at the following link: https://prediction-model-for-depression.streamlit.app . CONCLUSIONS: Our interpretable machine learning model serves as an auxiliary tool for clinical judgment, aimed at early and effective identification of depression risk in stroke patients.
背景:抑郁症是中风后的常见并发症,可能导致残疾增加和生活质量下降。本研究的目的是开发并验证一种可解释的预测模型,使用机器学习(ML)方法评估中风患者患抑郁症的风险。 方法:本研究纳入了2005年至2020年间来自美国国家健康与营养检查调查(NHANES)数据库的1143名中风患者。首先,通过单因素和多因素逻辑回归分析确定中风患者抑郁症的危险因素。接下来,使用五种机器学习算法构建预测模型,并使用几种评估指标(包括曲线下面积(AUC))比较模型的预测性能。此外,使用SHAP(Shapley加性解释)方法对特征的重要性进行排名并解释最终模型。 结果:我们筛选出七个特征来构建预测模型。在五个机器学习模型中,XGBoost(极端梯度提升)模型表现出最佳的判别能力,测试集中受试者工作特征曲线(ROC)的AUC为0.746,准确率为0.834。此外,使用SHAP算法详细解释了XGBoost模型的预测结果。我们还开发了一个基于网络的计算器,通过以下链接为预测中风患者患抑郁症的风险提供了一个便捷工具:https://prediction-model-for-depression.streamlit.app 。 结论:我们的可解释机器学习模型可作为临床判断的辅助工具,旨在早期有效识别中风患者的抑郁症风险。
Eur J Med Res. 2024-11-14
BMC Med Inform Decis Mak. 2023-10-19
Front Psychiatry. 2023-6-19