Zheng Ying, Zhang Taotao, Yang Shu, Wang Fuzhi, Zhang Li, Liu Yuwen
School of Nursing, Bengbu Medical University, Bengbu, China.
School of Health Management, Bengbu Medical University, Bengbu, China.
BMC Psychiatry. 2024 Dec 2;24(1):870. doi: 10.1186/s12888-024-06299-6.
Older adults with chronic diseases are at higher risk of depressive symptoms than those without. For the onset of depressive symptoms, the prediction ability of changes in common risk factors over a 2-year follow-up period is unclear in the Chinese older population. This study aimed to build risk prediction models (RPMs) to estimate the probability of incident 2-year depression using data from the China Health and Retirement Longitudinal Study (CHARLS).
Four ML algorithms (logistic regression [LR], AdaBoost, random forest [RF] and k-nearest neighbor [kNN]) were applied to develop RPMs using the 2011-2015 cohort data. These developed models were then validated with 2018-2020 survey data. We evaluated the model performance using discrimination and calibration metrics, including an area under the receiver operating characteristic curve (AUROC) and the precision-recall curve (AUPRC), accuracy, sensitivity and calibrations plot. Finally, we explored the key factors of depressive symptoms by the selected best predictive models.
This study finally included 7,121 participants to build models to predict depressive symptoms, finding a 21.5% prevalence of depression. Combining the Synthetic Minority Oversampling Technique (SMOTE) with the logistic regression model (LR-SM) exhibited superior precision to predict depression than other models, with an AUROC and AUPRC of 0.612 and 0.468, respectively, an accuracy of 0.619 and a sensitivity of 0.546. In additiona, external validation of the LR-SM model using data from the 2018-2020 data also demonstrated good predictive ability with an AUROC of 0.623 (95% CI: 0.555- 0.673). Sex, self-rated health status, occupation, eyesight, memory and life satisfaction were identified as impactful predictors of depression.
Our developed models exhibited high accuracy, good discrimination and calibration profiles in predicting two-year risk of depression among older adults with chronic diseases. This model can be used to identify Chinese older population at high risk of depression and intervene in a timely manner.
患有慢性疾病的老年人比未患慢性疾病的老年人出现抑郁症状的风险更高。对于抑郁症状的发作,在中国老年人群中,常见风险因素在2年随访期内变化的预测能力尚不清楚。本研究旨在利用中国健康与养老追踪调查(CHARLS)的数据建立风险预测模型(RPM),以估计2年内发生抑郁的概率。
应用四种机器学习算法(逻辑回归[LR]、自适应增强算法、随机森林[RF]和k近邻算法[kNN]),利用2011 - 2015年队列数据建立RPM。然后用2018 - 2020年调查数据对这些开发的模型进行验证。我们使用区分度和校准指标评估模型性能,包括受试者工作特征曲线下面积(AUROC)、精确率-召回率曲线下面积(AUPRC)、准确率、灵敏度和校准图。最后,我们通过所选的最佳预测模型探索抑郁症状的关键因素。
本研究最终纳入7121名参与者建立预测抑郁症状的模型,发现抑郁症患病率为21.5%。将合成少数过采样技术(SMOTE)与逻辑回归模型(LR - SM)相结合,在预测抑郁症方面表现出比其他模型更高的精度,AUROC和AUPRC分别为0.612和0.468,准确率为0.619,灵敏度为0.546。此外,使用2018 - 2020年数据对LR - SM模型进行外部验证,其AUROC为0.623(95%CI:0.555 - 0.673),也显示出良好的预测能力。性别、自评健康状况、职业、视力、记忆力和生活满意度被确定为抑郁症的有影响力的预测因素。
我们开发的模型在预测患有慢性疾病的老年人两年内患抑郁症的风险方面表现出高精度、良好的区分度和校准情况。该模型可用于识别中国有高抑郁风险的老年人群并及时进行干预。