Department of Psychology, Henan University, Kaifeng, China.
Beijing Anding Hospital, Capital Medical University, Beijing, China.
J Affect Disord. 2024 Dec 15;367:554-561. doi: 10.1016/j.jad.2024.08.185. Epub 2024 Aug 31.
Depression is very prevalent in middle-aged and older smokers. Therefore, we aimed to identify the risk of depression among middle-aged and older adults with frequent and infrequent nicotine use, as this is quite necessary for supporting their well-being.
This study included a total of 10,821 participants, which were derived from the China Health and Retirement Longitudinal Study Wave 5, 2020 (CHARLS-5). Five machine learning (ML) algorithms were employed. Some metrics were used to evaluate the performance of models, including area under the receiver operating characteristic curve (AUC), positive predictive value (PPV), specificity, accuracy.
10,821 participants (6472 males, 4349 females) had a mean age of 60.47 ± 8.98, with a score of 8.90 ± 6.53 on depression scale. For middle-aged and older adults with frequent nicotine use, random forest (RF) achieved the highest AUC value, PPV and specificity (0.75, 0.74 and 0.88, respectively). For the other group, support vector machines (SVM) showed the highest PPV (0.74), and relatively high accuracy and specificity (0.72 and 0.87, respectively). Feature importance analysis indicated that "dissatisfaction with life" was the most important variable of identifying the risk of depression in the SVM model, while "attitude towards expected life span" was the most important one in the RF model.
CHARLS-5 was collected during the COVID-19, so our results may be influenced by the pandemic.
This study indicated that certain ML models can ideally identify the risk of depression in middle-aged and older adults, which holds significant value for their health management.
抑郁症在中年和老年吸烟者中非常普遍。因此,我们旨在确定频繁和不频繁使用尼古丁的中年和老年成年人患抑郁症的风险,因为这对于支持他们的健康非常必要。
本研究共纳入 10821 名参与者,均来自中国健康与养老追踪调查(CHARLS)第五轮(2020 年)。使用了五种机器学习(ML)算法。使用一些指标来评估模型的性能,包括接收者操作特征曲线下的面积(AUC)、阳性预测值(PPV)、特异性、准确性。
共有 10821 名参与者(6472 名男性,4349 名女性),平均年龄为 60.47±8.98 岁,抑郁量表评分为 8.90±6.53。对于频繁使用尼古丁的中年和老年成年人,随机森林(RF)算法的 AUC 值、PPV 和特异性最高(分别为 0.75、0.74 和 0.88)。对于另一组,支持向量机(SVM)算法的 PPV 最高(0.74),准确性和特异性相对较高(分别为 0.72 和 0.87)。特征重要性分析表明,在 SVM 模型中,“对生活的不满”是识别抑郁风险的最重要变量,而在 RF 模型中,“对预期寿命的态度”是最重要的变量。
CHARLS-5 是在 COVID-19 期间收集的,因此我们的结果可能会受到疫情的影响。
本研究表明,某些 ML 模型可以理想地识别中年和老年成年人患抑郁症的风险,这对他们的健康管理具有重要意义。