Verma Abhishek, Roy Aratrika, Junaid K P
Dept. of Psychiatry, Postgraduate Institute of Medical Education and Research, Chandigarh, India.
Dept. of Psychology, Panjab University, Chandigarh, India.
Indian J Psychol Med. 2025 Jan 25:02537176241311196. doi: 10.1177/02537176241311196.
Depression among the elderly is a growing public health concern, especially in India. This study aimed to investigate the predictive validity of physiological, psychological, and functional health factors in classifying the level of depressive symptoms among the elderly using the extreme gradient boosting (XGBoost) technique. Additionally, we compared the performance of models trained on original and resampled data.
This study is entirely based on secondary data analysis of the Longitudinal Aging Study in India wave 1 data. We classified the observations into "high depressive symptom" and "low/no depressive symptom" groups based on the predictors, including physiological, psychological, and functional health factors, along with socio-demographic factors. We developed three models (Models 1, 2, and 3) trained on original, over-sampled, and under-sampled data, respectively. Model performance was evaluated using the metrics of balanced accuracy, sensitivity, specificity, and area under the receiver operating characteristics curve (AUC).
The study included 26,065 individuals aged 60 and above. Model 3, trained on under-sampled data, demonstrated the best overall performance. It achieved a balanced accuracy of 64%, with a sensitivity of 62.8% and specificity of 65.2%. The AUC for Model 3 was 0.692. Feature importance analysis revealed that life satisfaction, instrumental activities of daily living, mobility, caste, and monthly per capita expenditure quintiles were among the most influential factors in predicting the level of depressive symptoms.
The XGBoost models demonstrate promise in predicting depressive symptoms among the elderly. These findings suggest that machine learning models can be envisaged for early detection and management of depression, especially in primary care.
老年人抑郁症是一个日益严重的公共卫生问题,在印度尤为如此。本研究旨在使用极端梯度提升(XGBoost)技术,调查生理、心理和功能健康因素在分类老年人抑郁症状水平方面的预测效度。此外,我们比较了在原始数据和重采样数据上训练的模型的性能。
本研究完全基于印度纵向老龄化研究第1波数据的二次数据分析。我们根据预测因素,包括生理、心理和功能健康因素以及社会人口学因素,将观察对象分为“高抑郁症状”和“低/无抑郁症状”组。我们分别开发了在原始数据、过采样数据和欠采样数据上训练的三个模型(模型1、模型2和模型3)。使用平衡准确率、灵敏度、特异性和受试者工作特征曲线下面积(AUC)等指标评估模型性能。
该研究纳入了26065名60岁及以上的个体。在欠采样数据上训练的模型3表现出最佳的整体性能。它的平衡准确率为64%,灵敏度为62.8%,特异性为65.2%。模型3的AUC为0.692。特征重要性分析表明,生活满意度、日常生活工具性活动、 mobility、种姓和每月人均支出五分位数是预测抑郁症状水平最具影响力的因素。
XGBoost模型在预测老年人抑郁症状方面显示出前景。这些发现表明,可以设想使用机器学习模型进行抑郁症的早期检测和管理,尤其是在初级保健中。