Yan Ran, Hu Yizhen, Yang Juxiang, Wang Hongchu, Wang Yi, Song Gang
School of Physical Education, Southwest University, Chongqing, China.
School of Mathematical Science, South China Normal University, Guangzhou, China.
Glob Health Med. 2025 Jun 30;7(3):241-251. doi: 10.35772/ghm.2025.01061.
This study aims to identify and predict latent trajectories of depression and chronic disease among middle-aged and older adults in China using data-driven and interpretable machine learning methods, and to explore key factors that promote healthy aging. To achieve this, we analyzed longitudinal data from 13,073 middle-aged and older adults in the China Health and Retirement Longitudinal Study (CHARLS). Group-based multi-trajectory modeling (GBMTM) was applied to identify latent trajectory groups for depression and chronic disease statuses. Predictive factors included sociodemographic characteristics, health conditions, and lifestyle factors. Machine learning models and dynamic nomograms were used to predict trajectory groups, and model performance was evaluated using the area under the receiver operating characteristic curve (AUROC) and decision curve analysis (DCA). As a result, three main trajectory groups were identified: a normal healthy trajectory group (26.9%), a potential depression and disease increase trajectory group (55.6%), and a high depression and disease burden trajectory group (17.5%). Additionally, the study found that older age, disability, shorter sleep duration, and poor self-reported health status were associated with a higher likelihood of belonging to the latent depression and disease increase trajectory group or the high disease burden trajectory group, particularly among urban women. In conclusion, this study demonstrates that the GBMTM and machine learning models can effectively identify and predict depression and chronic disease trajectories. The identified predictors are crucial for developing targeted interventions to promote healthy aging among the middle-aged and older adults.
本研究旨在运用数据驱动且可解释的机器学习方法,识别并预测中国中老年人抑郁和慢性病的潜在轨迹,并探索促进健康老龄化的关键因素。为实现这一目标,我们分析了中国健康与养老追踪调查(CHARLS)中13073名中老年人的纵向数据。应用基于群体的多轨迹建模(GBMTM)来识别抑郁和慢性病状态的潜在轨迹组。预测因素包括社会人口学特征、健康状况和生活方式因素。使用机器学习模型和动态列线图来预测轨迹组,并使用受试者操作特征曲线下面积(AUROC)和决策曲线分析(DCA)评估模型性能。结果,识别出三个主要轨迹组:正常健康轨迹组(26.9%)、潜在抑郁和疾病增加轨迹组(55.6%)以及高抑郁和疾病负担轨迹组(17.5%)。此外,研究发现年龄较大、残疾、睡眠时间较短以及自我报告的健康状况较差与属于潜在抑郁和疾病增加轨迹组或高疾病负担轨迹组的可能性较高相关,尤其是在城市女性中。总之,本研究表明GBMTM和机器学习模型能够有效地识别和预测抑郁和慢性病轨迹。所识别出的预测因素对于制定有针对性的干预措施以促进中老年人的健康老龄化至关重要。