School of Nursing, Sun Yat-Sen University, No. 74, 2nd Yat-Sen Rd, Yuexiu District, Guangzhou City, Guangdong Province, China; Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Hunghom, Hong Kong SAR, China.
School of Nursing, Sun Yat-Sen University, No. 74, 2nd Yat-Sen Rd, Yuexiu District, Guangzhou City, Guangdong Province, China.
J Affect Disord. 2025 Jan 1;368:537-546. doi: 10.1016/j.jad.2024.09.111. Epub 2024 Sep 19.
Youths face significant mental health challenges exacerbated by stressful life events, particularly in the context of the COVID-19 pandemic. Immature coping strategies can worsen mental health outcomes.
This study utilised a two-wave cross-sectional survey design with data collected from Chinese youth aged 14-25 years. Wave 1 (N = 3038) and Wave 2 (N = 539) datasets were used for model development and external validation, respectively. Twenty-five features, encompassing dimensions related to demographic information, stressful life events, social support, coping strategies, and emotional intelligence, were input into the model to predict the mental health status of youth, which was considered their coping outcome. Shapley additive explanation (SHAP) was used to determine the importance of each risk factor in the feature selection. The intersection of top 10 features identified by random forest and XGBoost were considered the most influential predictors of mental health during the feature selection process, and was then taken as the final set of features for model development. Machine learning models, including logistic regression, AdaBoost, and a backpropagation neural network (BPNN), were trained to predict the outcomes. The optimum model was selected according to the performance in both internal and external validation.
This study identified six key features that were significantly associated with mental health outcomes: punishment, adaptation issues, self-regulation of emotions, learning pressure, use of social support, and recognition of others' emotions. The BPNN model, optimized through feature selection methods like SHAP, demonstrated superior performance in internal validation (C-index [95 % CI] = 0.9120 [0.9111, 0.9129], F-score [95 % CI] = 0.8861 [0.8853, 0.8869]). Additionally, external validation showed the model had strong discrimination (C-index = 0.9749, F-score = 0.8442) and calibration (Brier score = 0.029) capabilities.
Although the clinical prediction model performed well, the study it still limited by self-reported data and representativeness of samples. Causal relationships need to be established to interpret the coping mechanism from multiple perspectives. Also, the limited data on minority groups may lead to algorithmic unfairness.
Machine learning models effectively identified and predicted mental health outcomes among youths, with the SHAP+BPNN model showing promising clinical applicability. These findings emphasise the importance and effectiveness of targeted interventions with the help of clinical prediction model.
年轻人面临着严重的心理健康挑战,这些挑战因生活压力事件而加剧,尤其是在 COVID-19 大流行的背景下。不成熟的应对策略会使心理健康状况恶化。
本研究采用了两波横向调查设计,数据来自中国 14-25 岁的青年。波 1(N=3038)和波 2(N=539)数据集分别用于模型开发和外部验证。将 25 个特征(包括与人口统计信息、生活压力事件、社会支持、应对策略和情绪智力相关的维度)输入到模型中,以预测青年的心理健康状况,这被视为他们的应对结果。使用 Shapley 加性解释 (SHAP) 确定每个风险因素在特征选择中的重要性。随机森林和 XGBoost 确定的前 10 个特征的交集被认为是特征选择过程中影响心理健康的最主要预测因素,并被视为模型开发的最终特征集。使用逻辑回归、AdaBoost 和反向传播神经网络 (BPNN) 等机器学习模型来预测结果。根据内部和外部验证中的性能选择最佳模型。
本研究确定了六个与心理健康结果显著相关的关键特征:惩罚、适应问题、情绪自我调节、学习压力、社会支持的利用和他人情绪的识别。通过 SHAP 等特征选择方法优化的 BPNN 模型在内部验证中表现出优异的性能(C 指数[95%CI]为 0.9120[0.9111, 0.9129],F 分数[95%CI]为 0.8861[0.8853, 0.8869])。此外,外部验证表明该模型具有较强的区分能力(C 指数=0.9749,F 分数=0.8442)和校准能力(Brier 得分=0.029)。
尽管临床预测模型表现良好,但该研究仍受到自我报告数据和样本代表性的限制。需要建立因果关系来从多个角度解释应对机制。此外,少数群体的数据有限可能导致算法不公平。
机器学习模型有效地识别和预测了年轻人的心理健康结果,SHAP+BPNN 模型具有良好的临床适用性。这些发现强调了通过临床预测模型进行有针对性的干预的重要性和有效性。