Zhou Yiwei, Zhang Zejie, Li Qin, Mao Guangyun, Zhou Zumu
Business School, University of Shanghai for Science and Technology, 200093, Shanghai, China.
School of Intelligent Emergency Management, University of Shanghai for Science and Technology, 200093, Shanghai, China.
BMC Psychol. 2024 Apr 24;12(1):230. doi: 10.1186/s40359-024-01696-8.
COVID-19 epidemics often lead to elevated levels of depression. To accurately identify and predict depression levels in home-quarantined individuals during a COVID-19 epidemic, this study constructed a depression prediction model based on multiple machine learning algorithms and validated its effectiveness.
A cross-sectional method was used to examine the depression status of individuals quarantined at home during the epidemic via the network. Characteristics included variables on sociodemographics, COVID-19 and its prevention and control measures, impact on life, work, health and economy after the city was sealed off, and PHQ-9 scale scores. The home-quarantined subjects were randomly divided into training set and validation set according to the ratio of 7:3, and the performance of different machine learning models were compared by 10-fold cross-validation, and the model algorithm with the best performance was selected from 15 models to construct and validate the depression prediction model for home-quarantined subjects. The validity of different models was compared based on accuracy, precision, receiver operating characteristic (ROC) curve, and area under the ROC curve (AUC), and the best model suitable for the data framework of this study was identified.
The prevalence of depression among home-quarantined individuals during the epidemic was 31.66% (202/638), and the constructed Adaboost depression prediction model had an ACC of 0.7917, an accuracy of 0.7180, and an AUC of 0.7803, which was better than the other 15 models on the combination of various performance measures. In the validation sets, the AUC was greater than 0.83.
The Adaboost machine learning algorithm developed in this study can be used to construct a depression prediction model for home-quarantined individuals that has better machine learning performance, as well as high effectiveness, robustness, and generalizability.
新型冠状病毒肺炎(COVID-19)疫情常导致抑郁水平升高。为准确识别和预测COVID-19疫情期间居家隔离人员的抑郁水平,本研究基于多种机器学习算法构建了抑郁预测模型并验证其有效性。
采用横断面研究方法,通过网络调查疫情期间居家隔离人员的抑郁状况。特征包括社会人口统计学变量、COVID-19及其防控措施、封城后对生活、工作、健康和经济的影响以及患者健康问卷-9(PHQ-9)量表评分。将居家隔离对象按7:3的比例随机分为训练集和验证集,通过10折交叉验证比较不同机器学习模型的性能,从15种模型中选出性能最佳的模型算法构建并验证居家隔离对象的抑郁预测模型。基于准确率、精确率、受试者工作特征(ROC)曲线及ROC曲线下面积(AUC)比较不同模型的有效性,确定最适合本研究数据框架的最佳模型。
疫情期间居家隔离人员的抑郁患病率为31.66%(202/638),构建的Adaboost抑郁预测模型的ACC为0.7917,精确率为0.7180,AUC为0.7803,在各项性能指标综合表现上优于其他15种模型。在验证集中,AUC大于0.83。
本研究开发的Adaboost机器学习算法可用于构建居家隔离人员的抑郁预测模型,该模型具有较好的机器学习性能,以及较高的有效性、稳健性和泛化能力。