Gan Yao, Kuang Li, Xu Xiao-Ming, Ai Ming, He Jing-Lan, Wang Wo, Hong Su, Chen Jian Mei, Cao Jun, Zhang Qi
Department of Psychiatry, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China.
Mental Health Center, University-Town Hospital of Chongqing Medical University, Chongqing, China.
Front Psychiatry. 2025 Apr 1;15:1521051. doi: 10.3389/fpsyt.2024.1521051. eCollection 2024.
To explore the risk factors affecting adolescents' Internet addiction behavior and build a prediction model for adolescents' Internet addiction behavior based on machine learning algorithms.
A total of 4461 high school students in Chongqing were selected using stratified cluster sampling, and questionnaires were administered. Based on the presence of Internet addiction behavior, students were categorized into an Internet addiction group (n=1210) and a non-Internet addiction group (n=3115). Gender, age, residence type, and other data were compared between the groups, and independent risk factors for adolescent Internet addiction were analyzed using a logistic regression model. Six methods-multi-level perceptron, random forest, K-nearest neighbor, support vector machine, logistic regression, and extreme gradient boosting-were used to construct the model. The model's indicators under each algorithm were compared, evaluated with a confusion matrix, and the optimal model was selected.
The proportion of male adolescents, urban household registration, and scores on the family function, planning, action, and cognitive subscales, along with psychoticism, introversion-extroversion, neuroticism, somatization, obsessive-compulsiveness, interpersonal sensitivity, depression, anxiety, hostility, paranoia, and psychosis, were significantly higher in the Internet addiction group than in the non-Internet addiction group (P < 0.05). No significant differences were found in age or only-child status (P > 0.05). Statistically significant variables were analyzed using a logistic regression model, revealing that gender, household registration type, and scores on planning, action, introversion-extroversion, psychoticism, neuroticism, cognitive, obsessive-compulsive, depression, and hostility scales are independent risk factors for adolescent Internet addiction. The area under the curve (AUC) for multi-level perceptron, random forest, K-nearest neighbor, support vector machine, logistic regression, and extreme gradient boosting models were 0.843, 0.817, 0.778, 0.846, 0.847, and 0.836, respectively, with extreme gradient boosting showing the best predictive performance among these models.
The detection rate of Internet addiction is higher in males than in females, and adolescents with impulsive, extroverted, psychotic, neurotic, obsessive, depressive, and hostile traits are more prone to developing Internet addiction. While the overall performance of the machine learning models for predicting adolescent Internet addiction is moderate, the extreme gradient boosting method outperforms others, effectively identifying risk factors and enabling targeted interventions.
探讨影响青少年网络成瘾行为的危险因素,并基于机器学习算法构建青少年网络成瘾行为的预测模型。
采用分层整群抽样法选取重庆市4461名高中生进行问卷调查。根据是否存在网络成瘾行为,将学生分为网络成瘾组(n = 1210)和非网络成瘾组(n = 3115)。比较两组间的性别、年龄、居住类型等数据,并使用逻辑回归模型分析青少年网络成瘾的独立危险因素。采用多层感知器、随机森林、K近邻、支持向量机、逻辑回归和极端梯度提升六种方法构建模型。比较各算法下模型的指标,用混淆矩阵进行评估,选出最优模型。
网络成瘾组男性青少年比例、城镇户籍比例以及家庭功能、计划性、行动性和认知分量表得分,以及精神质、内外向、神经质、躯体化、强迫症状、人际敏感、抑郁、焦虑、敌对、偏执和精神病性得分均显著高于非网络成瘾组(P < 0.05)。年龄和独生子女状况差异无统计学意义(P > 0.05)。使用逻辑回归模型分析具有统计学意义的变量,结果显示性别、户籍类型以及计划性、行动性、内外向、精神质、神经质、认知、强迫症状、抑郁和敌对量表得分是青少年网络成瘾的独立危险因素。多层感知器、随机森林、K近邻、支持向量机、逻辑回归和极端梯度提升模型的曲线下面积(AUC)分别为0.843、0.817、0.778、0.846、0.847和0.836,其中极端梯度提升在这些模型中表现出最佳的预测性能。
男性青少年网络成瘾检出率高于女性,具有冲动、外向、精神病性、神经质、强迫、抑郁和敌对特质的青少年更容易发生网络成瘾。虽然机器学习模型预测青少年网络成瘾的整体性能中等,但极端梯度提升方法优于其他方法,能有效识别危险因素并实现针对性干预。