预测中国高中生超重的发病情况：一项为期一年的前瞻性队列研究中的机器学习方法。

Predicting the onset of overweight in Chinese high school students: a machine-learning approach in a one-year prospective cohort study.

作者信息

Zhang Zikang, Peng Wei, Sun Shaoming, Ma Jianguo, Sun Yining, Zhang Fangwen

机构信息

Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, 230031, PR China.

University of Science and Technology of China, Hefei, 230026, PR China.

出版信息

Endocrine. 2024 Nov;86(2):600-611. doi: 10.1007/s12020-024-03902-4. Epub 2024 Jun 10.

DOI:10.1007/s12020-024-03902-4

PMID:38856840

Abstract

OBJECTIVE

This study aimed to develop and evaluate machine-learning models for predicting the onset of overweight in adolescents aged 14‒17, utilizing easily collectible personal information.

METHODS

This study was a one-year prospective cohort study. Baseline data were collected through anthropometric measurements and questionnaires, and the incidence of overweight was calculated one year later via anthropometric measurements. Predictive factors were selected through univariate analysis. Six machine-learning models were developed for predicting the onset of overweight. The SHapley Additive exPlanations (SHAP) was used for global and local interpretation of the models.

RESULTS

Out of 1,241 adolescents, 204 (16.4%) were identified as overweight after one year. Nineteen features were associated with the overweight incidence in univariable analysis. Participants were randomly divided into a training group and a testing group in a 7:3 ratio. The Light Gradient Boosting Machine (LGBM) algorithm achieved outperformed other models, achieving the following metrics: Accuracy (0.956), Recall (0.812), Specificity (0.983), F1-score (0.855), AUC (0.961). Importance ranking revealed that the top 11 minimal feature set can maintain the stability of model performance.

CONCLUSIONS

The onset of overweight in adolescents was accurately predicted using easily collectible personal information. The LGBM-based model exhibited superior performance. Oversampling technique notably improved model performance. The model interpretation technique provided innovative strategies for managing adolescent overweight/obesity.

摘要

目的

本研究旨在开发并评估利用易于收集的个人信息预测14至17岁青少年超重发病情况的机器学习模型。

方法

本研究为为期一年的前瞻性队列研究。通过人体测量和问卷调查收集基线数据，并在一年后通过人体测量计算超重发病率。通过单因素分析选择预测因素。开发了六个用于预测超重发病的机器学习模型。使用SHapley加法解释（SHAP）对模型进行全局和局部解释。

结果

在1241名青少年中，一年后有204名（16.4%）被确定为超重。单因素分析中有19个特征与超重发病率相关。参与者以7:3的比例随机分为训练组和测试组。轻梯度提升机（LGBM）算法的表现优于其他模型，取得了以下指标：准确率（0.956）、召回率（0.812）、特异性（0.983）、F1分数（0.855）、曲线下面积（AUC，0.961）。重要性排序显示，前11个最小特征集可保持模型性能的稳定性。

结论

利用易于收集的个人信息可准确预测青少年超重的发病情况。基于LGBM的模型表现出卓越性能。过采样技术显著提高了模型性能。模型解释技术为管理青少年超重/肥胖提供了创新策略。

相似文献

Predicting the onset of overweight in Chinese high school students: a machine-learning approach in a one-year prospective cohort study.预测中国高中生超重的发病情况：一项为期一年的前瞻性队列研究中的机器学习方法。

Endocrine. 2024 Nov;86(2):600-611. doi: 10.1007/s12020-024-03902-4. Epub 2024 Jun 10.

A Risk Prediction Model for Physical Restraints Among Older Chinese Adults in Long-term Care Facilities: Machine Learning Study.长期护理机构中老年人身体约束的风险预测模型：机器学习研究。

J Med Internet Res. 2023 Apr 6;25:e43815. doi: 10.2196/43815.

Predictive etiological classification of acute ischemic stroke through interpretable machine learning algorithms: a multicenter, prospective cohort study.通过可解释的机器学习算法对急性缺血性脑卒中进行预测病因分类：一项多中心前瞻性队列研究。

BMC Med Res Methodol. 2024 Sep 10;24(1):199. doi: 10.1186/s12874-024-02331-1.

Disability risk prediction model based on machine learning among Chinese healthy older adults: results from the China Health and Retirement Longitudinal Study.基于机器学习的中国健康老年人残疾风险预测模型：来自中国健康与养老追踪调查的结果。

Front Public Health. 2023 Nov 9;11:1271595. doi: 10.3389/fpubh.2023.1271595. eCollection 2023.

Predicting risk of obesity in overweight adults using interpretable machine learning algorithms.使用可解释的机器学习算法预测超重成年人的肥胖风险。

Front Endocrinol (Lausanne). 2023 Nov 17;14:1292167. doi: 10.3389/fendo.2023.1292167. eCollection 2023.

Development and Validation of an Explainable Machine Learning Model for Predicting Myocardial Injury After Noncardiac Surgery in Two Centers in China: Retrospective Study.中国两个中心用于预测非心脏手术后心肌损伤的可解释机器学习模型的开发与验证：一项回顾性研究

JMIR Aging. 2024 Jul 26;7:e54872. doi: 10.2196/54872.

Prediction of lateral lymph node metastasis with short diameter less than 8 mm in papillary thyroid carcinoma based on radiomics.基于放射组学的甲状腺乳头状癌短径小于 8mm 预测侧颈部淋巴结转移

Cancer Imaging. 2024 Nov 15;24(1):155. doi: 10.1186/s40644-024-00803-7.

AKIML: An interpretable machine learning model for predicting acute kidney injury within seven days in critically ill patients based on a prospective cohort study.AKIML：基于一项前瞻性队列研究的用于预测危重症患者7天内急性肾损伤的可解释机器学习模型。

Clin Chim Acta. 2024 Jun 1;559:119705. doi: 10.1016/j.cca.2024.119705. Epub 2024 May 1.

Interpretable machine learning for allergic rhinitis prediction among preschool children in Urumqi, China.中国乌鲁木齐学龄前儿童变应性鼻炎预测的可解释机器学习。

Sci Rep. 2024 Sep 27;14(1):22281. doi: 10.1038/s41598-024-73733-w.

Prediction of adolescent weight status by machine learning: a population-based study.基于人群的机器学习预测青少年体重状况的研究。

BMC Public Health. 2024 May 20;24(1):1351. doi: 10.1186/s12889-024-18830-1.

本文引用的文献

Machine learning approach to predict body weight in adults.机器学习方法预测成年人的体重。

Front Public Health. 2023 Jun 15;11:1090146. doi: 10.3389/fpubh.2023.1090146. eCollection 2023.

Secular trends and sociodemographic determinants of thinness, overweight and obesity among Chinese children and adolescents aged 7-18 years from 2010 to 2018.2010 年至 2018 年中国 7-18 岁儿童青少年消瘦、超重和肥胖的流行趋势及社会人口学决定因素。

Front Public Health. 2023 May 4;11:1128552. doi: 10.3389/fpubh.2023.1128552. eCollection 2023.

Does multidimensional daily information predict the onset of myopia? A 1-year prospective cohort study.多维日常信息能否预测近视的发生？一项为期 1 年的前瞻性队列研究。

Biomed Eng Online. 2023 May 13;22(1):45. doi: 10.1186/s12938-023-01109-8.

Predicting risk of overweight or obesity in Chinese preschool-aged children using artificial intelligence techniques.利用人工智能技术预测中国学龄前儿童超重或肥胖的风险。

Endocrine. 2022 Jun;77(1):63-72. doi: 10.1007/s12020-022-03072-1. Epub 2022 May 18.

Association between Physical Activity, Sedentary Behaviors, Sleep, Diet, and Adiposity among Children and Adolescents in China.中国儿童和青少年身体活动、久坐行为、睡眠、饮食与肥胖的关系。

Obes Facts. 2022;15(1):26-35. doi: 10.1159/000519268. Epub 2021 Nov 16.

A systematic literature review on obesity: Understanding the causes & consequences of obesity and reviewing various machine learning approaches used to predict obesity.关于肥胖的系统文献综述：了解肥胖的成因与后果，并回顾用于预测肥胖的各种机器学习方法。

Comput Biol Med. 2021 Sep;136:104754. doi: 10.1016/j.compbiomed.2021.104754. Epub 2021 Aug 16.

Thinness, overweight and obesity among 6- to 17-year-old Malaysians: secular trends and sociodemographic determinants from 2006 to 2015.马来西亚 6 至 17 岁儿童的消瘦、超重和肥胖：2006 至 2015 年的长期趋势和社会人口学决定因素。

Public Health Nutr. 2021 Dec;24(18):6309-6322. doi: 10.1017/S1368980021003190. Epub 2021 Aug 5.

Does Physical Activity Predict Obesity-A Machine Learning and Statistical Method-Based Analysis.体力活动与肥胖的关系：基于机器学习和统计方法的分析。

Int J Environ Res Public Health. 2021 Apr 9;18(8):3966. doi: 10.3390/ijerph18083966.

Machine learning methods to predict mechanical ventilation and mortality in patients with COVID-19.机器学习方法预测 COVID-19 患者的机械通气和死亡率。

PLoS One. 2021 Apr 1;16(4):e0249285. doi: 10.1371/journal.pone.0249285. eCollection 2021.

Weight self-perception in adolescents: evidence from a population-based study.青少年的体重自我认知：基于人群的研究证据。

Public Health Nutr. 2021 May;24(7):1648-1656. doi: 10.1017/S1368980021000690. Epub 2021 Mar 2.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

预测中国高中生超重的发病情况：一项为期一年的前瞻性队列研究中的机器学习方法。

Predicting the onset of overweight in Chinese high school students: a machine-learning approach in a one-year prospective cohort study.

作者信息

机构信息

出版信息

OBJECTIVE

METHODS

RESULTS

CONCLUSIONS

目的

方法

结果

结论

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献