• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

机器学习识别儿童和青少年短期和长期体重状况的关键预测因子。

Robust identification key predictors of short- and long-term weight status in children and adolescents by machine learning.

机构信息

School of Nursing, The University of Hong Kong, Pokfulam, Hong Kong SAR, China.

Department of Electrical and Electronic Engineering, The University of Hong Kong, Pokfulam, Hong Kong SAR, China.

出版信息

Front Public Health. 2024 Sep 24;12:1414046. doi: 10.3389/fpubh.2024.1414046. eCollection 2024.

DOI:10.3389/fpubh.2024.1414046
PMID:39381765
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11458556/
Abstract

BACKGROUND

Early identification of high-risk individuals for weight problems in children and adolescents is crucial for implementing timely preventive measures. While machine learning (ML) techniques have shown promise in addressing this complex challenge with high-dimensional data, feature selection is vital for identifying the key predictors that can facilitate effective and targeted interventions. This study aims to utilize feature selection process to identify a robust and minimal set of predictors that can aid in the early prediction of short- and long-term weight problems in children and adolescents.

METHODS

We utilized demographic, physical, and psychological wellbeing predictors to model weight status (normal, underweight, overweight, and obese) for 1-, 3-, and 5-year periods. To select the most influential features, we employed four feature selection methods: (1) Chi-Square test; (2) Information Gain; (3) Random Forest; (4) eXtreme Gradient Boosting (XGBoost) with six ML approaches. The stability of the feature selection methods was assessed by Jaccard's index, Spearman's rank correlation and Pearson's correlation. Model evaluation was performed by various accuracy metrics.

RESULTS

With 3,862,820 million student-visits were included in this population-based study, the mean age of 11.6 (SD = 3.64) for the training set and 10.8 years (SD = 3.50) for the temporal test set. From the initial set of 38 predictors, we identified 6, 9, and 13 features for 1-, 3-, and 5-year predictions, respectively, by the best performed feature selection method of Chi-Square test in XGBoost models. These feature sets demonstrated excellent stability and achieved prediction accuracies of 0.82, 0.73, and 0.70; macro-AUCs of 0.94, 0.86, and 0.83; micro-AUCs of 0.96, 0.93, and 0.92 for different prediction windows, respectively. Weight, height, sex, total score of self-esteem, and age were consistently the most influential predictors across all prediction windows. Additionally, several psychological and social wellbeing predictors showed relatively high importance in long-term weight status prediction.

CONCLUSIONS

We demonstrate the potential of ML in identifying key predictors of weight status in children and adolescents. While traditional anthropometric measures remain important, psychological and social wellbeing factors also emerge as crucial predictors, potentially informing targeted interventions to address childhood and adolescence weight problems.

摘要

背景

早期识别儿童和青少年体重问题的高危个体对于及时实施预防措施至关重要。虽然机器学习(ML)技术在处理具有高维数据的复杂挑战方面显示出了前景,但特征选择对于识别能够促进有效和有针对性干预的关键预测因素至关重要。本研究旨在利用特征选择过程,确定一组强大且最小的预测因素,以帮助早期预测儿童和青少年的短期和长期体重问题。

方法

我们利用人口统计学、身体和心理健康福利预测因素来建模体重状况(正常、体重不足、超重和肥胖),预测期为 1 年、3 年和 5 年。为了选择最具影响力的特征,我们采用了四种特征选择方法:(1)卡方检验;(2)信息增益;(3)随机森林;(4)极端梯度提升(XGBoost)与六种 ML 方法。特征选择方法的稳定性通过杰卡德指数、斯皮尔曼等级相关和皮尔逊相关进行评估。通过各种准确性指标对模型进行评估。

结果

在这项基于人群的研究中,共纳入了 3862820 名学生就诊,训练集的平均年龄为 11.6 岁(SD=3.64),时间测试集的平均年龄为 10.8 岁(SD=3.50)。从最初的 38 个预测因素中,我们通过 XGBoost 模型中的最佳卡方检验特征选择方法,分别为 1 年、3 年和 5 年预测确定了 6、9 和 13 个特征。这些特征集表现出出色的稳定性,在不同的预测窗口中,预测准确率分别为 0.82、0.73 和 0.70;宏 AUC 分别为 0.94、0.86 和 0.83;微 AUC 分别为 0.96、0.93 和 0.92。体重、身高、性别、自尊总分和年龄在所有预测窗口中始终是最具影响力的预测因素。此外,一些心理和社会健康福利预测因素在长期体重状况预测中表现出相对较高的重要性。

结论

我们证明了 ML 在识别儿童和青少年体重状况的关键预测因素方面的潜力。虽然传统的人体测量指标仍然很重要,但心理和社会健康因素也成为关键的预测因素,这可能为解决儿童和青少年体重问题提供有针对性的干预措施。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/11458556/55809e220192/fpubh-12-1414046-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/11458556/b548233a0d7e/fpubh-12-1414046-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/11458556/ff7bdff4512e/fpubh-12-1414046-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/11458556/ec0770fe79ba/fpubh-12-1414046-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/11458556/e9df33e174cb/fpubh-12-1414046-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/11458556/55809e220192/fpubh-12-1414046-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/11458556/b548233a0d7e/fpubh-12-1414046-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/11458556/ff7bdff4512e/fpubh-12-1414046-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/11458556/ec0770fe79ba/fpubh-12-1414046-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/11458556/e9df33e174cb/fpubh-12-1414046-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9417/11458556/55809e220192/fpubh-12-1414046-g0005.jpg

相似文献

1
Robust identification key predictors of short- and long-term weight status in children and adolescents by machine learning.机器学习识别儿童和青少年短期和长期体重状况的关键预测因子。
Front Public Health. 2024 Sep 24;12:1414046. doi: 10.3389/fpubh.2024.1414046. eCollection 2024.
2
Prediction of adolescent weight status by machine learning: a population-based study.基于人群的机器学习预测青少年体重状况的研究。
BMC Public Health. 2024 May 20;24(1):1351. doi: 10.1186/s12889-024-18830-1.
3
Interpretable machine learning for allergic rhinitis prediction among preschool children in Urumqi, China.中国乌鲁木齐学龄前儿童变应性鼻炎预测的可解释机器学习。
Sci Rep. 2024 Sep 27;14(1):22281. doi: 10.1038/s41598-024-73733-w.
4
The effectiveness of web-based programs on the reduction of childhood obesity in school-aged children: A systematic review.基于网络的项目对学龄儿童肥胖症减轻的有效性:一项系统评价。
JBI Libr Syst Rev. 2012;10(42 Suppl):1-14. doi: 10.11124/jbisrir-2012-248.
5
6
Machine learning algorithms for predicting COVID-19 mortality in Ethiopia.用于预测埃塞俄比亚 COVID-19 死亡率的机器学习算法。
BMC Public Health. 2024 Jun 28;24(1):1728. doi: 10.1186/s12889-024-19196-0.
7
Predicting and identifying factors associated with undernutrition among children under five years in Ghana using machine learning algorithms.利用机器学习算法预测和识别加纳五岁以下儿童营养不良的相关因素。
PLoS One. 2024 Feb 13;19(2):e0296625. doi: 10.1371/journal.pone.0296625. eCollection 2024.
8
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
9
Issue of Data Imbalance on Low Birthweight Baby Outcomes Prediction and Associated Risk Factors Identification: Establishment of Benchmarking Key Machine Learning Models With Data Rebalancing Strategies.低出生体重婴儿结局预测中数据不平衡问题及相关危险因素识别:应用数据再平衡策略建立基准机器学习模型。
J Med Internet Res. 2023 May 31;25:e44081. doi: 10.2196/44081.
10
Use of Multiprognostic Index Domain Scores, Clinical Data, and Machine Learning to Improve 12-Month Mortality Risk Prediction in Older Hospitalized Patients: Prospective Cohort Study.使用多预后指标领域评分、临床数据和机器学习提高老年住院患者 12 个月死亡率风险预测:前瞻性队列研究。
J Med Internet Res. 2021 Jun 21;23(6):e26139. doi: 10.2196/26139.

引用本文的文献

1
Application and Analysis of Random Forest and Support Vector Classification in Risk Prediction of Childhood Obesity and Hyperuricemia.随机森林和支持向量分类在儿童肥胖和高尿酸血症风险预测中的应用与分析
Diabetes Metab Syndr Obes. 2025 Jul 7;18:2221-2233. doi: 10.2147/DMSO.S519284. eCollection 2025.

本文引用的文献

1
Prediction of adolescent weight status by machine learning: a population-based study.基于人群的机器学习预测青少年体重状况的研究。
BMC Public Health. 2024 May 20;24(1):1351. doi: 10.1186/s12889-024-18830-1.
2
Obesity Prediction with EHR Data: A deep learning approach with interpretable elements.利用电子健康记录数据进行肥胖预测:一种具有可解释元素的深度学习方法。
ACM Trans Comput Healthc. 2022 Jul;3(3). doi: 10.1145/3506719. Epub 2022 Apr 7.
3
Effect of Sleep Extension on Objectively Assessed Energy Intake Among Adults With Overweight in Real-life Settings: A Randomized Clinical Trial.
真实环境中延长睡眠时间对超重成年人客观评估能量摄入的影响:一项随机临床试验。
JAMA Intern Med. 2022 Apr 1;182(4):365-374. doi: 10.1001/jamainternmed.2021.8098.
4
A data value metric for quantifying information content and utility.一种用于量化信息内容和效用的数据值指标。
J Big Data. 2021;8(1):82. doi: 10.1186/s40537-021-00446-6. Epub 2021 Jun 5.
5
Feature Selection and Feature Stability Measurement Method for High-Dimensional Small Sample Data Based on Big Data Technology.基于大数据技术的高维小样本数据特征选择与特征稳定性测量方法。
Comput Intell Neurosci. 2021 Sep 23;2021:3597051. doi: 10.1155/2021/3597051. eCollection 2021.
6
Effect of Air Pollution on Obesity in Children: A Systematic Review and Meta-Analysis.空气污染对儿童肥胖的影响:一项系统评价与荟萃分析
Children (Basel). 2021 Apr 23;8(5):327. doi: 10.3390/children8050327.
7
Machine Learning Models to Predict Childhood and Adolescent Obesity: A Review.机器学习模型预测儿童和青少年肥胖:综述。
Nutrients. 2020 Aug 16;12(8):2466. doi: 10.3390/nu12082466.
8
Youth Self Report Thought Problems and Sleep Difficulties Are Linked to Suicidal Ideation Among Psychiatrically Hospitalized Adolescents.青少年自评思维问题和睡眠困难与住院青少年的自杀意念有关。
J Child Adolesc Psychopharmacol. 2020 Oct;30(8):522-525. doi: 10.1089/cap.2019.0160. Epub 2020 Feb 13.
9
A comparative study on feature selection for a risk prediction model for colorectal cancer.用于结直肠癌风险预测模型的特征选择的比较研究。
Comput Methods Programs Biomed. 2019 Aug;177:219-229. doi: 10.1016/j.cmpb.2019.06.001. Epub 2019 Jun 4.
10
Increasing socioeconomic disparities in sedentary behaviors in Chinese children.中国儿童久坐行为的社会经济差异日益增大。
BMC Public Health. 2019 Jun 13;19(1):754. doi: 10.1186/s12889-019-7092-7.