Nutrition and Clinical Trials Unit, GENYAL Platform IMDEA-Food Institute, CEI UAM+CSIC, 28049, Madrid, Spain.
Biostatistics and Bioinformatics Unit, IMDEA-Food Institute, CEI UAM+CSIC, 28049, Madrid, Spain.
Sci Rep. 2021 Jan 21;11(1):1910. doi: 10.1038/s41598-021-81205-8.
The increased prevalence of childhood obesity is expected to translate in the near future into a concomitant soaring of multiple cardio-metabolic diseases. Obesity has a complex, multifactorial etiology, that includes multiple and multidomain potential risk factors: genetics, dietary and physical activity habits, socio-economic environment, lifestyle, etc. In addition, all these factors are expected to exert their influence through a specific and especially convoluted way during childhood, given the fast growth along this period. Machine Learning methods are the appropriate tools to model this complexity, given their ability to cope with high-dimensional, non-linear data. Here, we have analyzed by Machine Learning a sample of 221 children (6-9 years) from Madrid, Spain. Both Random Forest and Gradient Boosting Machine models have been derived to predict the body mass index from a wide set of 190 multidomain variables (including age, sex, genetic polymorphisms, lifestyle, socio-economic, diet, exercise, and gestation ones). A consensus relative importance of the predictors has been estimated through variable importance measures, implemented robustly through an iterative process that included permutation and multiple imputation. We expect this analysis will help to shed light on the most important variables associated to childhood obesity, in order to choose better treatments for its prevention.
儿童肥胖症的患病率预计在不久的将来会相应地急剧上升,导致多种心血管代谢疾病的发病率上升。肥胖症具有复杂的多因素病因,包括多种潜在的风险因素:遗传、饮食和体育活动习惯、社会经济环境、生活方式等。此外,所有这些因素预计会在儿童时期通过特定的、特别复杂的方式发挥影响,因为在这个时期,儿童的生长速度很快。机器学习方法是建模这种复杂性的合适工具,因为它们能够处理高维、非线性数据。在这里,我们通过机器学习分析了来自西班牙马德里的 221 名儿童(6-9 岁)的样本。我们已经从广泛的 190 多维变量(包括年龄、性别、遗传多态性、生活方式、社会经济、饮食、运动和妊娠)中得出了随机森林和梯度提升机模型,以预测体重指数。通过变量重要性度量,稳健地通过包括置换和多重插补的迭代过程来估计预测因子的相对重要性。我们希望这项分析将有助于阐明与儿童肥胖相关的最重要的变量,以便为预防肥胖症选择更好的治疗方法。