Department of Physical Education, Seoul National University, Republic of Korea.
Department of Sports and Leisure Studies, Keimyung University, Republic of Korea.
Clin Nutr. 2022 Jan;41(1):144-152. doi: 10.1016/j.clnu.2021.11.027. Epub 2021 Nov 24.
BACKGROUND & AIMS: Lean muscle and fat mass in the human body are important indicators of the risk of cardiovascular and metabolic diseases. Techniques such as dual-energy X-ray absorptiometry (DXA) accurately measure body composition, but they are costly and not easily accessible. Multiple linear regression (MLR) models have been developed to estimate body composition using simple demographic and anthropometric measures instead of expensive techniques, but MLR models do not explore nonlinear interactions between inputs. In this study, we developed simple demographic and anthropometric measure-driven artificial neural network (ANN) models that can estimate lean muscle and fat mass more effectively than MLR models.
We extracted the demographic, anthropometric, and body composition measures of 20,137 participants from the National Health and Nutrition Examination Survey conducted between 1999 and 2006. We included 13 demographic and anthropometric measures as inputs for the ANN models and divided the dataset into training and validation sets (70:30 ratio) to build and cross-validate the models that estimate lean muscle and fat mass, which were originally measured using DXA. This process was repeated 100 times by randomly dividing the training and validation sets to eliminate any effect of data division on model performance. We built additional models separately for each sex and ethnicity, older individuals, and people with underlying diseases. The coefficient of determination (R) and standard error of estimate (SEE) were used to quantify the goodness of fit.
The ANN models yielded high R values between 0.923 and 0.981. These values were significantly higher than those of the MLR models (p < 0.001) in all cases. The percentage difference in R between the ANN and MLR models ranged between 0.40% ± 0.02% and 2.65% ± 0.27%. The SEE values of the ANN models, which were below 2 kg for all cases, were significantly lower than those of MLR models (p < 0.001). The percentage difference in SEE values between the ANN and MLR models ranged between -5.67% ± 0.39% and -22.32% ± 1.98%.
We developed and validated an inexpensive but effective method for estimating body composition using easily obtainable demographic and anthropometric data.
人体的瘦肌肉和脂肪量是心血管和代谢疾病风险的重要指标。双能 X 射线吸收法(DXA)等技术可以准确测量身体成分,但成本高且不易获得。已经开发了使用简单的人口统计学和人体测量学指标而不是昂贵技术来估计身体成分的多元线性回归(MLR)模型,但 MLR 模型并未探索输入之间的非线性相互作用。在这项研究中,我们开发了简单的人口统计学和人体测量学指标驱动的人工神经网络(ANN)模型,这些模型可以比 MLR 模型更有效地估计瘦肌肉和脂肪量。
我们从 1999 年至 2006 年进行的全国健康和营养检查调查中提取了 20,137 名参与者的人口统计学、人体测量学和身体成分测量值。我们将 13 个人口统计学和人体测量学指标作为 ANN 模型的输入,并将数据集分为训练集和验证集(70:30 比例),以建立和交叉验证最初使用 DXA 测量的瘦肌肉和脂肪量的模型。通过随机划分训练集和验证集,我们重复了 100 次此过程,以消除数据划分对模型性能的任何影响。我们分别为每个性别和种族、年龄较大的个体以及患有潜在疾病的个体建立了其他模型。决定系数(R)和估计标准误差(SEE)用于量化拟合优度。
ANN 模型产生的 R 值介于 0.923 和 0.981 之间。在所有情况下,这些值均明显高于 MLR 模型(p <0.001)。ANN 与 MLR 模型之间 R 值的差异百分比在 0.40%±0.02%至 2.65%±0.27%之间。ANN 模型的 SEE 值在所有情况下均低于 2kg,明显低于 MLR 模型(p <0.001)。ANN 与 MLR 模型之间 SEE 值的差异百分比在-5.67%±0.39%至-22.32%±1.98%之间。
我们开发并验证了一种使用易于获得的人口统计学和人体测量学数据来估算身体成分的廉价但有效的方法。