Kim Hyerim, Lim Dong Hoon, Kim Yoona
Department of Food and Nutrition, Gyeongsang National University, Jinju 52828, Gyeongnam, Korea.
Department of Information & Statistics, Department of Bio & Medical Big Data (BK21 Four Program) and RINS, Gyeongsang National University, Jinju 52828, Gyeongnam, Korea.
Int J Environ Res Public Health. 2021 May 24;18(11):5597. doi: 10.3390/ijerph18115597.
Few studies have been conducted to classify and predict the influence of nutritional intake on overweight/obesity, dyslipidemia, hypertension and type 2 diabetes mellitus (T2DM) based on deep learning such as deep neural network (DNN). The present study aims to classify and predict associations between nutritional intake and risk of overweight/obesity, dyslipidemia, hypertension and T2DM by developing a DNN model, and to compare a DNN model with the most popular machine learning models such as logistic regression and decision tree. Subjects aged from 40 to 69 years in the 4-7th (from 2007 through 2018) Korea National Health and Nutrition Examination Survey (KNHANES) were included. Diagnostic criteria of dyslipidemia ( = 10,731), hypertension ( = 10,991), T2DM ( = 3889) and overweight/obesity ( = 10,980) were set as dependent variables. Nutritional intakes were set as independent variables. A DNN model comprising one input layer with 7 nodes, three hidden layers with 30 nodes, 12 nodes, 8 nodes in each layer and one output layer with one node were implemented in Python programming language using Keras with tensorflow backend. In DNN, binary cross-entropy loss function for binary classification was used with Adam optimizer. For avoiding overfitting, dropout was applied to each hidden layer. Structural equation modelling (SEM) was also performed to simultaneously estimate multivariate causal association between nutritional intake and overweight/obesity, dyslipidemia, hypertension and T2DM. The DNN model showed the higher prediction accuracy with 0.58654 for dyslipidemia, 0.79958 for hypertension, 0.80896 for T2DM and 0.62496 for overweight/obesity compared with two other machine leaning models with five-folds cross-validation. Prediction accuracy for dyslipidemia, hypertension, T2DM and overweight/obesity were 0.58448, 0.79929, 0.80818 and 0.62486, respectively, when analyzed by a logistic regression, also were 0.52148, 0.66773, 0.71587 and 0.54026, respectively, when analyzed by a decision tree. This study observed a DNN model with three hidden layers with 30 nodes, 12 nodes, 8 nodes in each layer had better prediction accuracy than two conventional machine learning models of a logistic regression and decision tree.
很少有研究基于深度学习(如深度神经网络,DNN)对营养摄入对超重/肥胖、血脂异常、高血压和2型糖尿病(T2DM)的影响进行分类和预测。本研究旨在通过开发一个DNN模型来对营养摄入与超重/肥胖、血脂异常、高血压和T2DM风险之间的关联进行分类和预测,并将DNN模型与最流行的机器学习模型(如逻辑回归和决策树)进行比较。纳入了2007年至2018年第4至7轮韩国国家健康与营养检查调查(KNHANES)中年龄在40至69岁之间的受试者。将血脂异常(n = 10731)、高血压(n = 10991)、T2DM(n = 3889)和超重/肥胖(n = 10980)的诊断标准设为因变量。将营养摄入设为自变量。使用Python编程语言,借助Keras和tensorflow后端实现了一个DNN模型,该模型包括一个具有7个节点的输入层、三个分别具有30个节点、12个节点、8个节点的隐藏层以及一个具有1个节点的输出层。在DNN中,二元分类使用二元交叉熵损失函数和Adam优化器。为避免过拟合,对每个隐藏层应用了随机失活。还进行了结构方程建模(SEM),以同时估计营养摄入与超重/肥胖、血脂异常、高血压和T2DM之间的多变量因果关联。与其他两个机器学习模型进行五折交叉验证相比,DNN模型对血脂异常的预测准确率为0.58654,对高血压为0.79958,对T2DM为0.80896,对超重/肥胖为0.62496。通过逻辑回归分析时,血脂异常、高血压、T2DM和超重/肥胖的预测准确率分别为0.58448、0.79929、0.80818和0.62486;通过决策树分析时,分别为0.52148、0.66773、0.71587和0.54026。本研究观察到一个具有三个分别包含30个节点、12个节点、8个节点的隐藏层的DNN模型比逻辑回归和决策树这两个传统机器学习模型具有更好的预测准确率。
Int J Environ Res Public Health. 2022-11-19
Med Clin (Barc). 2014-6-6
Biomed Environ Sci. 2005-12
BMC Public Health. 2024-12-28
Front Physiol. 2024-10-24
Bioengineering (Basel). 2024-5-23
Cochrane Database Syst Rev. 2020-12-12
J Am Heart Assoc. 2020-6-16
BMC Med Inform Decis Mak. 2019-11-6