Northwestern University Feinberg School of Medicine, Chicago, Illinois, United States of America.
Virginia Commonwealth University School of Medicine, Richmond, Virginia, United States of America.
PLoS One. 2024 May 31;19(5):e0304509. doi: 10.1371/journal.pone.0304509. eCollection 2024.
Identification of associations between the obese category of weight in the general US population will continue to advance our understanding of the condition and allow clinicians, providers, communities, families, and individuals make more informed decisions. This study aims to improve the prediction of the obese category of weight and investigate its relationships with factors, ultimately contributing to healthier lifestyle choices and timely management of obesity.
Questionnaires that included demographic, dietary, exercise and health information from the US National Health and Nutrition Examination Survey (NHANES 2017-2020) were utilized with BMI 30 or higher defined as obesity. A machine learning model, XGBoost predicted the obese category of weight and Shapely Additive Explanations (SHAP) visualized the various covariates and their feature importance. Model statistics including Area under the receiver operator curve (AUROC), sensitivity, specificity, positive predictive value, negative predictive value and feature properties such as gain, cover, and frequency were measured. SHAP explanations were created for transparent and interpretable analysis.
There were 6,146 adults (age > 18) that were included in the study with average age 58.39 (SD = 12.94) and 3122 (51%) females. The machine learning model had an Area under the receiver operator curve of 0.8295. The top four covariates include waist circumference (gain = 0.185), GGT (gain = 0.101), platelet count (gain = 0.059), AST (gain = 0.057), weight (gain = 0.049), HDL cholesterol (gain = 0.032), and ferritin (gain = 0.034).
In conclusion, the utilization of machine learning models proves to be highly effective in accurately predicting the obese category of weight. By considering various factors such as demographic information, laboratory results, physical examination findings, and lifestyle factors, these models successfully identify crucial risk factors associated with the obese category of weight.
识别美国普通人群中肥胖类别的关联将继续加深我们对该病症的理解,并使临床医生、提供者、社区、家庭和个人能够做出更明智的决策。本研究旨在提高对肥胖类别的预测能力,并研究其与各种因素的关系,最终促进更健康的生活方式选择和及时管理肥胖症。
本研究使用了来自美国国家健康和营养检查调查(NHANES 2017-2020)的包含人口统计学、饮食、运动和健康信息的问卷,将 BMI 为 30 或更高定义为肥胖。XGBoost 机器学习模型预测肥胖类别,Shapely Additive Explanations(SHAP)可视化各种协变量及其特征重要性。测量模型统计数据,包括接收器操作曲线下的面积(AUROC)、灵敏度、特异性、阳性预测值、阴性预测值以及增益、覆盖率和频率等特征属性。创建 SHAP 解释以实现透明和可解释的分析。
本研究纳入了 6146 名年龄大于 18 岁的成年人,平均年龄为 58.39(标准差 = 12.94),其中 3122 名(51%)为女性。机器学习模型的 AUROC 为 0.8295。前四个最重要的协变量包括腰围(增益=0.185)、GGT(增益=0.101)、血小板计数(增益=0.059)、AST(增益=0.057)、体重(增益=0.049)、高密度脂蛋白胆固醇(增益=0.032)和铁蛋白(增益=0.034)。
总之,机器学习模型的应用证明在准确预测肥胖类别方面非常有效。通过考虑人口统计学信息、实验室结果、体检结果和生活方式因素等各种因素,这些模型成功地确定了与肥胖类别相关的关键风险因素。