Machine Learning Analysis of Nutrient Associations with Peripheral Arterial Disease: Insights from NHANES 1999-2004.

Authors

Wang Yi-Xuan, Kang Jin-Quan, Chen Zuo-Guan, Gao Shang, Zhao Wen-Xin, Zhao Ning, Lan Yong, Li Yong-Jun

Affiliations

Department of Vascular Surgery, Beijing Hospital, National Center of Gerontology, Institute of Geriatric Medicine, Chinese Academy of Medical Sciences, Beijing, China; Peking University Fifth School of Clinical Medicine, Beijing, China.

Beijing Information Science & Technology University, Beijing, China.

Publication information

Ann Vasc Surg. 2025 May;114:154-162. doi: 10.1016/j.avsg.2024.12.077. Epub 2025 Jan 30.

DOI: 10.1016/j.avsg.2024.12.077
PMID: 39892831

Abstract

BACKGROUND

Peripheral arterial disease (PAD) is a common manifestation of atherosclerosis, affecting over 200 million people worldwide, and its incidence is rising as the population ages. Common risk factors include smoking, diabetes, and hyperlipidemia, but the exact pathogenesis remains unclear. Nutritional intake has been associated with the onset and progression of PAD, and some studies suggest that specific nutrients may influence its development, although the evidence remains limited. This study explores the relationship between nutrition and PAD using machine learning techniques. Unlike traditional statistical methods, machine learning can capture complex, nonlinear relationships, which may allow a more comprehensive analysis of PAD risk factors.

METHODS

Data from the National Health and Nutrition Examination Survey (NHANES) 1999-2004 were analyzed, including demographic, clinical, and dietary information. Nutrient intake was assessed through 24-h dietary recalls collected with the computer-assisted dietary interview (CADI) system and the automated multiple-pass method (AMPM). PAD was defined as an ankle-brachial index (ABI) < 0.9. Six machine learning (ML) models were trained on a 70/30 train-test split: extreme gradient boosting (XGBoost), random forest (RF), naive Bayes classifier (NB), support vector machine (SVM), logistic regression (LR), and decision tree (DT). Missing data were imputed, and class imbalance was addressed with the synthetic minority oversampling technique (SMOTE). Model performance was evaluated using the area under the receiver operating characteristic curve (AUROC), accuracy, sensitivity, specificity, precision, recall, and F1 score. To improve interpretability, Shapley additive explanations (SHAP) analysis was used to identify the features with the greatest impact on PAD prediction and to quantify how each variable contributes to the model's output.
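
As an illustration of the split-then-oversample step, the following is a minimal sketch in plain Python. It is not the authors' code: the stratified split, the simplified SMOTE-style interpolation, the function names, and the toy data are all assumptions made for the example. It oversamples only the training portion, a common precaution; the abstract does not state the order the authors used.

```python
import random

def stratified_split(rows, train_frac=0.7, seed=0):
    """Split (features, label) rows into train/test, keeping the class ratio."""
    rng = random.Random(seed)
    train, test = [], []
    for label in sorted({y for _, y in rows}):
        group = [r for r in rows if r[1] == label]
        rng.shuffle(group)
        cut = round(train_frac * len(group))
        train += group[:cut]
        test += group[cut:]
    return train, test

def smote_like(minority, n_new, k=3, seed=0):
    """Simplified SMOTE: each synthetic point lies on the segment between a
    minority sample and one of its k nearest minority-class neighbours."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest neighbours of `base` within the minority class, excluding itself
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)),
        )[:k]
        other = rng.choice(neighbours)
        t = rng.random()  # interpolation weight in [0, 1)
        synthetic.append(tuple(a + t * (b - a) for a, b in zip(base, other)))
    return synthetic

# Hypothetical toy data: 10% positives, loosely mimicking the PAD class imbalance
rng = random.Random(42)
rows = [((rng.gauss(0, 1), rng.gauss(0, 1)), 0) for _ in range(90)]
rows += [((rng.gauss(2, 1), rng.gauss(2, 1)), 1) for _ in range(10)]

train, test = stratified_split(rows)
train_pos = [x for x, y in train if y == 1]
train_neg = [x for x, y in train if y == 0]

# Add synthetic positives until both classes are the same size in the training set
new_pos = smote_like(train_pos, n_new=len(train_neg) - len(train_pos))
balanced_train = train + [(x, 1) for x in new_pos]
```

The test set is left untouched, so evaluation metrics reflect the original class distribution rather than the artificially balanced one.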

RESULTS

Of 31,126 participants, 4,520 met the inclusion criteria (mean age 61.2 ± 13.5 years; 48.8% male), and 441 (9.7%) had ABI < 0.9. XGBoost outperformed the other models, achieving an AUROC of 0.913 (95% CI, 0.891-0.936) and an F1 score of 0.932. With SMOTE, its AUROC improved to 0.926 (95% CI, 0.889-0.936) and its F1 score to 0.937. SHAP analysis identified vitamin C, saturated fatty acids, selenium, phosphorus, and protein intake as key predictors of PAD.
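
To make the reported metrics concrete, here is a short sketch of how they follow from a confusion matrix, and why accuracy alone is uninformative at this prevalence. Only the cohort counts (4,520 and 441) come from the abstract; the confusion-matrix counts are hypothetical, and the F1 here is computed for the positive (PAD) class, which may differ from the averaging the authors used.

```python
def safe_div(num, den):
    """Return num / den, or 0.0 when the denominator is zero."""
    return num / den if den else 0.0

def classification_metrics(tp, fp, tn, fn):
    """Standard binary metrics from confusion-matrix counts."""
    sensitivity = safe_div(tp, tp + fn)   # also called recall
    precision = safe_div(tp, tp + fp)
    return {
        "accuracy": safe_div(tp + tn, tp + fp + tn + fn),
        "sensitivity": sensitivity,
        "specificity": safe_div(tn, tn + fp),
        "precision": precision,
        "f1": safe_div(2 * precision * sensitivity, precision + sensitivity),
    }

# Counts from the abstract: 441 of 4,520 participants had ABI < 0.9
n_total, n_pad = 4520, 441
prevalence = n_pad / n_total  # about 0.0976, reported as 9.7% in the abstract

# A classifier that always predicts "no PAD" is right for every non-PAD case
always_negative = classification_metrics(tp=0, fp=0, tn=n_total - n_pad, fn=n_pad)

# Hypothetical counts for a 30% test set of 1,356 participants (illustration only)
example = classification_metrics(tp=100, fp=40, tn=1184, fn=32)
```

The always-negative baseline reaches roughly 90% accuracy with zero sensitivity, which is why imbalance-aware measures such as AUROC and F1 are reported alongside accuracy. Sensitivity and recall are the same quantity under two names.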

CONCLUSION

This is the first study to apply ML algorithms to examine nutrient intake and PAD in a general population. Vitamin C and phosphorus showed negative correlations with PAD, while saturated fatty acids, protein, and selenium exhibited positive associations.
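
The sign of a SHAP value indicates whether a feature pushes a prediction above or below a baseline. The sketch below computes exact Shapley values by brute force for a hypothetical three-feature risk score. The score, its coefficients, and the feature values are invented for illustration and are not the paper's model; SHAP tooling typically relies on faster algorithms than this enumeration.

```python
from itertools import permutations

def shapley_values(model, x, baseline):
    """Exact Shapley values of model(x) relative to model(baseline), obtained by
    averaging each feature's marginal contribution over all feature orderings."""
    n = len(x)
    phi = [0.0] * n
    orderings = list(permutations(range(n)))
    for order in orderings:
        current = list(baseline)
        prev = model(current)
        for i in order:
            current[i] = x[i]          # switch feature i from baseline to its actual value
            new = model(current)
            phi[i] += new - prev       # marginal contribution in this ordering
            prev = new
    return [p / len(orderings) for p in phi]

def toy_risk(features):
    """Hypothetical risk score (NOT the paper's model): vitamin C lowers it,
    saturated fat and selenium raise it, with one interaction term."""
    vitamin_c, sat_fat, selenium = features
    return -0.5 * vitamin_c + 0.8 * sat_fat + 0.3 * selenium + 0.2 * sat_fat * selenium

x = [2.0, 1.0, 1.0]         # hypothetical standardized intakes for one person
baseline = [0.0, 0.0, 0.0]  # hypothetical reference intake
phi = shapley_values(toy_risk, x, baseline)
```

Here the vitamin C contribution is negative and the other two are positive, and the contributions sum to the difference between the prediction and the baseline. The interaction term is shared equally between the two features involved. Such signs describe how the model behaves, not a causal effect of the nutrient.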

Similar articles

1
Machine Learning Analysis of Nutrient Associations with Peripheral Arterial Disease: Insights from NHANES 1999-2004.
Ann Vasc Surg. 2025 May;114:154-162. doi: 10.1016/j.avsg.2024.12.077. Epub 2025 Jan 30.
2
Identifying cardiovascular disease risk in the U.S. population using environmental volatile organic compounds exposure: A machine learning predictive model based on the SHAP methodology.
Ecotoxicol Environ Saf. 2024 Nov 1;286:117210. doi: 10.1016/j.ecoenv.2024.117210. Epub 2024 Oct 23.
3
Development of a machine learning model related to explore the association between heavy metal exposure and alveolar bone loss among US adults utilizing SHAP: a study based on NHANES 2015-2018.
BMC Public Health. 2025 Feb 4;25(1):455. doi: 10.1186/s12889-025-21658-y.
4
Predictive model and risk analysis for peripheral vascular disease in type 2 diabetes mellitus patients using machine learning and shapley additive explanation.
Front Endocrinol (Lausanne). 2024 Feb 28;15:1320335. doi: 10.3389/fendo.2024.1320335. eCollection 2024.
5
Prediction and feature selection of low birth weight using machine learning algorithms.
J Health Popul Nutr. 2024 Oct 12;43(1):157. doi: 10.1186/s41043-024-00647-8.
6
Prediction of depressive disorder using machine learning approaches: findings from the NHANES.
BMC Med Inform Decis Mak. 2025 Feb 17;25(1):83. doi: 10.1186/s12911-025-02903-1.
7
Prediction of lumbar disc degeneration based on interpretable machine learning models: retrospective cohort study.
Spine J. 2025 Apr 9. doi: 10.1016/j.spinee.2025.04.004.
8
Prediction of sepsis mortality in ICU patients using machine learning methods.
BMC Med Inform Decis Mak. 2024 Aug 16;24(1):228. doi: 10.1186/s12911-024-02630-z.
9
Dietary fatty acids and peripheral artery disease in adults.
Atherosclerosis. 2012 Jun;222(2):545-50. doi: 10.1016/j.atherosclerosis.2012.03.029. Epub 2012 Apr 7.
10
Learning from the machine: is diabetes in adults predicted by lifestyle variables? A retrospective predictive modelling study of NHANES 2007-2018.
BMJ Open. 2025 Mar 22;15(3):e096595. doi: 10.1136/bmjopen-2024-096595.