• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于预测心血管疾病和癌症与膳食抗氧化剂合并症的机器学习及SHAP值解释

Machine learning and SHAP value interpretation for predicting comorbidity of cardiovascular disease and cancer with dietary antioxidants.

作者信息

Qi Xiangjun, Wang Shujing, Fang Caishan, Jia Jie, Lin Lizhu, Yuan Tianhui

机构信息

The First Clinical Medical College, Guangzhou University of Chinese Medicine, Guangzhou, 510000, China.

Hospital of Chengdu University of Traditional Chinese Medicine, Chengdu University of Traditional Chinese Medicine, Chengdu, 610031, China; Yong Loo Lin School of Medicine, National University of Singapore, 117597, Singapore.

出版信息

Redox Biol. 2025 Feb;79:103470. doi: 10.1016/j.redox.2024.103470. Epub 2024 Dec 16.

DOI:10.1016/j.redox.2024.103470
PMID:39700695
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11729017/
Abstract

OBJECTIVE

To develop and validate a machine learning model incorporating dietary antioxidants to predict cardiovascular disease (CVD)-cancer comorbidity and to elucidate the role of antioxidants in disease prediction.

METHODS

Data were sourced from the National Health and Nutrition Examination Survey. Antioxidants, including vitamins, minerals, and polyphenols, were selected as key features. Additionally, demographic, lifestyle, and health condition features were incorporated to improve model accuracy. Feature preprocessing included removing collinear features, addressing class imbalance, and normalizing data. Models constructed within the mlr3 framework included recursive partitioning and regression trees, random forest, kernel k-nearest neighbors, naïve bayes, and light gradient boosting machine (LightGBM). Benchmarking provided a systematic approach to evaluating and comparing model performance. SHapley Additive exPlanation (SHAP) values were calculated to determine the prediction role of each feature in the model with the highest predictive performance.

RESULTS

This analysis included 10,064 participants, with 353 identified as having comorbid CVD and cancer. After excluding collinear features, the machine learning model retained 29 dietary antioxidant features and 9 baseline features. LightGBM achieved the highest predictive accuracy at 87.9 %, a classification error rate of 12.1 %, and the top area under the receiver operating characteristic curve (0.951) and the precision-recall curve (0.930). LightGBM also demonstrated balanced sensitivity and specificity, both close to 88 %. SHAP analysis indicated that naringenin, magnesium, theaflavin, kaempferol, hesperetin, selenium, malvidin, and vitamin C were the most influential contributors.

CONCLUSION

LightGBM exhibited the best performance for predicting CVD-cancer comorbidity. SHAP values highlighted the importance of antioxidants, with naringenin and magnesium emerging as primary factors in this model.

摘要

目的

开发并验证一个纳入膳食抗氧化剂的机器学习模型,以预测心血管疾病(CVD)-癌症合并症,并阐明抗氧化剂在疾病预测中的作用。

方法

数据来源于美国国家健康与营养检查调查。选择包括维生素、矿物质和多酚在内的抗氧化剂作为关键特征。此外,纳入人口统计学、生活方式和健康状况特征以提高模型准确性。特征预处理包括去除共线特征、解决类别不平衡问题以及对数据进行归一化处理。在mlr3框架内构建的模型包括递归划分与回归树、随机森林、核k近邻、朴素贝叶斯和轻梯度提升机(LightGBM)。基准测试提供了一种评估和比较模型性能的系统方法。计算SHapley加性解释(SHAP)值,以确定每个特征在预测性能最高的模型中的预测作用。

结果

该分析纳入了10,064名参与者,其中353人被确定患有CVD和癌症合并症。排除共线特征后,机器学习模型保留了29个膳食抗氧化剂特征和9个基线特征。LightGBM的预测准确率最高,为87.9%,分类错误率为12.1%,在受试者工作特征曲线下面积(0.951)和精确召回率曲线下面积(0.930)方面排名第一。LightGBM还表现出平衡的敏感性和特异性,均接近88%。SHAP分析表明,柚皮素、镁、茶黄素、山奈酚、橙皮素、硒、锦葵色素和维生素C是最具影响力的因素。

结论

LightGBM在预测CVD-癌症合并症方面表现出最佳性能。SHAP值突出了抗氧化剂的重要性,柚皮素和镁成为该模型中的主要因素。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3f9/11729017/1069c539e2f3/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3f9/11729017/bd7a4c1d3d6a/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3f9/11729017/56c2212d07d1/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3f9/11729017/1fd4d8c27d43/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3f9/11729017/38fca391a2e6/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3f9/11729017/1069c539e2f3/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3f9/11729017/bd7a4c1d3d6a/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3f9/11729017/56c2212d07d1/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3f9/11729017/1fd4d8c27d43/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3f9/11729017/38fca391a2e6/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b3f9/11729017/1069c539e2f3/gr5.jpg

相似文献

1
Machine learning and SHAP value interpretation for predicting comorbidity of cardiovascular disease and cancer with dietary antioxidants.用于预测心血管疾病和癌症与膳食抗氧化剂合并症的机器学习及SHAP值解释
Redox Biol. 2025 Feb;79:103470. doi: 10.1016/j.redox.2024.103470. Epub 2024 Dec 16.
2
Identifying cardiovascular disease risk in the U.S. population using environmental volatile organic compounds exposure: A machine learning predictive model based on the SHAP methodology.利用环境挥发性有机化合物暴露识别美国人群心血管疾病风险:基于 SHAP 方法的机器学习预测模型。
Ecotoxicol Environ Saf. 2024 Nov 1;286:117210. doi: 10.1016/j.ecoenv.2024.117210. Epub 2024 Oct 23.
3
Machine learning algorithms identify hypokalaemia risk in people with hypertension in the United States National Health and Nutrition Examination Survey 1999-2018.机器学习算法在美国国家健康与营养调查 1999-2018 中识别出高血压人群中的低钾血症风险。
Ann Med. 2023 Dec;55(1):2209336. doi: 10.1080/07853890.2023.2209336.
4
Machine learning-based predictive models for perioperative major adverse cardiovascular events in patients with stable coronary artery disease undergoing noncardiac surgery.基于机器学习的预测模型用于接受非心脏手术的稳定冠状动脉疾病患者围手术期主要不良心血管事件的预测
Comput Methods Programs Biomed. 2025 Mar;260:108561. doi: 10.1016/j.cmpb.2024.108561. Epub 2024 Dec 13.
5
Machine Learning Analysis of Nutrient Associations with Peripheral Arterial Disease: Insights from NHANES 1999-2004.营养物质与外周动脉疾病关联的机器学习分析:来自1999 - 2004年美国国家健康与营养检查调查(NHANES)的见解
Ann Vasc Surg. 2025 May;114:154-162. doi: 10.1016/j.avsg.2024.12.077. Epub 2025 Jan 30.
6
Prediction of STAS in lung adenocarcinoma with nodules ≤ 2 cm using machine learning: a multicenter retrospective study.使用机器学习预测直径≤2 cm的肺腺癌中的STAS:一项多中心回顾性研究
BMC Cancer. 2025 Mar 7;25(1):417. doi: 10.1186/s12885-025-13783-z.
7
Predicting grip strength-related frailty in middle-aged and older Chinese adults using interpretable machine learning models: a prospective cohort study.使用可解释机器学习模型预测中国中老年人握力相关的衰弱:一项前瞻性队列研究
Front Public Health. 2024 Dec 17;12:1489848. doi: 10.3389/fpubh.2024.1489848. eCollection 2024.
8
Development and validation of an interpretable machine learning model for predicting in-hospital mortality for ischemic stroke patients in ICU.用于预测ICU中缺血性中风患者院内死亡率的可解释机器学习模型的开发与验证
Int J Med Inform. 2025 Jun;198:105874. doi: 10.1016/j.ijmedinf.2025.105874. Epub 2025 Mar 9.
9
Creating machine learning models that interpretably link systemic inflammatory index, sex steroid hormones, and dietary antioxidants to identify gout using the SHAP (SHapley Additive exPlanations) method.使用 SHAP(Shapley Additive exPlanations)方法创建可解释的机器学习模型,将系统性炎症指数、性激素和膳食抗氧化剂联系起来,以识别痛风。
Front Immunol. 2024 May 1;15:1367340. doi: 10.3389/fimmu.2024.1367340. eCollection 2024.
10
Explainable machine learning model for prediction of 28-day all-cause mortality in immunocompromised patients in the intensive care unit: a retrospective cohort study based on MIMIC-IV database.用于预测重症监护病房免疫功能低下患者28天全因死亡率的可解释机器学习模型:一项基于MIMIC-IV数据库的回顾性队列研究
Eur J Med Res. 2025 May 3;30(1):358. doi: 10.1186/s40001-025-02622-3.

引用本文的文献

1
Enhancing one-year mortality prediction in STEMI patients post-PCI: an interpretable machine learning model with risk stratification.提高ST段抬高型心肌梗死患者PCI术后一年死亡率预测:一种具有风险分层的可解释机器学习模型
Front Artif Intell. 2025 Aug 22;8:1618492. doi: 10.3389/frai.2025.1618492. eCollection 2025.
2
Development and validation of an interpretable multi-task model to predict outcomes in patients with rhabdomyolysis: a multicenter retrospective cohort study.用于预测横纹肌溶解症患者预后的可解释多任务模型的开发与验证:一项多中心回顾性队列研究
EClinicalMedicine. 2025 Aug 21;87:103438. doi: 10.1016/j.eclinm.2025.103438. eCollection 2025 Sep.
3
Research hotspots and frontiers of application of mass spectrometry breath test in respiratory diseases.
质谱呼气试验在呼吸系统疾病中的应用研究热点与前沿
Front Med (Lausanne). 2025 Aug 13;12:1618588. doi: 10.3389/fmed.2025.1618588. eCollection 2025.
4
Anthropometric, Nutritional, and Lifestyle Factors Involved in Predicting Food Addiction: An Agnostic Machine Learning Approach.预测食物成瘾的人体测量学、营养和生活方式因素:一种无偏机器学习方法。
Diseases. 2025 Jul 24;13(8):236. doi: 10.3390/diseases13080236.
5
Evaluating the Oxidative Balance Score for Peripheral Artery Disease Risk: Integrating Epidemiologic Modeling and SHAP-Interpretable Machine Learning in NHANES.评估外周动脉疾病风险的氧化平衡评分:在国家健康与营养检查调查中整合流行病学建模和SHAP可解释机器学习
Food Sci Nutr. 2025 Aug 17;13(8):e70798. doi: 10.1002/fsn3.70798. eCollection 2025 Aug.
6
Development and Validation of An Interpretable Machine Learning-Based Prediction Model of Postpartum Hemorrhage in Placenta Previa Following Cesarean Section: A Multicenter Study.剖宫产术后前置胎盘产后出血的可解释机器学习预测模型的开发与验证:一项多中心研究
Reprod Sci. 2025 Aug 12. doi: 10.1007/s43032-025-01937-0.
7
Machine learning and SHAP value interpretation for predicting the response to neoadjuvant chemotherapy and long-term clinical outcomes in Chinese female breast cancer.机器学习与SHAP值解读用于预测中国女性乳腺癌新辅助化疗反应及长期临床结局
Ann Med. 2025 Dec;57(1):2541316. doi: 10.1080/07853890.2025.2541316. Epub 2025 Aug 3.
8
Identification and validation of an explainable machine learning model for vascular depression diagnosis in the older adults: a multicenter cohort study.老年人血管性抑郁诊断的可解释机器学习模型的识别与验证:一项多中心队列研究
BMC Med. 2025 Jul 31;23(1):448. doi: 10.1186/s12916-025-04283-9.
9
Machine learning and SHAP value interpretation for predicting cardiovascular disease risk in patients with diabetes using dietary antioxidants.利用膳食抗氧化剂预测糖尿病患者心血管疾病风险的机器学习与SHAP值解释
Front Nutr. 2025 Jul 16;12:1612369. doi: 10.3389/fnut.2025.1612369. eCollection 2025.
10
Association of dietary quality, biological aging, progression and mortality of cardiovascular-kidney-metabolic syndrome: insights from mediation and machine learning approaches.饮食质量、生物衰老、心血管-肾脏-代谢综合征的进展及死亡率之间的关联:来自中介分析和机器学习方法的见解
Nutr J. 2025 Jul 7;24(1):105. doi: 10.1186/s12937-025-01175-9.