• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的动脉粥样硬化性心血管疾病生活方式风险因素分析:回顾性病例对照研究

Machine Learning-Based Analysis of Lifestyle Risk Factors for Atherosclerotic Cardiovascular Disease: Retrospective Case-Control Study.

作者信息

Kim Hye-Jin, Choi Heeji, Ahn Hyo-Jung, Shin Seung-Ho, Kim Chulho, Lee Sang-Hwa, Sohn Jong-Hee, Lee Jae-Jun

机构信息

Artificial Intelligence Research Center, College of Medicine, Hallym University, Chuhcneon-si, Republic of Korea.

Health Insurance Review and Assessment Research Institute, Health Insurance Review and Assessment Service, Wonju-si, Republic of Korea.

出版信息

JMIR Med Inform. 2025 Aug 7;13:e74415. doi: 10.2196/74415.

DOI:10.2196/74415
PMID:40773657
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12330983/
Abstract

BACKGROUND

The risk of developing atherosclerotic cardiovascular disease (ASCVD) varies among individuals and is related to a variety of lifestyle factors in addition to the presence of chronic diseases.

OBJECTIVE

We aimed to assess the predictive accuracy of machine learning (ML) models incorporating lifestyle risk behaviors for ASCVD risk using the Korean nationwide database.

METHODS

Using data from the Korea National Health and Nutrition Examination Survey, 5 ML algorithms were used for the prediction of high ASCVD risk: logistic regression (LR), support vector machine, random forest, extreme gradient boosting, and light gradient boosting models. ASCVD risk was assessed using the pooled cohort equations, with a high-risk threshold of ≥7.5% over 10 years. Among the 8573 participants aged 40-79 years, propensity score matching (PSM) was used to adjust for demographic confounders. We divided the dataset into a training and a test dataset in an 8:2 ratio. We also used bootstrapping to train the ML model with the area under the receiver operating characteristics curve score. Shapley additive explanations were used to identify the models' important variables in assessing high ASCVD risks. In sensitivity analysis, we additionally performed binary LR analysis, in which the ML model's results were consistent with the conventional statistical model.

RESULTS

Of the 8573 participants, 41.7% (n=3578) had high ASCVD risk. Before PSM, age and sex differed significantly between groups. PSM (1:1) yielded 1976 patients with balanced demographics. After PSM, the high ASCVD risk group had higher alcohol or tobacco use, lower omega-3 intake, higher BMI, less physical activity, and spent less time sitting. In 5 ML models, the extreme gradient boosting model showed the highest area under the receiver operating characteristics curve, indicating superior overall discrimination between high and low ASCVD risk groups. However, the light gradient boosting model demonstrated better performance in accuracy, recall, and F1-score. Variable importance analysis using Shapley additive explanations identified smoking and age as the strongest predictors, while BMI, sodium or omega-3 intake, and low-density lipoprotein cholesterol also had significant variables. Sensitivity analysis using multivariable LR analysis also confirmed these findings, showing that smoking, BMI, and low-density lipoprotein cholesterol increased ASCVD risk, whereas omega-3 intake and physical activity were associated with lower risk.

CONCLUSIONS

Analyzing lifestyle behavioral factors in ASCVD risk with an ML model improves the predictive performance compared to traditional models. Personalized prevention strategies tailored to an individual's lifestyle can effectively reduce ASCVD risk.

摘要

背景

动脉粥样硬化性心血管疾病(ASCVD)的发病风险因人而异,除了慢性病外,还与多种生活方式因素有关。

目的

我们旨在使用韩国全国性数据库评估纳入生活方式风险行为的机器学习(ML)模型对ASCVD风险的预测准确性。

方法

利用韩国国民健康与营养检查调查的数据,使用5种ML算法预测高ASCVD风险:逻辑回归(LR)、支持向量机、随机森林、极端梯度提升和轻梯度提升模型。使用汇总队列方程评估ASCVD风险,10年高风险阈值为≥7.5%。在8573名40-79岁的参与者中,采用倾向得分匹配(PSM)来调整人口统计学混杂因素。我们将数据集按8:2的比例分为训练集和测试集。我们还使用自助法训练具有受试者操作特征曲线下面积得分的ML模型。使用夏普利值附加解释来识别模型在评估高ASCVD风险时的重要变量。在敏感性分析中,我们额外进行了二元LR分析,其中ML模型的结果与传统统计模型一致。

结果

在8573名参与者中,41.7%(n=3578)有高ASCVD风险。在PSM之前,两组之间的年龄和性别差异显著。PSM(1:1)产生了1976名人口统计学特征均衡的患者。PSM后,高ASCVD风险组的酒精或烟草使用量更高、ω-3摄入量更低、BMI更高、身体活动更少且久坐时间更少。在5种ML模型中,极端梯度提升模型的受试者操作特征曲线下面积最高,表明在区分高和低ASCVD风险组方面总体辨别能力更强。然而,轻梯度提升模型在准确性、召回率和F1分数方面表现更好。使用夏普利值附加解释的变量重要性分析确定吸烟和年龄是最强的预测因素,而BMI、钠或ω-3摄入量以及低密度脂蛋白胆固醇也是重要变量。使用多变量LR分析的敏感性分析也证实了这些发现,表明吸烟、BMI和低密度脂蛋白胆固醇会增加ASCVD风险,而ω-3摄入量和身体活动与较低风险相关。

结论

与传统模型相比,使用ML模型分析ASCVD风险中的生活方式行为因素可提高预测性能。根据个人生活方式量身定制的个性化预防策略可有效降低ASCVD风险。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12330983/6377557296e4/medinform-v13-e74415-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12330983/f96fefd97c86/medinform-v13-e74415-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12330983/1df8f919b3c8/medinform-v13-e74415-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12330983/6377557296e4/medinform-v13-e74415-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12330983/f96fefd97c86/medinform-v13-e74415-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12330983/1df8f919b3c8/medinform-v13-e74415-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e1de/12330983/6377557296e4/medinform-v13-e74415-g003.jpg

相似文献

1
Machine Learning-Based Analysis of Lifestyle Risk Factors for Atherosclerotic Cardiovascular Disease: Retrospective Case-Control Study.基于机器学习的动脉粥样硬化性心血管疾病生活方式风险因素分析:回顾性病例对照研究
JMIR Med Inform. 2025 Aug 7;13:e74415. doi: 10.2196/74415.
2
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
3
Machine Learning Model for Predicting Coronary Heart Disease Risk: Development and Validation Using Insights From a Japanese Population-Based Study.预测冠心病风险的机器学习模型:基于日本人群研究的见解进行开发与验证
JMIR Cardio. 2025 May 12;9:e68066. doi: 10.2196/68066.
4
Prediction of Insulin Resistance in Nondiabetic Population Using LightGBM and Cohort Validation of Its Clinical Value: Cross-Sectional and Retrospective Cohort Study.使用LightGBM预测非糖尿病人群的胰岛素抵抗及其临床价值的队列验证:横断面和回顾性队列研究
JMIR Med Inform. 2025 Jun 13;13:e72238. doi: 10.2196/72238.
5
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
6
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型:基于多中心队列研究的开发与验证研究
J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
7
Tailoring Risk Prediction Models to Local Populations.针对当地人群定制风险预测模型。
JAMA Cardiol. 2024 Nov 1;9(11):1018-1028. doi: 10.1001/jamacardio.2024.2912.
8
Development and validation of a machine learning-based model for predicting intraoperative blood loss during burn surgery.基于机器学习的烧伤手术术中失血量预测模型的开发与验证
Surgery. 2025 Aug;184:109445. doi: 10.1016/j.surg.2025.109445. Epub 2025 May 29.
9
Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm.缺失数据的存在是否会影响 SORG 机器学习算法在脊柱转移瘤患者中的性能?开发一种互联网应用算法。
Clin Orthop Relat Res. 2024 Jan 1;482(1):143-157. doi: 10.1097/CORR.0000000000002706. Epub 2023 Jun 12.
10
Predicting Early-Onset Colorectal Cancer in Individuals Below Screening Age Using Machine Learning and Real-World Data: Case Control Study.利用机器学习和真实世界数据预测筛查年龄以下个体的早发性结直肠癌:病例对照研究
JMIR Cancer. 2025 Jun 19;11:e64506. doi: 10.2196/64506.

本文引用的文献

1
Lifelong prevention of atherosclerotic cardiovascular disease (ASCVD) through LDL-C control - Means and cost/benefit of sustained very low lifetime LDL-C targets.通过控制低密度脂蛋白胆固醇(LDL-C)实现动脉粥样硬化性心血管疾病(ASCVD)的终身预防——维持极低终身LDL-C目标的方法及成本效益
Am J Prev Cardiol. 2024 Aug 11;19:100720. doi: 10.1016/j.ajpc.2024.100720. eCollection 2024 Sep.
2
Global Prevalence of Overweight and Obesity in Children and Adolescents: A Systematic Review and Meta-Analysis.全球儿童和青少年超重和肥胖的患病率:系统评价和荟萃分析。
JAMA Pediatr. 2024 Aug 1;178(8):800-813. doi: 10.1001/jamapediatrics.2024.1576.
3
Smoking cessation and vascular endothelial function.
戒烟与血管内皮功能。
Hypertens Res. 2023 Dec;46(12):2670-2678. doi: 10.1038/s41440-023-01455-z. Epub 2023 Oct 12.
4
Sex-specific differences in cardiovascular risk factors and implications for cardiovascular disease prevention in women.女性心血管危险因素的性别特异性差异及其对心血管疾病预防的影响。
Atherosclerosis. 2023 Nov;384:117269. doi: 10.1016/j.atherosclerosis.2023.117269. Epub 2023 Sep 4.
5
Recent applications of machine learning and deep learning models in the prediction, diagnosis, and management of diabetes: a comprehensive review.机器学习和深度学习模型在糖尿病预测、诊断及管理中的最新应用:一项全面综述
Diabetol Metab Syndr. 2022 Dec 27;14(1):196. doi: 10.1186/s13098-022-00969-9.
6
The Role of Oxidative Stress in Atherosclerosis.氧化应激在动脉粥样硬化中的作用。
Cells. 2022 Nov 30;11(23):3843. doi: 10.3390/cells11233843.
7
Hypertension as a risk factor for atherosclerosis: Cardiovascular risk assessment.高血压作为动脉粥样硬化的危险因素:心血管风险评估。
Front Cardiovasc Med. 2022 Aug 22;9:959285. doi: 10.3389/fcvm.2022.959285. eCollection 2022.
8
Health Disparities Across the Continuum of ASCVD Risk.动脉粥样硬化性心血管疾病风险连续谱中的健康差异。
Curr Cardiol Rep. 2022 Sep;24(9):1129-1137. doi: 10.1007/s11886-022-01736-y. Epub 2022 Jul 5.
9
Atherosclerotic cardiovascular disease risk assessment: An American Society for Preventive Cardiology clinical practice statement.动脉粥样硬化性心血管疾病风险评估:美国预防心脏病学会临床实践声明
Am J Prev Cardiol. 2022 Mar 15;10:100335. doi: 10.1016/j.ajpc.2022.100335. eCollection 2022 Jun.
10
Pathophysiology of Atherosclerosis.动脉粥样硬化的病理生理学。
Int J Mol Sci. 2022 Mar 20;23(6):3346. doi: 10.3390/ijms23063346.