• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于随机森林的非侵入性参数预测代谢综合征的数据平衡方法的效果。

The effect of data balancing approaches on the prediction of metabolic syndrome using non-invasive parameters based on random forest.

机构信息

School of Public Health, Bam University of Medical Sciences, Bam, Iran.

Research Center for Food Hygiene and Safety, School of Public Health, Shahid Sadoughi University of Medical Sciences, Yazd, Iran.

出版信息

BMC Bioinformatics. 2024 Jan 11;25(1):18. doi: 10.1186/s12859-024-05633-9.

DOI:10.1186/s12859-024-05633-9
PMID:38212697
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10782700/
Abstract

BACKGROUND

Metabolic syndrome (MetS) is a cluster of metabolic abnormalities (including obesity, insulin resistance, hypertension, and dyslipidemia), which can be used to identify at-risk populations for diabetes and cardiovascular diseases, the main causes of morbidity and mortality worldwide. The achievement of a simple approach for diagnosing MetS without needing biochemical tests is so valuable. The present study aimed to predict MetS using non-invasive features based on a successful random forest learning algorithm. Also, to deal with the problem of data imbalance that naturally exists in this type of data, the effect of two different data balancing approaches, including the Synthetic Minority Over-sampling Technique (SMOTE) and Random Splitting data balancing (SplitBal), on model performance is investigated.

RESULTS

The most important determinant for MetS prediction was waist circumference. Applying a random forest learning algorithm to imbalanced data, the trained models reach 86.9% and 79.4% accuracies and 37.1% and 38.2% sensitivities in men and women, respectively. However, by applying the SplitBal data balancing technique, the best results were obtained, and despite that the accuracy of the trained models decreased by 7.8% and 11.3%, but their sensitivity improved significantly to 82.3% and 73.7% in men and women, respectively.

CONCLUSIONS

The random forest learning method, along with data balancing techniques, especially SplitBal, could create MetS prediction models with promising results that can be applied as a useful prognostic tool in health screening programs.

摘要

背景

代谢综合征(MetS)是一组代谢异常(包括肥胖、胰岛素抵抗、高血压和血脂异常),可用于识别糖尿病和心血管疾病的高危人群,这是全球发病率和死亡率的主要原因。实现一种无需生化检测即可诊断 MetS 的简单方法非常有价值。本研究旨在使用基于成功随机森林学习算法的无创特征来预测 MetS。此外,为了解决此类数据中存在的固有数据不平衡问题,研究了两种不同的数据平衡方法,包括合成少数过采样技术(SMOTE)和随机分割数据平衡(SplitBal),对模型性能的影响。

结果

MetS 预测的最重要决定因素是腰围。应用随机森林学习算法对不平衡数据进行处理,训练后的模型在男性和女性中的准确率分别达到 86.9%和 79.4%,灵敏度分别达到 37.1%和 38.2%。然而,通过应用 SplitBal 数据平衡技术,可以获得最佳结果,尽管训练模型的准确率分别下降了 7.8%和 11.3%,但它们的灵敏度分别显著提高到男性和女性的 82.3%和 73.7%。

结论

随机森林学习方法结合数据平衡技术,特别是 SplitBal,可以创建具有有前途结果的 MetS 预测模型,可作为健康筛查计划中的有用预后工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b423/10782700/f95a6c01c490/12859_2024_5633_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b423/10782700/ecd8dae02863/12859_2024_5633_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b423/10782700/7f314378d1d0/12859_2024_5633_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b423/10782700/f95a6c01c490/12859_2024_5633_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b423/10782700/ecd8dae02863/12859_2024_5633_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b423/10782700/7f314378d1d0/12859_2024_5633_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b423/10782700/f95a6c01c490/12859_2024_5633_Fig3_HTML.jpg

相似文献

1
The effect of data balancing approaches on the prediction of metabolic syndrome using non-invasive parameters based on random forest.基于随机森林的非侵入性参数预测代谢综合征的数据平衡方法的效果。
BMC Bioinformatics. 2024 Jan 11;25(1):18. doi: 10.1186/s12859-024-05633-9.
2
Prediction of metabolic and pre-metabolic syndromes using machine learning models with anthropometric, lifestyle, and biochemical factors from a middle-aged population in Korea.使用来自韩国中年人群的人体测量学、生活方式和生化因素的机器学习模型预测代谢和前代谢综合征。
BMC Public Health. 2022 Apr 6;22(1):664. doi: 10.1186/s12889-022-13131-x.
3
Metabolic syndrome prediction using non-invasive and dietary parameters based on a support vector machine.基于支持向量机的无创和饮食参数对代谢综合征的预测。
Nutr Metab Cardiovasc Dis. 2024 Jan;34(1):126-135. doi: 10.1016/j.numecd.2023.08.018. Epub 2023 Sep 9.
4
Sleep Quality, Nutrient Intake, and Social Development Index Predict Metabolic Syndrome in the Tlalpan 2020 Cohort: A Machine Learning and Synthetic Data Study.睡眠质量、营养摄入与社会发展指数预测特拉尔潘2020队列中的代谢综合征:一项机器学习与合成数据研究
Nutrients. 2024 Feb 23;16(5):612. doi: 10.3390/nu16050612.
5
Stroke Prediction with Machine Learning Methods among Older Chinese.基于机器学习方法对中国老年人进行中风预测。
Int J Environ Res Public Health. 2020 Mar 12;17(6):1828. doi: 10.3390/ijerph17061828.
6
Joint modeling strategy for using electronic medical records data to build machine learning models: an example of intracerebral hemorrhage.利用电子病历数据构建机器学习模型的联合建模策略:以脑出血为例。
BMC Med Inform Decis Mak. 2022 Oct 25;22(1):278. doi: 10.1186/s12911-022-02018-x.
7
A hybrid sampling algorithm combining synthetic minority over-sampling technique and edited nearest neighbor for missed abortion diagnosis.一种结合合成少数过采样技术和编辑最近邻的混合采样算法,用于诊断漏诊的流产。
BMC Med Inform Decis Mak. 2022 Dec 29;22(1):344. doi: 10.1186/s12911-022-02075-2.
8
Age-associated increase in abdominal obesity and insulin resistance, and usefulness of AHA/NHLBI definition of metabolic syndrome for predicting cardiovascular disease in Japanese elderly with type 2 diabetes mellitus.年龄相关的腹型肥胖和胰岛素抵抗增加,以及 AHA/NHLBI 代谢综合征定义在预测日本老年 2 型糖尿病患者心血管疾病中的作用。
Gerontology. 2010;56(2):141-9. doi: 10.1159/000246970. Epub 2009 Oct 10.
9
Metabolic syndrome in nondiabetic, obese, first-degree relatives of African American patients with type 2 diabetes: African American triglycerides-HDL-C and insulin resistance paradox.非糖尿病、肥胖的非裔美国2型糖尿病患者一级亲属中的代谢综合征:非裔美国人甘油三酯-高密度脂蛋白胆固醇与胰岛素抵抗悖论
Ethn Dis. 2006 Autumn;16(4):830-6.
10
Assessment of eight insulin resistance surrogate indexes for predicting metabolic syndrome and hypertension in Thai law enforcement officers.评估八种胰岛素抵抗替代指标在预测泰国执法人员代谢综合征和高血压中的作用。
PeerJ. 2023 May 29;11:e15463. doi: 10.7717/peerj.15463. eCollection 2023.

引用本文的文献

1
Machine Learning Framework for Ovarian Cancer Diagnostics Using Plasma Lipidomics and Metabolomics.基于血浆脂质组学和代谢组学的卵巢癌诊断机器学习框架
Int J Mol Sci. 2025 Jul 10;26(14):6630. doi: 10.3390/ijms26146630.
2
Development of a Predictive Model for Metabolic Syndrome Using Noninvasive Data and its Cardiovascular Disease Risk Assessments: Multicohort Validation Study.使用非侵入性数据开发代谢综合征预测模型及其心血管疾病风险评估:多队列验证研究
J Med Internet Res. 2025 May 2;27:e67525. doi: 10.2196/67525.

本文引用的文献

1
Prediction of metabolic and pre-metabolic syndromes using machine learning models with anthropometric, lifestyle, and biochemical factors from a middle-aged population in Korea.使用来自韩国中年人群的人体测量学、生活方式和生化因素的机器学习模型预测代谢和前代谢综合征。
BMC Public Health. 2022 Apr 6;22(1):664. doi: 10.1186/s12889-022-13131-x.
2
Metabolic Syndrome Prediction Models Using Machine Learning and Sasang Constitution Type.使用机器学习和四象体质类型的代谢综合征预测模型
Evid Based Complement Alternat Med. 2021 Feb 8;2021:8315047. doi: 10.1155/2021/8315047. eCollection 2021.
3
Modeling of diagnosis for metabolic syndrome by integrating symptoms into physiochemical indexes.
将症状整合到生理化学指标中对代谢综合征进行诊断建模。
Biomed Pharmacother. 2021 May;137:111367. doi: 10.1016/j.biopha.2021.111367. Epub 2021 Feb 13.
4
Methods for algorithmic diagnosis of metabolic syndrome.代谢综合征的算法诊断方法。
Artif Intell Med. 2019 Nov;101:101708. doi: 10.1016/j.artmed.2019.101708. Epub 2019 Oct 15.
5
The metabolic syndrome - What is it and how should it be managed?代谢综合征——它是什么,应该如何管理?
Eur J Prev Cardiol. 2019 Dec;26(2_suppl):33-46. doi: 10.1177/2047487319886404.
6
Machine Learning in Medicine.医学中的机器学习
N Engl J Med. 2019 Apr 4;380(14):1347-1358. doi: 10.1056/NEJMra1814259.
7
Metabolic Syndrome Prediction Using Machine Learning Models with Genetic and Clinical Information from a Nonobese Healthy Population.使用具有来自非肥胖健康人群的遗传和临床信息的机器学习模型预测代谢综合征
Genomics Inform. 2018 Dec;16(4):e31. doi: 10.5808/GI.2018.16.4.e31. Epub 2018 Dec 28.
8
Big Data's Role in Precision Public Health.大数据在精准公共卫生中的作用。
Front Public Health. 2018 Mar 7;6:68. doi: 10.3389/fpubh.2018.00068. eCollection 2018.
9
The Global Epidemic of the Metabolic Syndrome.代谢综合征的全球流行。
Curr Hypertens Rep. 2018 Feb 26;20(2):12. doi: 10.1007/s11906-018-0812-z.
10
Prediction models to identify individuals at risk of metabolic syndrome who are unlikely to participate in a health intervention program.预测模型以识别有代谢综合征风险但不太可能参与健康干预计划的个体。
Int J Med Inform. 2018 Mar;111:90-99. doi: 10.1016/j.ijmedinf.2017.12.009. Epub 2017 Dec 30.