• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用集成学习方法优化高血压预测。

Optimizing hypertension prediction using ensemble learning approaches.

作者信息

Sifat Isteaq Kabir, Kibria Md Kaderi

机构信息

Department of Statistics, Hajee Mohammad Danesh Science and Technology University, Dinajpur, Bangladesh.

出版信息

PLoS One. 2024 Dec 23;19(12):e0315865. doi: 10.1371/journal.pone.0315865. eCollection 2024.

DOI:10.1371/journal.pone.0315865
PMID:39715219
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11666061/
Abstract

Hypertension (HTN) prediction is critical for effective preventive healthcare strategies. This study investigates how well ensemble learning techniques work to increase the accuracy of HTN prediction models. Utilizing a dataset of 612 participants from Ethiopia, which includes 27 features potentially associated with HTN risk, we aimed to enhance predictive performance over traditional single-model methods. A multi-faceted feature selection approach was employed, incorporating Boruta, Lasso Regression, Forward and Backward Selection, and Random Forest feature importance, and found 13 common features that were considered for prediction. Five machine learning (ML) models such as logistic regression (LR), artificial neural network (ANN), random forest (RF), extreme gradient boosting (XGB), light gradient boosting machine (LGBM), and a stacking ensemble model were trained using selected features to predict HTN. The models' performance on the testing set was evaluated using accuracy, precision, recall, F1-score, and area under the curve (AUC). Additionally, SHapley Additive exPlanations (SHAP) was utilized to examine the impact of individual features on the models' predictions and identify the most important risk factors for HTN. The stacking ensemble model emerged as the most effective approach for predicting HTN risk, achieving an accuracy of 96.32%, precision of 95.48%, recall of 97.51%, F1-score of 96.48%, and an AUC of 0.971. SHAP analysis of the stacking model identified weight, drinking habits, history of hypertension, salt intake, age, diabetes, BMI, and fat intake as the most significant and interpretable risk factors for HTN. Our results demonstrate significant advancements in predictive accuracy and robustness, highlighting the potential of ensemble learning as a pivotal tool in healthcare analytics. This research contributes to ongoing efforts to optimize HTN prediction models, ultimately supporting early intervention and personalized healthcare management.

摘要

高血压(HTN)预测对于有效的预防性医疗保健策略至关重要。本研究调查了集成学习技术在提高HTN预测模型准确性方面的效果。利用来自埃塞俄比亚的612名参与者的数据集,其中包括27个可能与HTN风险相关的特征,我们旨在提高预测性能,超越传统的单模型方法。采用了多方面的特征选择方法,包括Boruta、套索回归、向前和向后选择以及随机森林特征重要性,并确定了13个用于预测的共同特征。使用选定的特征训练了五个机器学习(ML)模型,如逻辑回归(LR)、人工神经网络(ANN)、随机森林(RF)、极端梯度提升(XGB)、轻梯度提升机(LGBM)以及一个堆叠集成模型来预测HTN。使用准确率、精确率、召回率、F1分数和曲线下面积(AUC)评估模型在测试集上的性能。此外,利用SHapley加性解释(SHAP)来检查单个特征对模型预测的影响,并确定HTN最重要的风险因素。堆叠集成模型成为预测HTN风险最有效的方法,准确率达到96.32%,精确率为95.48%,召回率为97.51%,F1分数为96.48%,AUC为0.971。对堆叠模型的SHAP分析确定体重、饮酒习惯、高血压病史、盐摄入量、年龄、糖尿病、BMI和脂肪摄入量是HTN最重要且可解释的风险因素。我们的结果表明在预测准确性和稳健性方面有显著进展,突出了集成学习作为医疗分析中关键工具的潜力。这项研究有助于正在进行的优化HTN预测模型的努力,最终支持早期干预和个性化医疗管理。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9928/11666061/1ed951e3dc54/pone.0315865.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9928/11666061/22b56f9d7c8d/pone.0315865.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9928/11666061/e9c3b47307e2/pone.0315865.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9928/11666061/1ed951e3dc54/pone.0315865.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9928/11666061/22b56f9d7c8d/pone.0315865.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9928/11666061/e9c3b47307e2/pone.0315865.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9928/11666061/1ed951e3dc54/pone.0315865.g003.jpg

相似文献

1
Optimizing hypertension prediction using ensemble learning approaches.使用集成学习方法优化高血压预测。
PLoS One. 2024 Dec 23;19(12):e0315865. doi: 10.1371/journal.pone.0315865. eCollection 2024.
2
Predicting the risk of hypertension using machine learning algorithms: A cross sectional study in Ethiopia.使用机器学习算法预测高血压风险:在埃塞俄比亚的一项横断面研究。
PLoS One. 2023 Aug 24;18(8):e0289613. doi: 10.1371/journal.pone.0289613. eCollection 2023.
3
Interpretable machine learning method to predict the risk of pre-diabetes using a national-wide cross-sectional data: evidence from CHNS.利用全国性横断面数据预测糖尿病前期风险的可解释机器学习方法:来自中国健康与营养调查的证据
BMC Public Health. 2025 Mar 26;25(1):1145. doi: 10.1186/s12889-025-22419-7.
4
Prediction and feature selection of low birth weight using machine learning algorithms.利用机器学习算法预测和选择低出生体重。
J Health Popul Nutr. 2024 Oct 12;43(1):157. doi: 10.1186/s41043-024-00647-8.
5
Predicting major adverse cardiac events in diabetes and chronic kidney disease: a machine learning study from the Silesia Diabetes-Heart Project.预测糖尿病和慢性肾脏病患者的主要不良心脏事件:西里西亚糖尿病-心脏项目的一项机器学习研究
Cardiovasc Diabetol. 2025 Feb 15;24(1):76. doi: 10.1186/s12933-025-02615-w.
6
A Risk Prediction Model for Physical Restraints Among Older Chinese Adults in Long-term Care Facilities: Machine Learning Study.长期护理机构中老年人身体约束的风险预测模型:机器学习研究。
J Med Internet Res. 2023 Apr 6;25:e43815. doi: 10.2196/43815.
7
Machine learning-based predictive models for perioperative major adverse cardiovascular events in patients with stable coronary artery disease undergoing noncardiac surgery.基于机器学习的预测模型用于接受非心脏手术的稳定冠状动脉疾病患者围手术期主要不良心血管事件的预测
Comput Methods Programs Biomed. 2025 Mar;260:108561. doi: 10.1016/j.cmpb.2024.108561. Epub 2024 Dec 13.
8
Interpretable machine learning for allergic rhinitis prediction among preschool children in Urumqi, China.中国乌鲁木齐学龄前儿童变应性鼻炎预测的可解释机器学习。
Sci Rep. 2024 Sep 27;14(1):22281. doi: 10.1038/s41598-024-73733-w.
9
Prediction of lateral lymph node metastasis with short diameter less than 8 mm in papillary thyroid carcinoma based on radiomics.基于放射组学的甲状腺乳头状癌短径小于 8mm 预测侧颈部淋巴结转移
Cancer Imaging. 2024 Nov 15;24(1):155. doi: 10.1186/s40644-024-00803-7.
10
Machine learning algorithms for predicting COVID-19 mortality in Ethiopia.用于预测埃塞俄比亚 COVID-19 死亡率的机器学习算法。
BMC Public Health. 2024 Jun 28;24(1):1728. doi: 10.1186/s12889-024-19196-0.

引用本文的文献

1
Hybrid feature selection framework for enhanced credit card fraud detection using machine learning models.使用机器学习模型增强信用卡欺诈检测的混合特征选择框架
PLoS One. 2025 Jul 16;20(7):e0326975. doi: 10.1371/journal.pone.0326975. eCollection 2025.
2
Prediction of Myopia Among Undergraduate Students Using Ensemble Machine Learning Techniques.使用集成机器学习技术预测本科生近视情况。
Health Sci Rep. 2025 May 26;8(5):e70874. doi: 10.1002/hsr2.70874. eCollection 2025 May.

本文引用的文献

1
Risk factors and prediction models for cardiovascular complications of hypertension in older adults with machine learning: A cross-sectional study.机器学习用于老年高血压患者心血管并发症的危险因素及预测模型:一项横断面研究。
Heliyon. 2024 Mar 10;10(6):e27941. doi: 10.1016/j.heliyon.2024.e27941. eCollection 2024 Mar 30.
2
Predicting hypertension control using machine learning.利用机器学习预测高血压控制情况。
PLoS One. 2024 Mar 20;19(3):e0299932. doi: 10.1371/journal.pone.0299932. eCollection 2024.
3
Predictive model for early detection of type 2 diabetes using patients' clinical symptoms, demographic features, and knowledge of diabetes.
利用患者临床症状、人口统计学特征及糖尿病知识对2型糖尿病进行早期检测的预测模型。
Health Sci Rep. 2024 Jan 25;7(1):e1834. doi: 10.1002/hsr2.1834. eCollection 2024 Jan.
4
Predicting the risk of hypertension using machine learning algorithms: A cross sectional study in Ethiopia.使用机器学习算法预测高血压风险:在埃塞俄比亚的一项横断面研究。
PLoS One. 2023 Aug 24;18(8):e0289613. doi: 10.1371/journal.pone.0289613. eCollection 2023.
5
Hypertension control rate in India: systematic review and meta-analysis of population-level non-interventional studies, 2001-2022.印度的高血压控制率:2001年至2022年人群水平非干预性研究的系统评价与荟萃分析
Lancet Reg Health Southeast Asia. 2022 Nov 23;9:100113. doi: 10.1016/j.lansea.2022.100113. eCollection 2023 Feb.
6
Prevalence and Associated Factors of Hypertension Among Adults in Gurage Zone, Southwest Ethiopia, 2022.2022年埃塞俄比亚西南部古拉格地区成年人高血压患病率及相关因素
SAGE Open Nurs. 2023 Feb 6;9:23779608231153473. doi: 10.1177/23779608231153473. eCollection 2023 Jan-Dec.
7
A Review of Feature Selection Methods for Machine Learning-Based Disease Risk Prediction.基于机器学习的疾病风险预测的特征选择方法综述
Front Bioinform. 2022 Jun 27;2:927312. doi: 10.3389/fbinf.2022.927312. eCollection 2022.
8
Body mass index, body fat percentage, and visceral fat as mediators in the association between health literacy and hypertension among residents living in rural and suburban areas.在农村和郊区居民中,身体质量指数、体脂百分比和内脏脂肪作为健康素养与高血压之间关联的中介因素。
Front Med (Lausanne). 2022 Sep 6;9:877013. doi: 10.3389/fmed.2022.877013. eCollection 2022.
9
A personal history of research on hypertension From an encounter with hypertension to the development of hypertension practice based on out-of-clinic blood pressure measurements.高血压研究的个人史 从偶然遇到高血压到基于诊室外血压测量的高血压实践发展。
Hypertens Res. 2022 Nov;45(11):1726-1742. doi: 10.1038/s41440-022-01011-1. Epub 2022 Sep 8.
10
Prevalence and associated factors of hypertension among adult patients attending the outpatient department at the primary hospitals of Wolkait tegedie zone, Northwest Ethiopia.埃塞俄比亚西北部沃尔凯泰杰迪区基层医院门诊成年患者高血压的患病率及相关因素
Front Neurol. 2022 Aug 9;13:943595. doi: 10.3389/fneur.2022.943595. eCollection 2022.