基于若干易于收集的风险因素预测高血压风险：一种机器学习方法。

Predicting the Risk of Hypertension Based on Several Easy-to-Collect Risk Factors: A Machine Learning Method.

机构信息

Institute of Intelligent Machines, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, China.

Science Island Branch of Graduate School, University of Science and Technology of China, Hefei, China.

出版信息

Front Public Health. 2021 Sep 24;9:619429. doi: 10.3389/fpubh.2021.619429. eCollection 2021.

DOI:10.3389/fpubh.2021.619429

PMID:34631636

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8497705/

Abstract

Hypertension is a widespread chronic disease. Risk prediction of hypertension is an intervention that contributes to the early prevention and management of hypertension. The implementation of such intervention requires an effective and easy-to-implement hypertension risk prediction model. This study evaluated and compared the performance of four machine learning algorithms on predicting the risk of hypertension based on easy-to-collect risk factors. A dataset of 29,700 samples collected through a physical examination was used for model training and testing. Firstly, we identified easy-to-collect risk factors of hypertension, through univariate logistic regression analysis. Then, based on the selected features, 10-fold cross-validation was utilized to optimize four models, random forest (RF), CatBoost, MLP neural network and logistic regression (LR), to find the best hyper-parameters on the training set. Finally, the performance of models was evaluated by AUC, accuracy, sensitivity and specificity on the test set. The experimental results showed that the RF model outperformed the other three models, and achieved an AUC of 0.92, an accuracy of 0.82, a sensitivity of 0.83 and a specificity of 0.81. In addition, Body Mass Index (BMI), age, family history and waist circumference (WC) are the four primary risk factors of hypertension. These findings reveal that it is feasible to use machine learning algorithms, especially RF, to predict hypertension risk without clinical or genetic data. The technique can provide a non-invasive and economical way for the prevention and management of hypertension in a large population.

摘要

高血压是一种广泛存在的慢性疾病。高血压风险预测是一种干预措施，可以促进高血压的早期预防和管理。实施这种干预措施需要一个有效且易于实施的高血压风险预测模型。本研究评估和比较了四种机器学习算法在基于易于收集的风险因素预测高血压风险方面的性能。通过体检收集了 29700 个样本的数据集用于模型训练和测试。首先，我们通过单变量逻辑回归分析确定了高血压的易于收集的风险因素。然后，基于选定的特征，我们使用 10 折交叉验证来优化四个模型，随机森林（RF）、CatBoost、MLP 神经网络和逻辑回归（LR），以在训练集上找到最佳的超参数。最后，我们通过 AUC、准确性、敏感性和特异性在测试集上评估模型的性能。实验结果表明，RF 模型优于其他三个模型，其 AUC 为 0.92，准确性为 0.82，敏感性为 0.83，特异性为 0.81。此外，体重指数（BMI）、年龄、家族史和腰围（WC）是高血压的四个主要风险因素。这些发现表明，使用机器学习算法，特别是 RF，在没有临床或遗传数据的情况下预测高血压风险是可行的。该技术可以为大规模人群的高血压预防和管理提供一种非侵入性和经济的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/375d/8497705/46e03c3a3013/fpubh-09-619429-g0001.jpg

相似文献

Predicting the Risk of Hypertension Based on Several Easy-to-Collect Risk Factors: A Machine Learning Method.

Front Public Health. 2021 Sep 24;9:619429. doi: 10.3389/fpubh.2021.619429. eCollection 2021.

Optimizing neural networks for medical data sets: A case study on neonatal apnea prediction.

Artif Intell Med. 2019 Jul;98:59-76. doi: 10.1016/j.artmed.2019.07.008. Epub 2019 Jul 25.

Evaluating the risk of hypertension in residents in primary care in Shanghai, China with machine learning algorithms.

Front Public Health. 2022 Oct 4;10:984621. doi: 10.3389/fpubh.2022.984621. eCollection 2022.

Patient-Level Prediction of Cardio-Cerebrovascular Events in Hypertension Using Nationwide Claims Data.

J Med Internet Res. 2019 Feb 15;21(2):e11757. doi: 10.2196/11757.

Predicting post-stroke pneumonia using deep neural network approaches.

Int J Med Inform. 2019 Dec;132:103986. doi: 10.1016/j.ijmedinf.2019.103986. Epub 2019 Oct 1.

Use of Non-invasive Parameters and Machine-Learning Algorithms for Predicting Future Risk of Type 2 Diabetes: A Retrospective Cohort Study of Health Data From Kuwait.

Front Endocrinol (Lausanne). 2019 Sep 11;10:624. doi: 10.3389/fendo.2019.00624. eCollection 2019.

Machine learning algorithms to predict early pregnancy loss after in vitro fertilization-embryo transfer with fetal heart rate as a strong predictor.

Comput Methods Programs Biomed. 2020 Nov;196:105624. doi: 10.1016/j.cmpb.2020.105624. Epub 2020 Jun 25.

Development and validation of machine learning prediction model based on computed tomography angiography-derived hemodynamics for rupture status of intracranial aneurysms: a Chinese multicenter study.

Eur Radiol. 2020 Sep;30(9):5170-5182. doi: 10.1007/s00330-020-06886-7. Epub 2020 Apr 29.

Machine-learning prediction of adolescent alcohol use: a cross-study, cross-cultural validation.

Addiction. 2019 Apr;114(4):662-671. doi: 10.1111/add.14504. Epub 2018 Dec 21.

Evaluating the performance of machine learning methods and variable selection methods for predicting difficult-to-measure traits in Holstein dairy cattle using milk infrared spectral data.

J Dairy Sci. 2021 Jul;104(7):8107-8121. doi: 10.3168/jds.2020-19861. Epub 2021 Apr 15.

引用本文的文献

Screening hypertension using non-laboratory risk factors with machine learning: a retrospective cross-sectional study in Indonesia.

BMJ Open. 2025 Aug 27;15(8):e092364. doi: 10.1136/bmjopen-2024-092364.

A machine learning approach to predict hypertension using cross-sectional & two years follow up data from a health & demographic cohort of Assam, North East India.

Indian J Med Res. 2025 Apr;161(4):394-405. doi: 10.25259/IJMR_881_2024.

Temporal analysis of non-communicable diseases and NCD-HIV/AIDS comorbidity in Malawi: A 4-year retrospective study 2020-2022.

Trop Med Int Health. 2025 Aug;30(8):782-800. doi: 10.1111/tmi.14134. Epub 2025 Jun 3.

Development and evaluation of a machine learning model for osteoporosis risk prediction in Korean women.

BMC Womens Health. 2025 Mar 28;25(1):146. doi: 10.1186/s12905-025-03669-4.

Optimizing hypertension prediction using ensemble learning approaches.

PLoS One. 2024 Dec 23;19(12):e0315865. doi: 10.1371/journal.pone.0315865. eCollection 2024.

Next-visit prediction and prevention of hypertension using large-scale routine health checkup data.

PLoS One. 2024 Nov 13;19(11):e0313658. doi: 10.1371/journal.pone.0313658. eCollection 2024.

Environmental chemical exposures and a machine learning-based model for predicting hypertension in NHANES 2003-2016.

BMC Cardiovasc Disord. 2024 Oct 9;24(1):544. doi: 10.1186/s12872-024-04216-z.

Bilateral Matching Method for Business Resources Based on Synergy Effects and Incomplete Data.

Entropy (Basel). 2024 Aug 6;26(8):669. doi: 10.3390/e26080669.

Transforming Healthcare: The AI Revolution in the Comprehensive Care of Hypertension.

Clin Pract. 2024 Jul 10;14(4):1357-1374. doi: 10.3390/clinpract14040109.

Transforming Hypertension Diagnosis and Management in The Era of Artificial Intelligence: A 2023 National Heart, Lung, and Blood Institute (NHLBI) Workshop Report.

Hypertension. 2025 Jan;82(1):36-45. doi: 10.1161/HYPERTENSIONAHA.124.22095. Epub 2024 Jul 16.

本文引用的文献

Predicting Breast Cancer in Chinese Women Using Machine Learning Techniques: Algorithm Development.

JMIR Med Inform. 2020 Jun 8;8(6):e17364. doi: 10.2196/17364.

The relationship between obesity, diabetes, hypertension and vitamin D deficiency among Saudi Arabians aged 15 and over: results from the Saudi health interview survey.

BMC Endocr Disord. 2020 Jun 5;20(1):81. doi: 10.1186/s12902-020-00562-z.

Catastrophic health expenditure: a comparative study between hypertensive patients with and without complication in rural Shandong, China.

BMC Public Health. 2020 Apr 22;20(1):545. doi: 10.1186/s12889-020-08662-0.

Streamlining the KOOS Activities of Daily Living Subscale Using Machine Learning.

Orthop J Sports Med. 2020 Mar 24;8(3):2325967120910447. doi: 10.1177/2325967120910447. eCollection 2020 Mar.

A data-driven approach to predicting diabetes and cardiovascular disease with machine learning.

BMC Med Inform Decis Mak. 2019 Nov 6;19(1):211. doi: 10.1186/s12911-019-0918-5.

A need for the use of a standard protocol for waist circumference measurement across studies.

Diabetes Res Clin Pract. 2020 Mar;161:107908. doi: 10.1016/j.diabres.2019.107908. Epub 2019 Oct 31.

An evaluation of the impact of aggressive hypertension, diabetes and smoking cessation management on CVD outcomes at the population level: a dynamic simulation analysis.

BMC Public Health. 2019 Aug 14;19(1):1105. doi: 10.1186/s12889-019-7429-2.

Systolic and diastolic hypertension independently predict CVD risk.

Nat Rev Cardiol. 2019 Oct;16(10):578-579. doi: 10.1038/s41569-019-0248-4.

Combination of Healthy Lifestyle Factors on the Risk of Hypertension in a Large Cohort of French Adults.

Nutrients. 2019 Jul 23;11(7):1687. doi: 10.3390/nu11071687.

Age at menarche and prevention of hypertension through lifestyle in young Chinese adult women: result from project ELEFANT.

BMC Womens Health. 2018 Nov 9;18(1):182. doi: 10.1186/s12905-018-0677-y.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于若干易于收集的风险因素预测高血压风险：一种机器学习方法。

Predicting the Risk of Hypertension Based on Several Easy-to-Collect Risk Factors: A Machine Learning Method.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献