当预测高血压时，形状附加值可以有效地在机器学习中可视化相关协变量。

Shapely additive values can effectively visualize pertinent covariates in machine learning when predicting hypertension.

机构信息

Cornell University, New York, USA.

Northwestern University Feinberg School of Medicine, Chicago, USA.

出版信息

J Clin Hypertens (Greenwich). 2023 Dec;25(12):1135-1144. doi: 10.1111/jch.14745. Epub 2023 Nov 16.

DOI:10.1111/jch.14745

PMID:37971610

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10710553/

Abstract

Machine learning methods are widely used within the medical field to enhance prediction. However, little is known about the reliability and efficacy of these models to predict long-term medical outcomes such as blood pressure using lifestyle factors, such as diet. The authors assessed whether machine-learning techniques could accurately predict hypertension risk using nutritional information. A cross-sectional study using data from the National Health and Nutrition Examination Survey (NHANES) between January 2017 and March 2020. XGBoost was used as the machine-learning model of choice in this study due to its increased performance relative to other common methods within medical studies. Model prediction metrics (e.g., AUROC, Balanced Accuracy) were used to measure overall model efficacy, covariate Gain statistics (percentage each covariate contributes to the overall prediction) and SHapely Additive exPlanations (SHAP, method to visualize each covariate) were used to provide explanations to machine-learning output and increase the transparency of this otherwise cryptic method. Of a total of 9650 eligible patients, the mean age was 41.02 (SD = 22.16), 4792 (50%) males, 4858 (50%) female, 3407 (35%) White patients, 2567 (27%) Black patients, 2108 (22%) Hispanic patients, and 981 (10%) Asian patients. From evaluation of model gain statistics, age was found to be the single strongest predictor of hypertension, with a gain of 53.1%. Additionally, demographic factors such as poverty and Black race were also strong predictors of hypertension, with gain of 4.33% and 4.18%, respectively. Nutritional Covariates contributed 37% to the overall prediction: Sodium, Caffeine, Potassium, and Alcohol intake being significantly represented within the model. Machine Learning can be used to predict hypertension.

摘要

机器学习方法在医学领域被广泛用于增强预测。然而，对于这些模型使用生活方式因素（如饮食）来预测长期医疗结果（如血压）的可靠性和效果知之甚少。作者评估了机器学习技术是否可以使用营养信息准确预测高血压风险。这是一项使用 2017 年 1 月至 2020 年 3 月期间国家健康和营养检查调查（NHANES）数据的横断面研究。由于 XGBoost 在医学研究中相对于其他常见方法具有更高的性能，因此它被用作本研究中的机器学习模型选择。模型预测指标（例如 AUROC、平衡准确性）用于衡量整体模型效果，协变量增益统计数据（每个协变量对整体预测的贡献百分比）和 Shapely Additive exPlanations（SHAP，用于可视化每个协变量的方法）用于为机器学习输出提供解释，并增加该方法的透明度，因为该方法本来是隐晦的。在总共 9650 名合格患者中，平均年龄为 41.02（SD=22.16），男性 4792 人（50%），女性 4858 人（50%），白人 3407 人（35%），黑人 2567 人（27%），西班牙裔 2108 人（22%），亚裔 981 人（10%）。从模型增益统计数据的评估来看，年龄是高血压的单一最强预测因素，增益为 53.1%。此外，贫困和黑人种族等人口统计学因素也是高血压的强预测因素，增益分别为 4.33%和 4.18%。营养协变量对整体预测的贡献为 37%：钠、咖啡因、钾和酒精摄入量在模型中得到了显著体现。机器学习可用于预测高血压。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8f06/10710553/f8630676a2c6/JCH-25-1135-g003.jpg

相似文献

Shapely additive values can effectively visualize pertinent covariates in machine learning when predicting hypertension.

J Clin Hypertens (Greenwich). 2023 Dec;25(12):1135-1144. doi: 10.1111/jch.14745. Epub 2023 Nov 16.

Increasing transparency in machine learning through bootstrap simulation and shapely additive explanations.

PLoS One. 2023 Feb 23;18(2):e0281922. doi: 10.1371/journal.pone.0281922. eCollection 2023.

Use of machine learning to identify risk factors for insomnia.

PLoS One. 2023 Apr 12;18(4):e0282622. doi: 10.1371/journal.pone.0282622. eCollection 2023.

Use of machine learning to identify risk factors for coronary artery disease.

PLoS One. 2023 Apr 14;18(4):e0284103. doi: 10.1371/journal.pone.0284103. eCollection 2023.

Exploring Depression and Nutritional Covariates Amongst US Adults using Shapely Additive Explanations.

Health Sci Rep. 2023 Oct 20;6(10):e1635. doi: 10.1002/hsr2.1635. eCollection 2023 Oct.

Application of a transparent artificial intelligence algorithm for US adults in the obese category of weight.

PLoS One. 2024 May 31;19(5):e0304509. doi: 10.1371/journal.pone.0304509. eCollection 2024.

Machine-learning Models Predict 30-Day Mortality, Cardiovascular Complications, and Respiratory Complications After Aseptic Revision Total Joint Arthroplasty.

Clin Orthop Relat Res. 2022 Nov 1;480(11):2137-2145. doi: 10.1097/CORR.0000000000002276. Epub 2022 Jun 20.

Use of feature importance statistics to accurately predict asthma attacks using machine learning: A cross-sectional cohort study of the US population.

PLoS One. 2023 Nov 22;18(11):e0288903. doi: 10.1371/journal.pone.0288903. eCollection 2023.

Machine learning algorithms identify hypokalaemia risk in people with hypertension in the United States National Health and Nutrition Examination Survey 1999-2018.

Ann Med. 2023 Dec;55(1):2209336. doi: 10.1080/07853890.2023.2209336.

Comparison of model feature importance statistics to identify covariates that contribute most to model accuracy in prediction of insomnia.

PLoS One. 2024 Jul 2;19(7):e0306359. doi: 10.1371/journal.pone.0306359. eCollection 2024.

引用本文的文献

Predicting survival factor following suicide attempt in Iran: an ensemble machine learning technique.

BMC Psychiatry. 2025 Aug 28;25(1):833. doi: 10.1186/s12888-025-07241-0.

Construction and validation of a risk prediction model for chronic obstructive pulmonary disease (COPD): a cross-sectional study based on the NHANES database from 2009 to 2018.

BMC Pulm Med. 2025 Jul 3;25(1):317. doi: 10.1186/s12890-025-03776-w.

The application of suitable sports games for junior high school students based on deep learning and artificial intelligence.

Sci Rep. 2025 May 16;15(1):17056. doi: 10.1038/s41598-025-01941-z.

Negative association between body roundness index and constipation: insights from NHANES.

J Health Popul Nutr. 2025 May 9;44(1):149. doi: 10.1186/s41043-025-00886-3.

Association between red blood cell distribution width-to-albumin ratio and depression: a cross-sectional analysis among US adults, 2011-2018.

BMC Psychiatry. 2025 May 7;25(1):464. doi: 10.1186/s12888-025-06907-z.

Implementation of an Integrated, Clinical Decision Support Tool at the Point of Antihypertensive Medication Refill Request to Improve Hypertension Management: Controlled Pre-Post Study.

JMIR Med Inform. 2025 Apr 11;13:e70752. doi: 10.2196/70752.

Prevalence of metabolic syndrome in patients with inflammatory bowel disease: a meta-analysis on a global scale.

J Health Popul Nutr. 2025 Apr 9;44(1):112. doi: 10.1186/s41043-025-00860-z.

Association of dietary calcium intake with chronic bronchitis and emphysema.

J Health Popul Nutr. 2025 Apr 2;44(1):102. doi: 10.1186/s41043-025-00843-0.

Development and validation of a machine learning model for online predicting the risk of in heart failure: based on the routine blood test and their derived parameters.

Front Cardiovasc Med. 2025 Mar 17;12:1539966. doi: 10.3389/fcvm.2025.1539966. eCollection 2025.

Machine learning analysis of emerging risk factors for early-onset hypertension in the Tlalpan 2020 cohort.

Front Cardiovasc Med. 2025 Jan 17;11:1434418. doi: 10.3389/fcvm.2024.1434418. eCollection 2024.

本文引用的文献

Dendrogram of transparent feature importance machine learning statistics to classify associations for heart failure: A reanalysis of a retrospective cohort study of the Medical Information Mart for Intensive Care III (MIMIC-III) database.

PLoS One. 2023 Jul 20;18(7):e0288819. doi: 10.1371/journal.pone.0288819. eCollection 2023.

Computation of the distribution of model accuracy statistics in machine learning: Comparison between analytically derived distributions and simulation-based methods.

Health Sci Rep. 2023 Apr 20;6(4):e1214. doi: 10.1002/hsr2.1214. eCollection 2023 Apr.

Sedentary Behavioral Studies of Young and Middle-Aged Adults with Hypertension in the Framework of Behavioral Epidemiology: A Scoping Review.

Int J Environ Res Public Health. 2022 Dec 14;19(24):16796. doi: 10.3390/ijerph192416796.

Hypertension in Dialysis Patients: Diagnostic Approaches and Evaluation of Epidemiology.

Diagnostics (Basel). 2022 Nov 26;12(12):2961. doi: 10.3390/diagnostics12122961.

Prediction model of obstructive sleep apnea-related hypertension: Machine learning-based development and interpretation study.

Front Cardiovasc Med. 2022 Dec 5;9:1042996. doi: 10.3389/fcvm.2022.1042996. eCollection 2022.

Geospatial epidemiology of hypertension and its risk factors in India: Findings from National Family Health Survey (2015-2016).

J Family Med Prim Care. 2022 Sep;11(9):5730-5737. doi: 10.4103/jfmpc.jfmpc_174_22. Epub 2022 Oct 14.

Effects of Marital Status and Income on Hypertension: The Korean Genome and Epidemiology Study (KoGES).

J Prev Med Public Health. 2022 Nov;55(6):506-519. doi: 10.3961/jpmph.22.264. Epub 2022 Oct 7.

The Use of Machine Learning for the Care of Hypertension and Heart Failure.

JACC Asia. 2021 Sep 21;1(2):162-172. doi: 10.1016/j.jacasi.2021.07.005. eCollection 2021 Sep.

Interpretable machine learning for 28-day all-cause in-hospital mortality prediction in critically ill patients with heart failure combined with hypertension: A retrospective cohort study based on medical information mart for intensive care database-IV and eICU databases.

Front Cardiovasc Med. 2022 Oct 12;9:994359. doi: 10.3389/fcvm.2022.994359. eCollection 2022.

Evaluating the risk of hypertension in residents in primary care in Shanghai, China with machine learning algorithms.

Front Public Health. 2022 Oct 4;10:984621. doi: 10.3389/fpubh.2022.984621. eCollection 2022.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

当预测高血压时，形状附加值可以有效地在机器学习中可视化相关协变量。

Shapely additive values can effectively visualize pertinent covariates in machine learning when predicting hypertension.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献