

On the interpretability of machine learning-based model for predicting hypertension.

Affiliations

Data Systems Group, Institute of Computer Science, University of Tartu, 2 J. Liivi St., 50409, Tartu, Estonia.

Houston Methodist DeBakey Heart & Vascular Center, Houston, TX, USA.

Publication information

BMC Med Inform Decis Mak. 2019 Jul 29;19(1):146. doi: 10.1186/s12911-019-0874-0.

Abstract

BACKGROUND

Although complex machine learning models commonly outperform traditional, simpler interpretable models, clinicians find these complex models hard to understand and trust because their predictions lack intuition and explanation. The aim of this study is to demonstrate the utility of various model-agnostic explanation techniques for machine learning models, using a case study that analyzes the outcomes of a random forest model for predicting which individuals are at risk of developing hypertension based on cardiorespiratory fitness data.

METHODS

The dataset used in this study contains information on 23,095 patients who underwent clinician-referred exercise treadmill stress testing at Henry Ford Health Systems between 1991 and 2009 and had a complete 10-year follow-up. Five global interpretability techniques (Feature Importance, Partial Dependence Plot, Individual Conditional Expectation, Feature Interaction, Global Surrogate Models) and two local interpretability techniques (Local Surrogate Models, Shapley Value) were applied to demonstrate how interpretability techniques can help clinical staff better understand, and place more trust in, the outcomes of machine learning-based predictions.
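To make the first of the listed techniques concrete, the following sketch computes permutation feature importance in pure Python. Everything here is illustrative: a fixed linear scoring function stands in for the paper's trained random forest, and the feature names (`age`, `resting_bp`, `mets`) and synthetic cohort are assumptions, not the study's actual data or variables. The idea carries over unchanged: shuffle one feature column at a time and measure how much the model's error grows.

```python
import random
from statistics import mean

# Toy stand-in for a trained risk model over three hypothetical features.
def model(age, resting_bp, mets):
    return 0.04 * age + 0.02 * resting_bp - 0.05 * mets

# Small synthetic cohort: feature rows plus noisy observed targets.
random.seed(0)
data = [(random.uniform(30, 70), random.uniform(100, 160), random.uniform(5, 14))
        for _ in range(200)]
y = [model(*row) + random.gauss(0, 0.01) for row in data]

def mse(rows, targets):
    return mean((model(*r) - t) ** 2 for r, t in zip(rows, targets))

baseline = mse(data, y)

def permutation_importance(col):
    # Shuffle a single feature column, breaking its link to the target,
    # and report the resulting increase in error over the baseline.
    shuffled = [row[col] for row in data]
    random.shuffle(shuffled)
    permuted = [tuple(shuffled[k] if i == col else v for i, v in enumerate(row))
                for k, row in enumerate(data)]
    return mse(permuted, y) - baseline

for name, col in [("age", 0), ("resting_bp", 1), ("mets", 2)]:
    print(f"{name}: {permutation_importance(col):.4f}")
```

A feature whose shuffling barely moves the error contributes little to the model's predictions; in practice one averages the increase over several shuffles to reduce variance.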

RESULTS

Several experiments were conducted and are reported. The results show that different interpretability techniques shed light on different aspects of model behavior: global interpretations enable clinicians to understand the entire conditional distribution modeled by the trained response function, whereas local interpretations promote understanding of small parts of that conditional distribution for specific instances.
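The local side of this contrast can be illustrated with exact Shapley values for a single prediction. This is a minimal sketch, not the study's implementation: the model, the background (reference) values, and the patient instance below are all assumed for illustration. Because there are only three features, every feature ordering can be enumerated directly; each feature's Shapley value is its marginal contribution averaged over all orderings, with features outside the current coalition held at background values.

```python
from itertools import permutations
from statistics import mean

# Illustrative model and data (assumptions, not taken from the study).
def model(x):
    return 0.04 * x[0] + 0.02 * x[1] - 0.05 * x[2]

background = [50.0, 130.0, 9.0]   # reference values for "absent" features
instance = [62.0, 150.0, 6.0]     # the single prediction being explained

def value(coalition):
    # Model output with features outside the coalition set to background.
    x = [instance[i] if i in coalition else background[i] for i in range(3)]
    return model(x)

def shapley(i):
    # Average feature i's marginal contribution over all feature orderings.
    contribs = []
    for order in permutations(range(3)):
        before = set(order[:order.index(i)])
        contribs.append(value(before | {i}) - value(before))
    return mean(contribs)

phi = [shapley(i) for i in range(3)]
print(phi)
```

The values are additive by construction: they sum exactly to the gap between the model's prediction for this instance and its prediction at the background values, which is what makes a per-patient explanation of "how much each feature pushed the risk up or down" well defined.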

CONCLUSIONS

Interpretability techniques can vary in how they explain the behavior of a machine learning model. Global interpretability techniques have the advantage of generalizing over the entire population, while local interpretability techniques focus on giving explanations at the level of individual instances. Either approach can be valid depending on the needs of the application. Both are effective means of assisting clinicians in the medical decision process; however, clinicians always retain the final say on accepting or rejecting the outcome of a machine learning model and its explanations, based on their domain expertise.


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/884c/6664803/74606f8c1771/12911_2019_874_Fig1_HTML.jpg
