• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的巴西医院主要不良心血管事件风险预测:开发、外部验证和可解释性。

Machine learning-based risk prediction for major adverse cardiovascular events in a Brazilian hospital: Development, external validation, and interpretability.

机构信息

Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, São Paulo, Brazil.

Predicting Health GmbH, Graz, Austria.

出版信息

PLoS One. 2024 Oct 11;19(10):e0311719. doi: 10.1371/journal.pone.0311719. eCollection 2024.

DOI:10.1371/journal.pone.0311719
PMID:39392843
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11469522/
Abstract

BACKGROUND

Studies of cardiovascular disease risk prediction by machine learning algorithms often do not assess their ability to generalize to other populations and few of them include an analysis of the interpretability of individual predictions. This manuscript addresses the development and validation, both internal and external, of predictive models for the assessment of risks of major adverse cardiovascular events (MACE). Global and local interpretability analyses of predictions were conducted towards improving MACE's model reliability and tailoring preventive interventions.

METHODS

The models were trained and validated on a retrospective cohort with the use of data from Ribeirão Preto Medical School (RPMS), University of São Paulo, Brazil. Data from Beth Israel Deaconess Medical Center (BIDMC), USA, were used for external validation. A balanced sample of 6,000 MACE cases and 6,000 non-MACE cases from RPMS was created for training and internal validation and an additional one of 8,000 MACE cases and 8,000 non-MACE cases from BIDMC was employed for external validation. Eight machine learning algorithms, namely Penalized Logistic Regression, Random Forest, XGBoost, Decision Tree, Support Vector Machine, k-Nearest Neighbors, Naive Bayes, and Multi-Layer Perceptron were trained to predict a 5-year risk of major adverse cardiovascular events and their predictive performance was evaluated regarding accuracy, ROC curve (receiver operating characteristic), and AUC (area under the ROC curve). LIME and Shapley values were applied towards insights about model interpretability.

FINDINGS

Random Forest showed the best predictive performance in both internal validation (AUC = 0.871 (0.859-0.882); Accuracy = 0.794 (0.782-0.808)) and external one (AUC = 0.786 (0.778-0.792); Accuracy = 0.710 (0.704-0.717)). Compared to LIME, Shapley values suggest more consistent explanations on exploratory analysis and importance of features.

CONCLUSIONS

Among the machine learning algorithms evaluated, Random Forest showed the best generalization ability, both internally and externally. Shapley values for local interpretability were more informative than LIME ones, which is in line with our exploratory analysis and global interpretation of the final model. Machine learning algorithms with good generalization and accompanied by interpretability analyses are recommended for assessments of individual risks of cardiovascular diseases and development of personalized preventive actions.

摘要

背景

机器学习算法进行心血管疾病风险预测的研究通常并未评估其对其他人群的泛化能力,且很少对个体预测的可解释性进行分析。本文旨在开发和验证用于评估主要不良心血管事件(MACE)风险的预测模型,并对其进行内部和外部验证。通过对预测结果进行全局和局部可解释性分析,提高 MACE 模型的可靠性并为预防干预措施提供参考。

方法

本研究使用来自巴西圣保罗大学里贝朗普雷图医学院(RPMS)的数据,对回顾性队列进行模型训练和验证。使用来自美国贝斯以色列女执事医疗中心(BIDMC)的数据进行外部验证。从 RPMS 创建了一个平衡的样本,包含 6000 例 MACE 病例和 6000 例非 MACE 病例,用于训练和内部验证,并从 BIDMC 额外创建了一个包含 8000 例 MACE 病例和 8000 例非 MACE 病例的样本,用于外部验证。训练了八种机器学习算法,包括 Penalized Logistic Regression、Random Forest、XGBoost、Decision Tree、Support Vector Machine、k-Nearest Neighbors、Naive Bayes 和 Multi-Layer Perceptron,以预测 5 年的主要不良心血管事件风险,并评估其准确性、ROC 曲线(接收者操作特征)和 AUC(ROC 曲线下面积)。应用 LIME 和 Shapley 值来深入了解模型的可解释性。

结果

Random Forest 在内部验证(AUC = 0.871(0.859-0.882);准确度 = 0.794(0.782-0.808))和外部验证(AUC = 0.786(0.778-0.792);准确度 = 0.710(0.704-0.717))中均表现出最佳的预测性能。与 LIME 相比,Shapley 值在探索性分析和特征重要性方面提供了更一致的解释。

结论

在所评估的机器学习算法中,Random Forest 表现出最佳的泛化能力,无论是在内部还是外部验证中。局部可解释性的 Shapley 值比 LIME 值提供了更有信息量的解释,这与我们的探索性分析和最终模型的全局解释一致。建议对个体的心血管疾病风险进行评估,并制定个性化的预防措施时,使用具有良好泛化能力并结合可解释性分析的机器学习算法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/5d906eb8d4a8/pone.0311719.g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/842870fc246c/pone.0311719.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/b2581084a741/pone.0311719.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/0cf66b53fd98/pone.0311719.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/3d30e3982234/pone.0311719.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/509ebad30b01/pone.0311719.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/e866b15b4600/pone.0311719.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/4dc169b0033a/pone.0311719.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/9212ee8edad3/pone.0311719.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/cb10166820ed/pone.0311719.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/83c234021fcb/pone.0311719.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/cb054342b0f6/pone.0311719.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/b7cec09c88e7/pone.0311719.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/b0c46658c679/pone.0311719.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/e4bf8ff354c8/pone.0311719.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/6764a7d1afc2/pone.0311719.g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/08edc5ac1c92/pone.0311719.g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/406a1e6286c8/pone.0311719.g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/5d906eb8d4a8/pone.0311719.g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/842870fc246c/pone.0311719.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/b2581084a741/pone.0311719.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/0cf66b53fd98/pone.0311719.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/3d30e3982234/pone.0311719.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/509ebad30b01/pone.0311719.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/e866b15b4600/pone.0311719.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/4dc169b0033a/pone.0311719.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/9212ee8edad3/pone.0311719.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/cb10166820ed/pone.0311719.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/83c234021fcb/pone.0311719.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/cb054342b0f6/pone.0311719.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/b7cec09c88e7/pone.0311719.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/b0c46658c679/pone.0311719.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/e4bf8ff354c8/pone.0311719.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/6764a7d1afc2/pone.0311719.g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/08edc5ac1c92/pone.0311719.g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/406a1e6286c8/pone.0311719.g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a66/11469522/5d906eb8d4a8/pone.0311719.g018.jpg

相似文献

1
Machine learning-based risk prediction for major adverse cardiovascular events in a Brazilian hospital: Development, external validation, and interpretability.基于机器学习的巴西医院主要不良心血管事件风险预测:开发、外部验证和可解释性。
PLoS One. 2024 Oct 11;19(10):e0311719. doi: 10.1371/journal.pone.0311719. eCollection 2024.
2
Prediction Model of Osteonecrosis of the Femoral Head After Femoral Neck Fracture: Machine Learning-Based Development and Validation Study.股骨颈骨折后股骨头坏死的预测模型:基于机器学习的开发与验证研究
JMIR Med Inform. 2021 Nov 19;9(11):e30079. doi: 10.2196/30079.
3
Development and Validation of an Explainable Machine Learning Model for Predicting Myocardial Injury After Noncardiac Surgery in Two Centers in China: Retrospective Study.中国两个中心用于预测非心脏手术后心肌损伤的可解释机器学习模型的开发与验证:一项回顾性研究
JMIR Aging. 2024 Jul 26;7:e54872. doi: 10.2196/54872.
4
Prediction and validation of mild cognitive impairment in occupational dust exposure population based on machine learning.基于机器学习的职业性粉尘暴露人群轻度认知功能障碍的预测和验证。
Ecotoxicol Environ Saf. 2024 Oct 15;285:117111. doi: 10.1016/j.ecoenv.2024.117111. Epub 2024 Sep 26.
5
Development and Validation of a Machine Learning Algorithm to Predict the Risk of Blood Transfusion after Total Hip Replacement in Patients with Femoral Neck Fractures: A Multicenter Retrospective Cohort Study.用于预测股骨颈骨折患者全髋关节置换术后输血风险的机器学习算法的开发与验证:一项多中心回顾性队列研究
Orthop Surg. 2024 Aug;16(8):2066-2080. doi: 10.1111/os.14160. Epub 2024 Jul 1.
6
Risk Prediction of Major Adverse Cardiovascular Events Occurrence Within 6 Months After Coronary Revascularization: Machine Learning Study.冠状动脉血运重建后6个月内主要不良心血管事件发生的风险预测:机器学习研究
JMIR Med Inform. 2022 Apr 20;10(4):e33395. doi: 10.2196/33395.
7
Development and validation of explainable machine-learning models for carotid atherosclerosis early screening.开发和验证可解释的机器学习模型,用于颈动脉粥样硬化的早期筛查。
J Transl Med. 2023 May 29;21(1):353. doi: 10.1186/s12967-023-04093-8.
8
Application value of the automated machine learning model based on modified CT index combined with serological indices in the early prediction of lung cancer.基于改良CT指标联合血清学指标的自动化机器学习模型在肺癌早期预测中的应用价值
Front Public Health. 2024 Apr 5;12:1368217. doi: 10.3389/fpubh.2024.1368217. eCollection 2024.
9
Can Machine-learning Algorithms Predict Early Revision TKA in the Danish Knee Arthroplasty Registry?机器学习算法能否预测丹麦膝关节置换登记处的早期翻修 TKA?
Clin Orthop Relat Res. 2020 Sep;478(9):2088-2101. doi: 10.1097/CORR.0000000000001343.
10
Responsible AI for cardiovascular disease detection: Towards a privacy-preserving and interpretable model.心血管疾病检测的负责任 AI:迈向隐私保护和可解释的模型。
Comput Methods Programs Biomed. 2024 Sep;254:108289. doi: 10.1016/j.cmpb.2024.108289. Epub 2024 Jun 17.

引用本文的文献

1
Predicting cardiovascular risk with hybrid ensemble learning and explainable AI.使用混合集成学习和可解释人工智能预测心血管风险。
Sci Rep. 2025 May 23;15(1):17927. doi: 10.1038/s41598-025-01650-7.

本文引用的文献

1
Machine learning-based prediction of in-hospital death for patients with takotsubo syndrome: The InterTAK-ML model.基于机器学习的 Takotsubo 综合征患者院内死亡预测:InterTAK-ML 模型。
Eur J Heart Fail. 2023 Dec;25(12):2299-2311. doi: 10.1002/ejhf.2983. Epub 2023 Sep 18.
2
MIMIC-IV, a freely accessible electronic health record dataset.MIMIC-IV,一个可自由访问的电子健康记录数据集。
Sci Data. 2023 Jan 3;10(1):1. doi: 10.1038/s41597-022-01899-x.
3
Explainable machine learning framework for predicting long-term cardiovascular disease risk among adolescents.
用于预测青少年长期心血管疾病风险的可解释机器学习框架。
Sci Rep. 2022 Dec 19;12(1):21905. doi: 10.1038/s41598-022-25933-5.
4
Prediction of major adverse cardiovascular events in patients with acute coronary syndrome: Development and validation of a non-invasive nomogram model based on autonomic nervous system assessment.急性冠状动脉综合征患者主要不良心血管事件的预测:基于自主神经系统评估的无创列线图模型的建立与验证
Front Cardiovasc Med. 2022 Nov 3;9:1053470. doi: 10.3389/fcvm.2022.1053470. eCollection 2022.
5
Using Machine Learning Techniques to Predict MACE in Very Young Acute Coronary Syndrome Patients.使用机器学习技术预测极年轻急性冠状动脉综合征患者的主要不良心血管事件
Diagnostics (Basel). 2022 Feb 6;12(2):422. doi: 10.3390/diagnostics12020422.
6
Cardiovascular Statistics - Brazil 2021.心血管统计数据 - 巴西2021年
Arq Bras Cardiol. 2022 Jan;118(1):115-373. doi: 10.36660/abc.20211012.
7
Unsupervised Learning for Automated Detection of Coronary Artery Disease Subgroups.无监督学习在冠状动脉疾病亚组自动检测中的应用。
J Am Heart Assoc. 2021 Dec 7;10(23):e021976. doi: 10.1161/JAHA.121.021976. Epub 2021 Nov 30.
8
Major adverse cardiovascular event definitions used in observational analysis of administrative databases: a systematic review.行政数据库观察性分析中使用的主要不良心血管事件定义:一项系统综述。
BMC Med Res Methodol. 2021 Nov 6;21(1):241. doi: 10.1186/s12874-021-01440-5.
9
Statistical Development and Validation of Clinical Prediction Models.统计模型的开发与验证。
Anesthesiology. 2021 Sep 1;135(3):396-405. doi: 10.1097/ALN.0000000000003871.
10
Machine Learning Based Risk Prediction for Major Adverse Cardiovascular Events.基于机器学习的主要不良心血管事件风险预测
Stud Health Technol Inform. 2021 May 7;279:136-143. doi: 10.3233/SHTI210100.