利用机器学习和电子健康记录进行心血管疾病一级预防风险预测的机遇与挑战：一项系统综述

Opportunities and Challenges of Cardiovascular Disease Risk Prediction for Primary Prevention Using Machine Learning and Electronic Health Records: A Systematic Review.

作者信息

Liu Tianyi, Krentz Andrew J, Huo Zhiqiang, Ćurčin Vasa

机构信息

School of Life Course & Population Sciences, King's College London, SE1 1UL London, UK.

Metadvice, 1025 St-Sulpice, Switzerland.

出版信息

Rev Cardiovasc Med. 2025 Apr 25;26(4):37443. doi: 10.31083/RCM37443. eCollection 2025 Apr.

DOI:10.31083/RCM37443

PMID:40351688

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12059770/

Abstract

BACKGROUND

Cardiovascular disease (CVD) remains the foremost cause of morbidity and mortality worldwide. Recent advancements in machine learning (ML) have demonstrated substantial potential in augmenting risk stratification for primary prevention, surpassing conventional statistical models in predictive performance. Thus, integrating ML with Electronic Health Records (EHRs) enables refined risk estimation by leveraging the granularity and breadth of longitudinal individual patient data. However, fundamental barriers persist, including limited generalizability, challenges in interpretability, and the absence of rigorous external validation, all of which impede widespread clinical deployment.

METHODS

This review adheres to the methodological rigor of the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) and Scale for the Assessment of Narrative Review Articles (SANRA) guidelines. A systematic literature search was performed in March 2024, encompassing the Medline and Embase databases, to identify studies published since 2010. Supplementary references were retrieved from the Institute for Scientific Information (ISI) Web of Science, and manual searches were curated. The selection process, conducted via Rayyan, focused on systematic and narrative reviews evaluating ML-driven models for long-term CVD risk prediction within primary prevention contexts utilizing EHR data. Studies investigating short-term prognostication, highly specific comorbid cohorts, or conventional models devoid of ML components were excluded.

RESULTS

Following an exhaustive screening of 1757 records, 22 studies met the inclusion criteria. Of these, 10 were systematic reviews (four incorporating meta-analyses), while 12 constituted narrative reviews, with the majority published post-2020. The synthesis underscores the superiority of ML in modeling intricate EHR-derived risk factors, facilitating precision-driven cardiovascular risk assessment. Nonetheless, salient challenges endure heterogeneity in CVD outcome definitions, undermine comparability, data incompleteness and inconsistency compromise model robustness, and a dearth of external validation constrains clinical translatability. Moreover, ethical and regulatory considerations, including algorithmic opacity, equity in predictive performance, and the absence of standardized evaluation frameworks, pose formidable obstacles to seamless integration into clinical workflows.

CONCLUSIONS

Despite the transformative potential of ML-based CVD risk prediction, it remains encumbered by methodological, technical, and regulatory impediments that hinder its full-scale adoption into real-world healthcare settings. This review underscores the imperative circumstances for standardized validation protocols, stringent regulatory oversight, and interdisciplinary collaboration to bridge the translational divide. Our findings established an integrative framework for developing, validating, and applying ML-based CVD risk prediction algorithms, addressing both clinical and technical dimensions. To further advance this field, we propose a standardized, transparent, and regulated EHR platform that facilitates fair model evaluation, reproducibility, and clinical translation by providing a high-quality, representative dataset with structured governance and benchmarking mechanisms. Meanwhile, future endeavors must prioritize enhancing model transparency, mitigating biases, and ensuring adaptability to heterogeneous clinical populations, fostering equitable and evidence-based implementation of ML-driven predictive analytics in cardiovascular medicine.

摘要

背景

心血管疾病（CVD）仍是全球发病和死亡的首要原因。机器学习（ML）的最新进展已显示出在加强一级预防风险分层方面的巨大潜力，其预测性能超过了传统统计模型。因此，将ML与电子健康记录（EHR）相结合，能够通过利用个体患者纵向数据的粒度和广度实现更精确的风险估计。然而，一些基本障碍仍然存在，包括可推广性有限、可解释性方面的挑战以及缺乏严格的外部验证，所有这些都阻碍了其在临床中的广泛应用。

方法

本综述遵循系统评价和Meta分析的首选报告项目（PRISMA）以及叙述性综述文章评估量表（SANRA）指南的方法严谨性。2024年3月进行了系统的文献检索，涵盖Medline和Embase数据库，以识别2010年以来发表的研究。从科学信息研究所（ISI）的科学网检索了补充参考文献，并进行了手动检索。通过Rayyan进行的筛选过程侧重于评估在一级预防背景下利用EHR数据进行长期CVD风险预测的ML驱动模型的系统评价和叙述性综述。排除了研究短期预后、高度特异性合并症队列或缺乏ML组件的传统模型的研究。

结果

在对1757条记录进行详尽筛选后，22项研究符合纳入标准。其中，10项为系统评价（4项纳入了Meta分析），12项为叙述性综述，大多数研究发表于2020年之后。综合分析强调了ML在对复杂的EHR衍生风险因素进行建模方面的优越性，有助于进行精准驱动的心血管风险评估。尽管如此，仍存在一些突出挑战，CVD结局定义的异质性破坏了可比性，数据不完整和不一致影响了模型的稳健性，缺乏外部验证限制了临床可转化性。此外，伦理和监管方面的考虑，包括算法不透明、预测性能的公平性以及缺乏标准化评估框架，对无缝融入临床工作流程构成了巨大障碍。

结论

尽管基于ML的CVD风险预测具有变革潜力，但它仍然受到方法、技术和监管方面的障碍的阻碍，这些障碍阻碍了其在现实世界医疗环境中的全面应用。本综述强调了标准化验证方案、严格监管监督和跨学科合作以弥合转化差距的迫切情况。我们的研究结果建立了一个用于开发、验证和应用基于ML的CVD风险预测算法的综合框架，涵盖了临床和技术层面。为了进一步推动该领域的发展，我们提出了一个标准化、透明且受监管的EHR平台，通过提供具有结构化治理和基准机制的高质量、代表性数据集，促进公平的模型评估、可重复性和临床转化。同时，未来的努力必须优先提高模型透明度、减轻偏差并确保对异质临床人群的适应性，促进ML驱动的预测分析在心血管医学中的公平和循证实施。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d9de/12059770/4c929155c11f/2153-8174-26-4-37443-g1.jpg

相似文献

Opportunities and Challenges of Cardiovascular Disease Risk Prediction for Primary Prevention Using Machine Learning and Electronic Health Records: A Systematic Review.

Rev Cardiovasc Med. 2025 Apr 25;26(4):37443. doi: 10.31083/RCM37443. eCollection 2025 Apr.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Artificial intelligence in hospital infection prevention: an integrative review.

Front Public Health. 2025 Apr 2;13:1547450. doi: 10.3389/fpubh.2025.1547450. eCollection 2025.

The future of Cochrane Neonatal.

Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.

Machine learning based prediction models for cardiovascular disease risk using electronic health records data: systematic review and meta-analysis.

Eur Heart J Digit Health. 2024 Oct 27;6(1):7-22. doi: 10.1093/ehjdh/ztae080. eCollection 2025 Jan.

Artificial intelligence for breast cancer detection and its health technology assessment: A scoping review.

Comput Biol Med. 2025 Jan;184:109391. doi: 10.1016/j.compbiomed.2024.109391. Epub 2024 Nov 22.

Beyond the black stump: rapid reviews of health research issues affecting regional, rural and remote Australia.

Med J Aust. 2020 Dec;213 Suppl 11:S3-S32.e1. doi: 10.5694/mja2.50881.

Artificial intelligence for cardiovascular disease risk assessment in personalised framework: a scoping review.

EClinicalMedicine. 2024 May 27;73:102660. doi: 10.1016/j.eclinm.2024.102660. eCollection 2024 Jul.

The Role of AI in Nursing Education and Practice: Umbrella Review.

J Med Internet Res. 2025 Apr 4;27:e69881. doi: 10.2196/69881.

Artificial Intelligence in Thoracic Surgery: A Review Bridging Innovation and Clinical Practice for the Next Generation of Surgical Care.

J Clin Med. 2025 Apr 16;14(8):2729. doi: 10.3390/jcm14082729.

本文引用的文献

Development and validation of a new algorithm for improved cardiovascular risk prediction.

Nat Med. 2024 May;30(5):1440-1447. doi: 10.1038/s41591-024-02905-y. Epub 2024 Apr 18.

TRIPOD+AI statement: updated guidance for reporting clinical prediction models that use regression or machine learning methods.

BMJ. 2024 Apr 16;385:e078378. doi: 10.1136/bmj-2023-078378.

Major Limitations of Cardiovascular Risk Scores.

Cardiovasc Ther. 2024 Feb 28;2024:4133365. doi: 10.1155/2024/4133365. eCollection 2024.

Artificial intelligence in the risk prediction models of cardiovascular disease and development of an independent validation screening tool: a systematic review.

BMC Med. 2024 Feb 5;22(1):56. doi: 10.1186/s12916-024-03273-7.

Prioritizing the primary prevention of heart failure: Measuring, modifying and monitoring risk.

Prog Cardiovasc Dis. 2024 Jan-Feb;82:2-14. doi: 10.1016/j.pcad.2024.01.001. Epub 2024 Jan 24.

Artificial Intelligence-Based Clinical Decision Support Systems in Cardiovascular Diseases.

Anatol J Cardiol. 2024 Jan 7;28(2):74-86. doi: 10.14744/AnatolJCardiol.2023.3685.

Global Burden of Cardiovascular Diseases and Risks, 1990-2022.

J Am Coll Cardiol. 2023 Dec 19;82(25):2350-2473. doi: 10.1016/j.jacc.2023.11.007.

Polysocial Risk Scores: Implications for Cardiovascular Disease Risk Assessment and Management.

Curr Atheroscler Rep. 2023 Dec;25(12):1059-1068. doi: 10.1007/s11883-023-01173-4. Epub 2023 Dec 4.

Machine Learning in Cardiovascular Risk Prediction and Precision Preventive Approaches.

Curr Atheroscler Rep. 2023 Dec;25(12):1069-1081. doi: 10.1007/s11883-023-01174-3. Epub 2023 Nov 27.

Novel Prediction Equations for Absolute Risk Assessment of Total Cardiovascular Disease Incorporating Cardiovascular-Kidney-Metabolic Health: A Scientific Statement From the American Heart Association.

Circulation. 2023 Dec 12;148(24):1982-2004. doi: 10.1161/CIR.0000000000001191. Epub 2023 Nov 10.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用机器学习和电子健康记录进行心血管疾病一级预防风险预测的机遇与挑战：一项系统综述

Opportunities and Challenges of Cardiovascular Disease Risk Prediction for Primary Prevention Using Machine Learning and Electronic Health Records: A Systematic Review.

作者信息

Liu Tianyi, Krentz Andrew J, Huo Zhiqiang, Ćurčin Vasa

机构信息

School of Life Course & Population Sciences, King's College London, SE1 1UL London, UK.

Metadvice, 1025 St-Sulpice, Switzerland.