利用机器学习和来自区域医疗保健系统的电子病历数据特征来自动化和改进心血管疾病预测。

Automating and improving cardiovascular disease prediction using Machine learning and EMR data features from a regional healthcare system.

机构信息

School of Business, State University of New York at New Paltz, New Paltz, NY, USA.

Department of Computer Science, Northern Kentucky University, Highland Heights, Kentucky, USA.

出版信息

Int J Med Inform. 2022 Jul;163:104786. doi: 10.1016/j.ijmedinf.2022.104786. Epub 2022 Apr 29.

DOI:10.1016/j.ijmedinf.2022.104786

PMID:35512622

Abstract

BACKGROUND

The ACC/AHA Pooled Cohort Equations (PCE) Risk Calculator is widely used in the US for primary prevention of atherosclerotic cardiovascular disease (ASCVD), but may under- or over-estimate risk in some populations. We therefore designed an automated, population-specific ASCVD risk calculator using machine-learning (ML) methods and electronic medical record (EMR) data, and compared its predictive power with that of the PCE calculator.

METHODS AND FINDINGS

We collected data from 101,110 unique EMRs of living patients from January 1, 2009 to April 30, 2020. ML techniques were applied to patient datasets that included either only cross-sectional (CS) features, or CS combined with longitudinal (LT) features derived from vital statistics and laboratory values. We compared the utility of the models using a proposed new cost measure (Screened Cases Percentage @ Sensitivity level). All ML models tested achieved better predictive power than the PCE risk calculator. The random forest ML technique (RF) applied on the combination of CS and LT features (RF-LTC) produced the best area under curve (AUC) score of 0.902 (95% confidence interval (CI), 0.895-0.910). To detect 90% of all positive ASCVD cases, the best ML model required screening only 43% of patients, while the PCE risk calculator required screening 69% of patients.

CONCLUSIONS

Prediction models built using ML techniques improved ASCVD prediction and reduced the number of screenings required to predict ASCVD when compared with the PCE calculator, alone. Combining LT and CS features in the ML models significantly improved ASCVD prediction compared with using CS features, alone.

摘要

背景

ACC/AHA 队列方程（PCE）风险计算器在美国被广泛用于动脉粥样硬化性心血管疾病（ASCVD）的一级预防，但在某些人群中可能会低估或高估风险。因此，我们使用机器学习（ML）方法和电子病历（EMR）数据设计了一种自动化的、特定人群的 ASCVD 风险计算器，并将其预测能力与 PCE 计算器进行了比较。

方法和发现

我们从 2009 年 1 月 1 日至 2020 年 4 月 30 日期间收集了来自 101,110 个独特的 EMR 的活患者数据。ML 技术应用于包含仅横断面（CS）特征或 CS 与来自生命统计和实验室值的纵向（LT）特征相结合的患者数据集。我们使用一种新的成本度量（灵敏度水平下的筛查病例百分比）来比较模型的效用。所有测试的 ML 模型都比 PCE 风险计算器具有更好的预测能力。随机森林 ML 技术（RF）应用于 CS 和 LT 特征的组合（RF-LTC）产生了最佳的曲线下面积（AUC）评分 0.902（95%置信区间（CI），0.895-0.910）。为了检测所有阳性 ASCVD 病例的 90%，最佳 ML 模型只需筛查 43%的患者，而 PCE 风险计算器则需要筛查 69%的患者。

结论

与单独使用 PCE 计算器相比，使用 ML 技术构建的预测模型可提高 ASCVD 的预测能力，并减少预测 ASCVD 所需的筛查数量。与仅使用 CS 特征相比，在 ML 模型中结合 LT 和 CS 特征可显著提高 ASCVD 的预测能力。

相似文献

Automating and improving cardiovascular disease prediction using Machine learning and EMR data features from a regional healthcare system.利用机器学习和来自区域医疗保健系统的电子病历数据特征来自动化和改进心血管疾病预测。

Int J Med Inform. 2022 Jul;163:104786. doi: 10.1016/j.ijmedinf.2022.104786. Epub 2022 Apr 29.

Comparing the performance of machine learning and conventional models for predicting atherosclerotic cardiovascular disease in a general Chinese population.比较机器学习模型和传统模型在预测一般中国人群中动脉粥样硬化性心血管疾病方面的性能。

BMC Med Inform Decis Mak. 2023 Jul 24;23(1):134. doi: 10.1186/s12911-023-02242-z.

Machine Learning Outperforms ACC / AHA CVD Risk Calculator in MESA.机器学习在 MESA 研究中优于 ACC/AHA CVD 风险计算器。

J Am Heart Assoc. 2018 Nov 20;7(22):e009476. doi: 10.1161/JAHA.118.009476.

Machine learning and atherosclerotic cardiovascular disease risk prediction in a multi-ethnic population.多民族人群中的机器学习与动脉粥样硬化性心血管疾病风险预测

NPJ Digit Med. 2020 Sep 23;3:125. doi: 10.1038/s41746-020-00331-1. eCollection 2020.

Evaluation of Atherosclerotic Cardiovascular Risk Prediction Models in China: Results From the CHERRY Study.中国动脉粥样硬化性心血管疾病风险预测模型的评估：CHERRY研究结果

JACC Asia. 2022 Jan 4;2(1):33-43. doi: 10.1016/j.jacasi.2021.10.007. eCollection 2022 Feb.

Comparative performance of the two pooled cohort equations for predicting atherosclerotic cardiovascular disease.两种列线图方程预测动脉粥样硬化性心血管疾病的比较性能。

Atherosclerosis. 2021 Oct;334:23-29. doi: 10.1016/j.atherosclerosis.2021.08.034. Epub 2021 Aug 21.

Combining European and U.S. risk prediction models with polygenic risk scores to refine cardiovascular prevention: the CoLaus|PsyCoLaus Study.将欧洲和美国的风险预测模型与多基因风险评分相结合，以完善心血管预防：CoLaus|PsyCoLaus 研究。

Eur J Prev Cardiol. 2023 May 9;30(7):561-571. doi: 10.1093/eurjpc/zwad012.

Incorporating longitudinal history of risk factors into atherosclerotic cardiovascular disease risk prediction using deep learning.利用深度学习将危险因素的纵向历史纳入动脉粥样硬化性心血管疾病风险预测中。

Sci Rep. 2024 Jan 31;14(1):2554. doi: 10.1038/s41598-024-51685-5.

Machine learning approaches improve risk stratification for secondary cardiovascular disease prevention in multiethnic patients.机器学习方法可提高多民族患者二级心血管疾病预防的风险分层。

Open Heart. 2021 Oct;8(2). doi: 10.1136/openhrt-2021-001802.

Estimation of Atherosclerotic Cardiovascular Disease Risk Among Patients in the Veterans Affairs Health Care System.在退伍军人事务医疗保健系统中的患者中估算动脉粥样硬化性心血管疾病风险。

JAMA Netw Open. 2020 Jul 1;3(7):e208236. doi: 10.1001/jamanetworkopen.2020.8236.

引用本文的文献

Machine learning based prediction models for cardiovascular disease risk using electronic health records data: systematic review and meta-analysis.基于机器学习利用电子健康记录数据预测心血管疾病风险的模型：系统评价与荟萃分析

Eur Heart J Digit Health. 2024 Oct 27;6(1):7-22. doi: 10.1093/ehjdh/ztae080. eCollection 2025 Jan.

Increasing provider awareness of Lp(a) testing for patients at risk for cardiovascular disease: A comparative study.提高医疗服务提供者对心血管疾病风险患者进行脂蛋白(a)检测的认识：一项比较研究。

Am J Prev Cardiol. 2024 Nov 23;21:100895. doi: 10.1016/j.ajpc.2024.100895. eCollection 2025 Mar.

Detecting cardiovascular diseases using unsupervised machine learning clustering based on electronic medical records.基于电子病历，使用无监督机器学习聚类法检测心血管疾病。

BMC Med Res Methodol. 2024 Dec 19;24(1):309. doi: 10.1186/s12874-024-02422-z.

CardioRiskNet: A Hybrid AI-Based Model for Explainable Risk Prediction and Prognosis in Cardiovascular Disease.心脏风险网络：一种基于人工智能的混合模型，用于心血管疾病的可解释风险预测和预后评估。

Bioengineering (Basel). 2024 Aug 12;11(8):822. doi: 10.3390/bioengineering11080822.

The Use of Deep Learning and Machine Learning on Longitudinal Electronic Health Records for the Early Detection and Prevention of Diseases: Scoping Review.深度学习和机器学习在纵向电子健康记录中用于疾病的早期检测和预防的应用：范围综述。

J Med Internet Res. 2024 Aug 20;26:e48320. doi: 10.2196/48320.

Improving cardiovascular risk prediction through machine learning modelling of irregularly repeated electronic health records.通过对不规则重复的电子健康记录进行机器学习建模来改善心血管风险预测。

Eur Heart J Digit Health. 2023 Oct 17;5(1):30-40. doi: 10.1093/ehjdh/ztad058. eCollection 2024 Jan.

Machine learning for the prediction of atherosclerotic cardiovascular disease during 3-year follow up in Chinese type 2 diabetes mellitus patients.机器学习预测中国 2 型糖尿病患者 3 年随访期间的动脉粥样硬化性心血管疾病。

J Diabetes Investig. 2023 Nov;14(11):1289-1302. doi: 10.1111/jdi.14069. Epub 2023 Aug 22.

Machine learning framework for atherosclerotic cardiovascular disease risk assessment.用于动脉粥样硬化性心血管疾病风险评估的机器学习框架

J Diabetes Metab Disord. 2022 Nov 28;22(1):423-430. doi: 10.1007/s40200-022-01160-7. eCollection 2023 Jun.

Cardiovascular diseases prediction by machine learning incorporation with deep learning.结合深度学习的机器学习用于心血管疾病预测

Front Med (Lausanne). 2023 Apr 17;10:1150933. doi: 10.3389/fmed.2023.1150933. eCollection 2023.

Cardiovascular complications in a diabetes prediction model using machine learning: a systematic review.基于机器学习的糖尿病预测模型中的心血管并发症：系统评价。

Cardiovasc Diabetol. 2023 Jan 19;22(1):13. doi: 10.1186/s12933-023-01741-7.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

利用机器学习和来自区域医疗保健系统的电子病历数据特征来自动化和改进心血管疾病预测。

Automating and improving cardiovascular disease prediction using Machine learning and EMR data features from a regional healthcare system.

机构信息

出版信息

BACKGROUND

METHODS AND FINDINGS

CONCLUSIONS

背景

方法和发现

结论

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献