The Use of Synthetic Electronic Health Record Data and Deep Learning to Improve Timing of High-Risk Heart Failure Surgical Intervention by Predicting Proximity to Catastrophic Decompensation.

Authors

Guo Aixia, Foraker Randi E, MacGregor Robert M, Masood Faraz M, Cupps Brian P, Pasque Michael K

Affiliations

Institute for Informatics (I2), Washington University School of Medicine, St. Louis, MO, United States.

Department of Internal Medicine, Washington University School of Medicine, St. Louis, MO, United States.

Publication information

Front Digit Health. 2020 Dec 7;2:576945. doi: 10.3389/fdgth.2020.576945. eCollection 2020.

DOI:10.3389/fdgth.2020.576945

PMID:34713050

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8521851/

Abstract

Although many clinical metrics are associated with proximity to decompensation in heart failure (HF), none is individually accurate enough to risk-stratify HF patients on a patient-by-patient basis. The dire consequences of this inaccuracy in risk stratification have profoundly lowered the clinical threshold for application of high-risk surgical intervention, such as ventricular assist device placement. Machine learning can detect non-intuitive classifier patterns that allow for innovative combination of patient feature predictive capability. A machine learning-based clinical tool to identify proximity to catastrophic HF deterioration on a patient-specific basis would enable more efficient direction of high-risk surgical intervention to those patients who have the most to gain from it, while sparing others. Synthetic electronic health record (EHR) data are statistically indistinguishable from the original protected health information, and can be analyzed as if they were original data but without any privacy concerns. We demonstrate that synthetic EHR data can be easily accessed and analyzed and are amenable to machine learning analyses. We developed synthetic data from EHR data of 26,575 HF patients admitted to a single institution during the decade ending on December 31, 2018. Twenty-seven clinically relevant features were synthesized and used in supervised deep learning and machine learning algorithms (i.e., deep neural networks [DNN], random forest [RF], and logistic regression [LR]) to explore their ability to predict 1-year mortality by five-fold cross-validation. We conducted analyses leveraging features from prior to/at and after/at the time of HF diagnosis. The area under the receiver operating characteristic curve (AUC) was used to evaluate the performance of the three models: the mean AUC was 0.80 for DNN, 0.72 for RF, and 0.74 for LR. Age, creatinine, body mass index, and blood pressure levels were especially important features in predicting death within 1 year among HF patients.
Machine learning models have considerable potential to improve accuracy in mortality prediction, such that high-risk surgical intervention can be applied only in those patients who stand to benefit from it. Access to EHR-based synthetic data derivatives eliminates risk of exposure of EHR data, speeds time-to-insight, and facilitates data sharing. As more clinical, imaging, and contractile features with proven predictive capability are added to these models, the development of a clinical tool to assist in timing of intervention in surgical candidates may be possible.
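
The evaluation protocol named in the abstract (five-fold cross-validation scored by mean AUC) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the data are simulated (300 made-up records with 27 random features, only two of which carry signal), the model is a small hand-written logistic regression rather than the study's DNN, RF, or LR implementations, and the helper names `auc`, `fit_logistic`, `predict`, and `five_fold_auc` are hypothetical. It is not the authors' pipeline and does not reproduce their results.

```python
import math
import random


def auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def fit_logistic(X, y, lr=0.5, epochs=100):
    """Batch gradient descent for logistic regression; returns (weights, bias)."""
    w, b, n = [0.0] * len(X[0]), 0.0, len(X)
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * len(w), 0.0
        for xi, yi in zip(X, y):
            z = sum(wj * xj for wj, xj in zip(w, xi)) + b
            err = 1.0 / (1.0 + math.exp(-z)) - yi  # predicted probability minus label
            for j, xj in enumerate(xi):
                grad_w[j] += err * xj
            grad_b += err
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def predict(model, X):
    """Linear score per record; it ranks records the same way the probability would."""
    w, b = model
    return [sum(wj * xj for wj, xj in zip(w, xi)) + b for xi in X]


def five_fold_auc(X, y, k=5, seed=0):
    """Shuffle once, split into k folds, train on k-1 folds, score AUC on the held-out fold."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    aucs = []
    for test_idx in folds:
        held_out = set(test_idx)
        train_idx = [i for i in idx if i not in held_out]
        model = fit_logistic([X[i] for i in train_idx], [y[i] for i in train_idx])
        scores = predict(model, [X[i] for i in test_idx])
        aucs.append(auc([y[i] for i in test_idx], scores))
    return aucs


# Simulated placeholder data (NOT the study's data): 27 standard-normal features,
# with a binary outcome that depends on the first two features only.
rng = random.Random(42)
X, y = [], []
for _ in range(300):
    x = [rng.gauss(0, 1) for _ in range(27)]
    logit = 1.2 * x[0] + 0.8 * x[1] - 1.0
    y.append(1 if rng.random() < 1.0 / (1.0 + math.exp(-logit)) else 0)
    X.append(x)

fold_aucs = five_fold_auc(X, y)
mean_auc = sum(fold_aucs) / len(fold_aucs)
print("per-fold AUC:", [round(a, 2) for a in fold_aucs])
print("mean AUC:", round(mean_auc, 2))
```

Each record is held out exactly once, so the mean over the five folds summarizes performance on data the model did not see during fitting. Comparing several model families, as the abstract describes, would amount to running the same fold loop once per model and comparing the resulting mean AUCs.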

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/efe3/8521851/e84a6153a3c4/fdgth-02-576945-g0001.jpg

Similar articles

Electronic Health Record-Based Deep Learning Prediction of Death or Severe Decompensation in Heart Failure Patients.

JACC Heart Fail. 2022 Sep;10(9):637-647. doi: 10.1016/j.jchf.2022.05.010. Epub 2022 Jul 6.

Predicting mortality among patients with liver cirrhosis in electronic health records with machine learning.

PLoS One. 2021 Aug 31;16(8):e0256428. doi: 10.1371/journal.pone.0256428. eCollection 2021.

Machine Learning Outcome Prediction in Dilated Cardiomyopathy Using Regional Left Ventricular Multiparametric Strain.

Ann Biomed Eng. 2021 Feb;49(2):922-932. doi: 10.1007/s10439-020-02639-1. Epub 2020 Oct 1.

Predicting post-stroke pneumonia using deep neural network approaches.

Int J Med Inform. 2019 Dec;132:103986. doi: 10.1016/j.ijmedinf.2019.103986. Epub 2019 Oct 1.

Accurate Prediction of Coronary Heart Disease for Patients With Hypertension From Electronic Health Records With Big Data and Machine-Learning Methods: Model Development and Performance Evaluation.

JMIR Med Inform. 2020 Jul 6;8(7):e17257. doi: 10.2196/17257.

Comparison of Machine Learning Methods With Traditional Models for Use of Administrative Claims With Electronic Medical Records to Predict Heart Failure Outcomes.

JAMA Netw Open. 2020 Jan 3;3(1):e1918962. doi: 10.1001/jamanetworkopen.2019.18962.

Examining arterial pulsation to identify and risk-stratify heart failure subjects with deep neural network.

Phys Eng Sci Med. 2024 Jun;47(2):477-489. doi: 10.1007/s13246-023-01378-6. Epub 2024 Feb 15.

Machine Learning Models for Predicting Influential Factors of Early Outcomes in Acute Ischemic Stroke: Registry-Based Study.

JMIR Med Inform. 2022 Mar 25;10(3):e32508. doi: 10.2196/32508.

Developing and comparing deep learning and machine learning algorithms for osteoporosis risk prediction.

Front Artif Intell. 2024 Jun 11;7:1355287. doi: 10.3389/frai.2024.1355287. eCollection 2024.

Cited by

Can Synthetic Data Allow for Smaller Sample Sizes in Chronic Urticaria Research?

Clin Transl Allergy. 2025 Aug;15(8):e70087. doi: 10.1002/clt2.70087.

Decades in the Making: The Evolution of Digital Health Research Infrastructure Through Synthetic Data, Common Data Models, and Federated Learning.

J Med Internet Res. 2024 Dec 20;26:e58637. doi: 10.2196/58637.

Synthetic data can aid the analysis of clinical outcomes: How much can it be trusted?

Proc Natl Acad Sci U S A. 2024 Aug 6;121(32):e2414310121. doi: 10.1073/pnas.2414310121. Epub 2024 Jul 31.

An evaluation of the replicability of analyses using synthetic health data.

Sci Rep. 2024 Mar 24;14(1):6978. doi: 10.1038/s41598-024-57207-7.

Evaluating the Utility and Privacy of Synthetic Breast Cancer Clinical Trial Data Sets.

JCO Clin Cancer Inform. 2023 Sep;7:e2300116. doi: 10.1200/CCI.23.00116.

Digitalization of prevention and treatment and the combination of western and Chinese medicine in management of acute heart failure.

Front Cardiovasc Med. 2023 May 25;10:1146941. doi: 10.3389/fcvm.2023.1146941. eCollection 2023.

A Multifaceted benchmarking of synthetic electronic health record generation models.

Nat Commun. 2022 Dec 9;13(1):7609. doi: 10.1038/s41467-022-35295-1.

Deep Convolutional Generative Adversarial Networks to Enhance Artificial Intelligence in Healthcare: A Skin Cancer Application.

Sensors (Basel). 2022 Aug 17;22(16):6145. doi: 10.3390/s22166145.

Leveraging Artificial Intelligence and Synthetic Data Derivatives for Spine Surgery Research.

Global Spine J. 2023 Oct;13(8):2409-2421. doi: 10.1177/21925682221085535. Epub 2022 Apr 3.

The National COVID Cohort Collaborative: Analyses of Original and Computationally Derived Electronic Health Record Data.

J Med Internet Res. 2021 Oct 4;23(10):e30697. doi: 10.2196/30697.

References

Spot the difference: comparing results of analyses from real patient data and synthetic derivatives.

JAMIA Open. 2020 Dec 14;3(4):557-566. doi: 10.1093/jamiaopen/ooaa060. eCollection 2020 Dec.

Improving risk prediction in heart failure using machine learning.

Eur J Heart Fail. 2020 Jan;22(1):139-147. doi: 10.1002/ejhf.1628. Epub 2019 Nov 12.

Machine Learning Prediction of Mortality and Hospitalization in Heart Failure With Preserved Ejection Fraction.

JACC Heart Fail. 2020 Jan;8(1):12-21. doi: 10.1016/j.jchf.2019.06.013. Epub 2019 Oct 9.

Artificial intelligence algorithm for predicting mortality of patients with acute heart failure.

PLoS One. 2019 Jul 8;14(7):e0219302. doi: 10.1371/journal.pone.0219302. eCollection 2019.

Machine learning-based prediction of heart failure readmission or death: implications of choosing the right model and the right metrics.

ESC Heart Fail. 2019 Apr;6(2):428-435. doi: 10.1002/ehf2.12419. Epub 2019 Feb 27.

Deep learning cardiac motion analysis for human survival prediction.

Nat Mach Intell. 2019 Feb 11;1:95-104. doi: 10.1038/s42256-019-0019-2.

Are Synthetic Data Derivatives the Future of Translational Medicine?

JACC Basic Transl Sci. 2018 Nov 12;3(5):716-718. doi: 10.1016/j.jacbts.2018.08.007. eCollection 2018 Oct.

Epidemiology, pathophysiology and clinical outcomes for heart failure patients with a mid-range ejection fraction.

Eur J Heart Fail. 2017 Dec;19(12):1597-1605. doi: 10.1002/ejhf.879. Epub 2017 Jun 14.

Heart Failure: Diagnosis, Management and Utilization.

J Clin Med. 2016 Jun 29;5(7):62. doi: 10.3390/jcm5070062.

Common long-term complications of adult congenital heart disease: avoid falling in a H.E.A.P.

Expert Rev Cardiovasc Ther. 2016;14(4):445-62. doi: 10.1586/14779072.2016.1133294. Epub 2016 Jan 25.
