基于机器学习的 COVID-19 患者出院预测模型：利用电子健康记录进行开发和评估。

Machine learning-based prediction models for home discharge in patients with COVID-19: Development and evaluation using electronic health records.

机构信息

Department of Health Outcomes and Biomedical Informatics, University of Florida College of Medicine, Gainesville, FL, United States of America.

Department of Pharmaceutical Outcomes and Policy, University of Florida College of Pharmacy, Gainesville, FL, United States of America.

出版信息

PLoS One. 2023 Oct 20;18(10):e0292888. doi: 10.1371/journal.pone.0292888. eCollection 2023.

DOI:10.1371/journal.pone.0292888

PMID:37862334

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10588875/

Abstract

OBJECTIVE

This study aimed to develop and validate predictive models using electronic health records (EHR) data to determine whether hospitalized COVID-19-positive patients would be admitted to alternative medical care or discharged home.

METHODS

We conducted a retrospective cohort study using deidentified data from the University of Florida Health Integrated Data Repository. The study included 1,578 adult patients (≥18 years) who tested positive for COVID-19 while hospitalized, comprising 960 (60.8%) female patients with a mean (SD) age of 51.86 (18.49) years and 618 (39.2%) male patients with a mean (SD) age of 54.35 (18.48) years. Machine learning (ML) model training involved cross-validation to assess their performance in predicting patient disposition.

RESULTS

We developed and validated six supervised ML-based prediction models (logistic regression, Gaussian Naïve Bayes, k-nearest neighbors, decision trees, random forest, and support vector machine classifier) to predict patient discharge status. The models were evaluated based on the area under the receiver operating characteristic curve (ROC-AUC), precision, accuracy, F1 score, and Brier score. The random forest classifier exhibited the highest performance, achieving an accuracy of 0.84 and an AUC of 0.72. Logistic regression (accuracy: 0.85, AUC: 0.71), k-nearest neighbor (accuracy: 0.84, AUC: 0.63), decision tree (accuracy: 0.84, AUC: 0.61), Gaussian Naïve Bayes (accuracy: 0.84, AUC: 0.66), and support vector machine classifier (accuracy: 0.84, AUC: 0.67) also demonstrated valuable predictive capabilities.

SIGNIFICANCE

This study's findings are crucial for efficiently allocating healthcare resources during pandemics like COVID-19. By harnessing ML techniques and EHR data, we can create predictive tools to identify patients at greater risk of severe symptoms based on their medical histories. The models developed here serve as a foundation for expanding the toolkit available to healthcare professionals and organizations. Additionally, explainable ML methods, such as Shapley Additive Explanations, aid in uncovering underlying data features that inform healthcare decision-making processes.

摘要

目的

本研究旨在利用电子健康记录（EHR）数据开发和验证预测模型，以确定住院的 COVID-19 阳性患者是否会转至其他医疗护理或出院回家。

方法

我们进行了一项回顾性队列研究，使用了佛罗里达大学健康综合数据存储库的匿名数据。该研究包括 1578 名成年 COVID-19 住院阳性患者，其中 960 名（60.8%）为女性，平均（SD）年龄为 51.86（18.49）岁，618 名（39.2%）为男性，平均（SD）年龄为 54.35（18.48）岁。机器学习（ML）模型训练涉及交叉验证，以评估其在预测患者处置方面的性能。

结果

我们开发并验证了六个基于监督学习的预测模型（逻辑回归、高斯朴素贝叶斯、k-最近邻、决策树、随机森林和支持向量机分类器），以预测患者出院状态。基于受试者工作特征曲线下的面积（ROC-AUC）、精度、准确性、F1 评分和 Brier 评分对模型进行评估。随机森林分类器表现出最高的性能，准确率为 0.84，AUC 为 0.72。逻辑回归（准确率：0.85，AUC：0.71）、k-最近邻（准确率：0.84，AUC：0.63）、决策树（准确率：0.84，AUC：0.61）、高斯朴素贝叶斯（准确率：0.84，AUC：0.66）和支持向量机分类器（准确率：0.84，AUC：0.67）也表现出了有价值的预测能力。

意义

本研究的结果对于在 COVID-19 等大流行期间有效分配医疗资源至关重要。通过利用机器学习技术和 EHR 数据，我们可以创建预测工具，根据患者的病史识别出症状更严重的患者。这里开发的模型为扩大医疗保健专业人员和组织可用的工具包提供了基础。此外，可解释的机器学习方法（如 Shapley Additive Explanations）有助于揭示用于指导医疗保健决策过程的数据特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a683/10588875/7a3e342030bb/pone.0292888.g001.jpg

相似文献

Machine learning-based prediction models for home discharge in patients with COVID-19: Development and evaluation using electronic health records.

PLoS One. 2023 Oct 20;18(10):e0292888. doi: 10.1371/journal.pone.0292888. eCollection 2023.

Machine learning algorithms for predicting COVID-19 mortality in Ethiopia.

BMC Public Health. 2024 Jun 28;24(1):1728. doi: 10.1186/s12889-024-19196-0.

A machine learning-based prediction model for postoperative delirium in cardiac valve surgery using electronic health records.

BMC Cardiovasc Disord. 2024 Jan 18;24(1):56. doi: 10.1186/s12872-024-03723-3.

A Machine Learning Approach for Mortality Prediction in COVID-19 Pneumonia: Development and Evaluation of the Piacenza Score.

J Med Internet Res. 2021 May 31;23(5):e29058. doi: 10.2196/29058.

Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?

Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.

A machine learning approach in a monocentric cohort for predicting primary refractory disease in Diffuse Large B-cell lymphoma patients.

PLoS One. 2024 Oct 1;19(10):e0311261. doi: 10.1371/journal.pone.0311261. eCollection 2024.

A Risk Prediction Model for Physical Restraints Among Older Chinese Adults in Long-term Care Facilities: Machine Learning Study.

J Med Internet Res. 2023 Apr 6;25:e43815. doi: 10.2196/43815.

Comparing machine learning algorithms to predict COVID‑19 mortality using a dataset including chest computed tomography severity score data.

Sci Rep. 2023 Jul 13;13(1):11343. doi: 10.1038/s41598-023-38133-6.

Joint modeling strategy for using electronic medical records data to build machine learning models: an example of intracerebral hemorrhage.

BMC Med Inform Decis Mak. 2022 Oct 25;22(1):278. doi: 10.1186/s12911-022-02018-x.

A new hybrid ensemble machine-learning model for severity risk assessment and post-COVID prediction system.

Math Biosci Eng. 2022 Apr 13;19(6):6102-6123. doi: 10.3934/mbe.2022285.

引用本文的文献

Synergistic patient factors are driving recent increased pediatric urgent care demand.

PLOS Digit Health. 2024 Aug 22;3(8):e0000572. doi: 10.1371/journal.pdig.0000572. eCollection 2024 Aug.

Use of machine learning to identify protective factors for death from COVID-19 in the ICU: a retrospective study.

PeerJ. 2024 Jun 12;12:e17428. doi: 10.7717/peerj.17428. eCollection 2024.

Prediction models for COVID-19 disease outcomes.

Emerg Microbes Infect. 2024 Dec;13(1):2361791. doi: 10.1080/22221751.2024.2361791. Epub 2024 Jun 14.

本文引用的文献

Correlation of the SpO/FiO (S/F) ratio and the PaO/FiO (P/F) ratio in patients with COVID-19 pneumonia.

Med Intensiva. 2022 Jul;46(7):408-410. doi: 10.1016/j.medin.2021.10.005. Epub 2021 Nov 18.

Association between Hypomagnesemia, COVID-19, Respiratory Tract and Lung Disease.

Open Respir Med J. 2021 Sep 17;15:43-45. doi: 10.2174/1874306402115010043. eCollection 2021.

Standardizing PaO2 for PaCO2 in P/F ratio predicts in-hospital mortality in acute respiratory failure due to Covid-19: A pilot prospective study.

Eur J Intern Med. 2021 Oct;92:48-54. doi: 10.1016/j.ejim.2021.06.002. Epub 2021 Jun 17.

Coronary heart disease and COVID-19: A meta-analysis.

Med Clin (Barc). 2021 Jun 11;156(11):547-554. doi: 10.1016/j.medcli.2020.12.017. Epub 2021 Jan 28.

Effect of Machine Learning on Dispatcher Recognition of Out-of-Hospital Cardiac Arrest During Calls to Emergency Medical Services: A Randomized Clinical Trial.

JAMA Netw Open. 2021 Jan 4;4(1):e2032320. doi: 10.1001/jamanetworkopen.2020.32320.

Role of Machine Learning Techniques to Tackle the COVID-19 Crisis: Systematic Review.

JMIR Med Inform. 2021 Jan 11;9(1):e23811. doi: 10.2196/23811.

Using machine-learning risk prediction models to triage the acuity of undifferentiated patients entering the emergency care system: a systematic review.

Diagn Progn Res. 2020 Oct 2;4:16. doi: 10.1186/s41512-020-00084-1. eCollection 2020.

The PANDEMYC Score. An Easily Applicable and Interpretable Model for Predicting Mortality Associated With COVID-19.

J Clin Med. 2020 Sep 23;9(10):3066. doi: 10.3390/jcm9103066.

Adult congenital heart disease and the COVID-19 pandemic.

Heart. 2020 Sep;106(17):1302-1309. doi: 10.1136/heartjnl-2020-317258. Epub 2020 Jun 10.

A predictive model for disease progression in non-severely ill patients with coronavirus disease 2019.

Eur Respir J. 2020 Jul 16;56(1). doi: 10.1183/13993003.01234-2020. Print 2020 Jul.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于机器学习的 COVID-19 患者出院预测模型：利用电子健康记录进行开发和评估。

Machine learning-based prediction models for home discharge in patients with COVID-19: Development and evaluation using electronic health records.

机构信息

Department of Health Outcomes and Biomedical Informatics, University of Florida College of Medicine, Gainesville, FL, United States of America.

Department of Pharmaceutical Outcomes and Policy, University of Florida College of Pharmacy, Gainesville, FL, United States of America.