基于全球患者数据对新冠肺炎死亡率进行可推广预测。

Generalizable prediction of COVID-19 mortality on worldwide patient data.

作者信息

Edelson Maxim, Kuo Tsung-Ting

机构信息

UCSD Department of Computer Science and Engineering, University of California San Diego, La Jolla, California, USA.

UCSD Health Department of Biomedical Informatics, University of California San Diego, La Jolla, California, USA.

出版信息

JAMIA Open. 2022 May 25;5(2):ooac036. doi: 10.1093/jamiaopen/ooac036. eCollection 2022 Jul.

DOI:10.1093/jamiaopen/ooac036

PMID:35663116

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9129227/

Abstract

OBJECTIVE

Predicting Coronavirus disease 2019 (COVID-19) mortality for patients is critical for early-stage care and intervention. Existing studies mainly built models on datasets with limited geographical range or size. In this study, we developed COVID-19 mortality prediction models on worldwide, large-scale "sparse" data and on a "dense" subset of the data.

MATERIALS AND METHODS

We evaluated 6 classifiers, including logistic regression (LR), support vector machine (SVM), random forest (RF), multilayer perceptron (MLP), AdaBoost (AB), and Naive Bayes (NB). We also conducted temporal analysis and calibrated our models using Isotonic Regression.

RESULTS

The results showed that AB outperformed the other classifiers for the sparse dataset, while LR provided the highest-performing results for the dense dataset (with area under the receiver operating characteristic curve, or AUC ≈ 0.7 for the sparse dataset and AUC = 0.963 for the dense one). We also identified impactful features such as symptoms, countries, age, and the date of death/discharge. All our models are well-calibrated ( > .1).

DISCUSSION

Our results highlight the tradeoff of using sparse training data to increase generalizability versus training on denser data, which produces higher discrimination results. We found that covariates such as patient information on symptoms, countries (where the case was reported), age, and the date of discharge from the hospital or death were the most important for mortality prediction.

CONCLUSION

This study is a stepping-stone towards improving healthcare quality during the COVID-19 era and potentially other pandemics. Our code is publicly available at: https://doi.org/10.5281/zenodo.6336231.

摘要

目的

预测2019冠状病毒病（COVID-19）患者的死亡率对于早期护理和干预至关重要。现有研究主要基于地理范围或规模有限的数据集构建模型。在本研究中，我们基于全球范围内的大规模“稀疏”数据以及该数据的“密集”子集开发了COVID-19死亡率预测模型。

材料与方法

我们评估了6种分类器，包括逻辑回归（LR）、支持向量机（SVM）、随机森林（RF）、多层感知器（MLP）、AdaBoost（AB）和朴素贝叶斯（NB）。我们还进行了时间分析，并使用保序回归对模型进行校准。

结果

结果表明，对于稀疏数据集，AB的表现优于其他分类器，而LR在密集数据集上提供了最高的性能结果（稀疏数据集的受试者工作特征曲线下面积，即AUC≈0.7，密集数据集的AUC = 0.963）。我们还确定了有影响的特征，如症状、国家、年龄以及死亡/出院日期。我们所有的模型校准良好（>.1）。

讨论

我们的结果突出了使用稀疏训练数据以提高泛化能力与在更密集的数据上进行训练之间的权衡，后者会产生更高的判别结果。我们发现，诸如患者症状信息、国家（病例报告地）、年龄以及出院或死亡日期等协变量对于死亡率预测最为重要。

结论

本研究是在COVID-19时代及可能的其他大流行期间提高医疗质量的一块垫脚石。我们的代码可在以下网址公开获取：https://doi.org/10.5281/zenodo.6336231。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd0f/9154018/3c6be47a4a57/ooac036f1.jpg

相似文献

Generalizable prediction of COVID-19 mortality on worldwide patient data.

JAMIA Open. 2022 May 25;5(2):ooac036. doi: 10.1093/jamiaopen/ooac036. eCollection 2022 Jul.

Artificial intelligence in clinical care amidst COVID-19 pandemic: A systematic review.

Comput Struct Biotechnol J. 2021;19:2833-2850. doi: 10.1016/j.csbj.2021.05.010. Epub 2021 May 7.

Machine learning algorithms for predicting COVID-19 mortality in Ethiopia.

BMC Public Health. 2024 Jun 28;24(1):1728. doi: 10.1186/s12889-024-19196-0.

Comparing machine learning algorithms to predict COVID‑19 mortality using a dataset including chest computed tomography severity score data.

Sci Rep. 2023 Jul 13;13(1):11343. doi: 10.1038/s41598-023-38133-6.

Seminal quality prediction using data mining methods.

Technol Health Care. 2014;22(4):531-45. doi: 10.3233/THC-140816.

Machine learning models for predicting critical illness risk in hospitalized patients with COVID-19 pneumonia.

J Thorac Dis. 2021 Feb;13(2):1215-1229. doi: 10.21037/jtd-20-2580.

A Hybrid Feature Selection Approach to Screen a Novel Set of Blood Biomarkers for Early COVID-19 Mortality Prediction.

Diagnostics (Basel). 2022 Jun 30;12(7):1604. doi: 10.3390/diagnostics12071604.

Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers.

Med Phys. 2018 Jul;45(7):3449-3459. doi: 10.1002/mp.12967. Epub 2018 Jun 13.

Combining handcrafted features with latent variables in machine learning for prediction of radiation-induced lung damage.

Med Phys. 2019 May;46(5):2497-2511. doi: 10.1002/mp.13497. Epub 2019 Apr 8.

Application of machine learning model in predicting the likelihood of blood transfusion after hip fracture surgery.

Aging Clin Exp Res. 2023 Nov;35(11):2643-2656. doi: 10.1007/s40520-023-02550-4. Epub 2023 Sep 21.

引用本文的文献

False-positive tolerant model misconduct mitigation in distributed federated learning on electronic health record data across clinical institutions.

Sci Rep. 2025 Jul 2;15(1):23310. doi: 10.1038/s41598-025-04069-2.

Predicting positive test results using large-scale longitudinal data of demographics and medication history.

Heliyon. 2024 Dec 18;11(1):e41350. doi: 10.1016/j.heliyon.2024.e41350. eCollection 2025 Jan 15.

Early prediction of long hospital stay for Intensive Care units readmission patients using medication information.

Comput Biol Med. 2024 May;174:108451. doi: 10.1016/j.compbiomed.2024.108451. Epub 2024 Apr 8.

The relationship between depressive symptoms and sleep quality in medical staff after their infection with COVID-19.

Front Psychiatry. 2023 Nov 30;14:1269402. doi: 10.3389/fpsyt.2023.1269402. eCollection 2023.

Blockchain-enabled immutable, distributed, and highly available clinical research activity logging system for federated COVID-19 data analysis from multiple institutions.

J Am Med Inform Assoc. 2023 May 19;30(6):1167-1178. doi: 10.1093/jamia/ocad049.

Predicting COVID-19 county-level case number trend by combining demographic characteristics and social distancing policies.

JAMIA Open. 2022 Jun 25;5(3):ooac056. doi: 10.1093/jamiaopen/ooac056. eCollection 2022 Oct.

本文引用的文献

The AIFELL Score as a Predictor of Coronavirus Disease 2019 (COVID-19) Severity and Progression in Hospitalized Patients.

Diagnostics (Basel). 2022 Feb 27;12(3):604. doi: 10.3390/diagnostics12030604.

Patient outcomes following transfer between intensive care units during the COVID-19 pandemic.

Anaesthesia. 2022 Apr;77(4):398-404. doi: 10.1111/anae.15680. Epub 2022 Feb 28.

Symptom Prediction and Mortality Risk Calculation for COVID-19 Using Machine Learning.

Front Artif Intell. 2021 Jun 22;4:673527. doi: 10.3389/frai.2021.673527. eCollection 2021.

Converting tabular data into images for deep learning with convolutional neural networks.

Sci Rep. 2021 May 31;11(1):11325. doi: 10.1038/s41598-021-90923-y.

COVID-19 in Adults With Congenital Heart Disease.

J Am Coll Cardiol. 2021 Apr 6;77(13):1644-1655. doi: 10.1016/j.jacc.2021.02.023.

Predicting mortality risk in patients with COVID-19 using machine learning to help medical decision-making.

Smart Health (Amst). 2021 Apr;20:100178. doi: 10.1016/j.smhl.2020.100178. Epub 2021 Jan 16.

Risk Factors Associated with Mortality Among Patients with Novel Coronavirus Disease (COVID-19) in Africa.

J Racial Ethn Health Disparities. 2021 Oct;8(5):1267-1272. doi: 10.1007/s40615-020-00888-3. Epub 2020 Oct 13.

Common cardiovascular risk factors and in-hospital mortality in 3,894 patients with COVID-19: survival analysis and machine learning-based findings from the multicentre Italian CORIST Study.

Nutr Metab Cardiovasc Dis. 2020 Oct 30;30(11):1899-1913. doi: 10.1016/j.numecd.2020.07.031. Epub 2020 Jul 31.

Risk stratification of patients admitted to hospital with covid-19 using the ISARIC WHO Clinical Characterisation Protocol: development and validation of the 4C Mortality Score.

BMJ. 2020 Sep 9;370:m3339. doi: 10.1136/bmj.m3339.

Risk factors for mortality among COVID-19 patients.

Diabetes Res Clin Pract. 2020 Aug;166:108293. doi: 10.1016/j.diabres.2020.108293. Epub 2020 Jul 3.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于全球患者数据对新冠肺炎死亡率进行可推广预测。

Generalizable prediction of COVID-19 mortality on worldwide patient data.

作者信息

机构信息

出版信息

OBJECTIVE

MATERIALS AND METHODS

RESULTS

DISCUSSION

CONCLUSION

目的

材料与方法

结果

讨论

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献