基于决策树的机器学习模型在伊朗哈马丹 COVID-19 患者死亡率影响因素分类中的应用。

Application of machine learning models based on decision trees in classifying the factors affecting mortality of COVID-19 patients in Hamadan, Iran.

机构信息

Department of Biostatistics, School of Public Health, Hamadan University of Medical Sciences, Hamadan, Iran.

Modeling of Noncommunicable Diseases Research Center, School of Public Health, Hamadan University of Medical Sciences, Street of Shahid Fahmideh, P.O. BOX: 6517838736, Hamadan, Iran.

出版信息

BMC Med Inform Decis Mak. 2022 Jul 24;22(1):192. doi: 10.1186/s12911-022-01939-x.

DOI:10.1186/s12911-022-01939-x

PMID:35871639

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9308952/

Abstract

BACKGROUND

Due to the high mortality of COVID-19 patients, the use of a high-precision classification model of patient's mortality that is also interpretable, could help reduce mortality and take appropriate action urgently. In this study, the random forest method was used to select the effective features in COVID-19 mortality and the classification was performed using logistic model tree (LMT), classification and regression tree (CART), C4.5, and C5.0 tree based on important features.

METHODS

In this retrospective study, the data of 2470 COVID-19 patients admitted to hospitals in Hamadan, west Iran, were used, of which 75.02% recovered and 24.98% died. To classify, at first among the 25 demographic, clinical, and laboratory findings, features with a relative importance more than 6% were selected by random forest. Then LMT, C4.5, C5.0, and CART trees were developed and the accuracy of classification performance was evaluated with recall, accuracy, and F1-score criteria for training, test, and total datasets. At last, the best tree was developed and the receiver operating characteristic curve and area under the curve (AUC) value were reported.

RESULTS

The results of this study showed that among demographic and clinical features gender and age, and among laboratory findings blood urea nitrogen, partial thromboplastin time, serum glutamic-oxaloacetic transaminase, and erythrocyte sedimentation rate had more than 6% relative importance. Developing the trees using the above features revealed that the CART with the values of F1-score, Accuracy, and Recall, 0.8681, 0.7824, and 0.955, respectively, for the test dataset and 0.8667, 0.7834, and 0.9385, respectively, for the total dataset had the best performance. The AUC value obtained for the CART was 79.5%.

CONCLUSIONS

Finding a highly accurate and qualified model for interpreting the classification of a response that is considered clinically consequential is critical at all stages, including treatment and immediate decision making. In this study, the CART with its high accuracy for diagnosing and classifying mortality of COVID-19 patients as well as prioritizing important demographic, clinical, and laboratory findings in an interpretable format, risk factors for prognosis of COVID-19 patients mortality identify and enable immediate and appropriate decisions for health professionals and physicians.

摘要

背景

由于 COVID-19 患者的死亡率很高，因此使用高精度的患者死亡率分类模型，且该模型还具有可解释性，这有助于降低死亡率并紧急采取适当措施。在这项研究中，使用随机森林方法选择 COVID-19 死亡率的有效特征，并使用逻辑模型树（LMT）、分类回归树（CART）、C4.5 和 C5.0 树基于重要特征进行分类。

方法

在这项回顾性研究中，使用了来自伊朗西部哈马丹医院的 2470 名 COVID-19 患者的数据，其中 75.02%的患者康复，24.98%的患者死亡。为了进行分类，首先在 25 项人口统计学、临床和实验室发现中，随机森林选择了相对重要性超过 6%的特征。然后，开发了 LMT、C4.5、C5.0 和 CART 树，并使用召回率、准确性和 F1 评分标准评估了训练、测试和总数据集的分类性能。最后，开发了最佳树，并报告了接收者操作特征曲线和曲线下面积（AUC）值。

结果

这项研究的结果表明，在人口统计学和临床特征中，性别和年龄，以及在实验室发现中，血尿素氮、部分凝血活酶时间、血清谷氨酸-草酰乙酸转氨酶和红细胞沉降率具有超过 6%的相对重要性。使用上述特征开发的树表明，CART 的 F1 评分、准确性和召回率在测试数据集分别为 0.8681、0.7824 和 0.955，在总数据集分别为 0.8667、0.7834 和 0.9385，表现最佳。CART 的 AUC 值为 79.5%。

结论

在包括治疗和立即决策在内的所有阶段，找到一个高度准确和合格的模型来解释被认为具有临床意义的反应分类是至关重要的。在这项研究中，CART 能够以可解释的格式准确诊断和分类 COVID-19 患者的死亡率，并优先考虑重要的人口统计学、临床和实验室发现，确定 COVID-19 患者死亡率的预后风险因素，并为卫生专业人员和医生提供即时和适当的决策。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b366/9310461/33bd5a43511b/12911_2022_1939_Fig1_HTML.jpg

相似文献

Application of machine learning models based on decision trees in classifying the factors affecting mortality of COVID-19 patients in Hamadan, Iran.

BMC Med Inform Decis Mak. 2022 Jul 24;22(1):192. doi: 10.1186/s12911-022-01939-x.

Interpretable generalized neural additive models for mortality prediction of COVID-19 hospitalized patients in Hamadan, Iran.

BMC Med Res Methodol. 2022 Dec 31;22(1):339. doi: 10.1186/s12874-022-01827-y.

[Constructing a predictive model for the death risk of patients with septic shock based on supervised machine learning algorithms].

Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2024 Apr;36(4):345-352. doi: 10.3760/cma.j.cn121430-20230930-00832.

[Construction of a predictive model for in-hospital mortality of sepsis patients in intensive care unit based on machine learning].

Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2023 Jul;35(7):696-701. doi: 10.3760/cma.j.cn121430-20221219-01104.

Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?

Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.

A new hybrid ensemble machine-learning model for severity risk assessment and post-COVID prediction system.

Math Biosci Eng. 2022 Apr 13;19(6):6102-6123. doi: 10.3934/mbe.2022285.

Exploratory Data Mining Techniques (Decision Tree Models) for Examining the Impact of Internet-Based Cognitive Behavioral Therapy for Tinnitus: Machine Learning Approach.

J Med Internet Res. 2021 Nov 2;23(11):e28999. doi: 10.2196/28999.

GIS-based groundwater potential mapping using boosted regression tree, classification and regression tree, and random forest machine learning models in Iran.

Environ Monit Assess. 2016 Jan;188(1):44. doi: 10.1007/s10661-015-5049-6. Epub 2015 Dec 19.

Machine learning algorithms for predicting COVID-19 mortality in Ethiopia.

BMC Public Health. 2024 Jun 28;24(1):1728. doi: 10.1186/s12889-024-19196-0.

引用本文的文献

Predicting Risk for Patent Ductus Arteriosus in the Neonate: A Machine Learning Analysis.

Medicina (Kaunas). 2025 Mar 26;61(4):603. doi: 10.3390/medicina61040603.

Prediction of primary Hypertension in Primary Health Care Settings in Coastal Karnataka Using Artificial Neural Network.

Curr Hypertens Rev. 2025;21(2):82-93. doi: 10.2174/0115734021329874250222053144.

In vivo electrophysiology recordings and computational modeling can predict octopus arm movement.

Bioelectron Med. 2025 Feb 14;11(1):4. doi: 10.1186/s42234-025-00166-9.

A Systematic Review of the Outcomes of Utilization of Artificial Intelligence Within the Healthcare Systems of the Middle East: A Thematic Analysis of Findings.

Health Sci Rep. 2024 Dec 24;7(12):e70300. doi: 10.1002/hsr2.70300. eCollection 2024 Dec.

Single unit electrophysiology recordings and computational modeling can predict octopus arm movement.

bioRxiv. 2024 Sep 19:2024.09.13.612676. doi: 10.1101/2024.09.13.612676.

Machine learning-based evaluation of prognostic factors for mortality and relapse in patients with acute lymphoblastic leukemia: a comparative simulation study.

BMC Med Inform Decis Mak. 2024 Sep 16;24(1):261. doi: 10.1186/s12911-024-02645-6.

Breaking barriers: a statistical and machine learning-based hybrid system for predicting dementia.

Front Bioeng Biotechnol. 2024 Jan 8;11:1336255. doi: 10.3389/fbioe.2023.1336255. eCollection 2023.

Environmental and geographical factors influence the occurrence and abundance of the southern house mosquito, Culex quinquefasciatus, in Hawai'i.

Sci Rep. 2024 Jan 5;14(1):604. doi: 10.1038/s41598-023-49793-9.

Machine Learning and COVID-19: Lessons from SARS-CoV-2.

Adv Exp Med Biol. 2023;1412:311-335. doi: 10.1007/978-3-031-28012-2_17.

Complete Breast Cancer Detection and Monitoring System by Using Microwave Textile Based Antenna Sensors.

Biosensors (Basel). 2023 Jan 4;13(1):87. doi: 10.3390/bios13010087.

本文引用的文献

Predicting the COVID-19 Patients Status Using Chest CT Scan Findings: A Risk Assessment Model Based on Decision Tree Analysis.

Adv Exp Med Biol. 2023;1412:237-250. doi: 10.1007/978-3-031-28012-2_13.

Severity and mortality prediction models to triage Indian COVID-19 patients.

PLOS Digit Health. 2022 Mar 9;1(3):e0000020. doi: 10.1371/journal.pdig.0000020. eCollection 2022 Mar.

COVID-19 pneumonia level detection using deep learning algorithm and transfer learning.

Evol Intell. 2022 Sep 10:1-12. doi: 10.1007/s12065-022-00777-0.

Case fatality and mortality rates, socio-demographic profile, and clinical features of COVID-19 in the elderly population: A population-based registry study in Iran.

J Med Virol. 2022 May;94(5):2126-2132. doi: 10.1002/jmv.27594. Epub 2022 Jan 28.

Comparing machine learning algorithms for predicting COVID-19 mortality.

BMC Med Inform Decis Mak. 2022 Jan 4;22(1):2. doi: 10.1186/s12911-021-01742-0.

Prediction of global spread of COVID-19 pandemic: a review and research challenges.

Artif Intell Rev. 2022;55(3):1607-1628. doi: 10.1007/s10462-021-09988-w. Epub 2021 Jul 16.

Sociodemographic determinants and clinical risk factors associated with COVID-19 severity: a cross-sectional analysis of over 200,000 patients in Tehran, Iran.

BMC Infect Dis. 2021 May 25;21(1):474. doi: 10.1186/s12879-021-06179-4.

The Role of Immunological and Clinical Biomarkers to Predict Clinical COVID-19 Severity and Response to Therapy-A Prospective Longitudinal Study.

Front Immunol. 2021 Mar 17;12:646095. doi: 10.3389/fimmu.2021.646095. eCollection 2021.

Explaining machine learning based diagnosis of COVID-19 from routine blood tests with decision trees and criteria graphs.

Comput Biol Med. 2021 May;132:104335. doi: 10.1016/j.compbiomed.2021.104335. Epub 2021 Mar 16.

Role of hematological parameters in the stratification of COVID-19 disease severity.

Ann Med Surg (Lond). 2021 Feb;62:68-72. doi: 10.1016/j.amsu.2020.12.035. Epub 2021 Jan 8.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于决策树的机器学习模型在伊朗哈马丹 COVID-19 患者死亡率影响因素分类中的应用。

Application of machine learning models based on decision trees in classifying the factors affecting mortality of COVID-19 patients in Hamadan, Iran.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献