• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于决策树的机器学习模型在伊朗哈马丹 COVID-19 患者死亡率影响因素分类中的应用。

Application of machine learning models based on decision trees in classifying the factors affecting mortality of COVID-19 patients in Hamadan, Iran.

机构信息

Department of Biostatistics, School of Public Health, Hamadan University of Medical Sciences, Hamadan, Iran.

Modeling of Noncommunicable Diseases Research Center, School of Public Health, Hamadan University of Medical Sciences, Street of Shahid Fahmideh, P.O. BOX: 6517838736, Hamadan, Iran.

出版信息

BMC Med Inform Decis Mak. 2022 Jul 24;22(1):192. doi: 10.1186/s12911-022-01939-x.

DOI:10.1186/s12911-022-01939-x
PMID:35871639
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9308952/
Abstract

BACKGROUND

Due to the high mortality of COVID-19 patients, the use of a high-precision classification model of patient's mortality that is also interpretable, could help reduce mortality and take appropriate action urgently. In this study, the random forest method was used to select the effective features in COVID-19 mortality and the classification was performed using logistic model tree (LMT), classification and regression tree (CART), C4.5, and C5.0 tree based on important features.

METHODS

In this retrospective study, the data of 2470 COVID-19 patients admitted to hospitals in Hamadan, west Iran, were used, of which 75.02% recovered and 24.98% died. To classify, at first among the 25 demographic, clinical, and laboratory findings, features with a relative importance more than 6% were selected by random forest. Then LMT, C4.5, C5.0, and CART trees were developed and the accuracy of classification performance was evaluated with recall, accuracy, and F1-score criteria for training, test, and total datasets. At last, the best tree was developed and the receiver operating characteristic curve and area under the curve (AUC) value were reported.

RESULTS

The results of this study showed that among demographic and clinical features gender and age, and among laboratory findings blood urea nitrogen, partial thromboplastin time, serum glutamic-oxaloacetic transaminase, and erythrocyte sedimentation rate had more than 6% relative importance. Developing the trees using the above features revealed that the CART with the values of F1-score, Accuracy, and Recall, 0.8681, 0.7824, and 0.955, respectively, for the test dataset and 0.8667, 0.7834, and 0.9385, respectively, for the total dataset had the best performance. The AUC value obtained for the CART was 79.5%.

CONCLUSIONS

Finding a highly accurate and qualified model for interpreting the classification of a response that is considered clinically consequential is critical at all stages, including treatment and immediate decision making. In this study, the CART with its high accuracy for diagnosing and classifying mortality of COVID-19 patients as well as prioritizing important demographic, clinical, and laboratory findings in an interpretable format, risk factors for prognosis of COVID-19 patients mortality identify and enable immediate and appropriate decisions for health professionals and physicians.

摘要

背景

由于 COVID-19 患者的死亡率很高,因此使用高精度的患者死亡率分类模型,且该模型还具有可解释性,这有助于降低死亡率并紧急采取适当措施。在这项研究中,使用随机森林方法选择 COVID-19 死亡率的有效特征,并使用逻辑模型树(LMT)、分类回归树(CART)、C4.5 和 C5.0 树基于重要特征进行分类。

方法

在这项回顾性研究中,使用了来自伊朗西部哈马丹医院的 2470 名 COVID-19 患者的数据,其中 75.02%的患者康复,24.98%的患者死亡。为了进行分类,首先在 25 项人口统计学、临床和实验室发现中,随机森林选择了相对重要性超过 6%的特征。然后,开发了 LMT、C4.5、C5.0 和 CART 树,并使用召回率、准确性和 F1 评分标准评估了训练、测试和总数据集的分类性能。最后,开发了最佳树,并报告了接收者操作特征曲线和曲线下面积(AUC)值。

结果

这项研究的结果表明,在人口统计学和临床特征中,性别和年龄,以及在实验室发现中,血尿素氮、部分凝血活酶时间、血清谷氨酸-草酰乙酸转氨酶和红细胞沉降率具有超过 6%的相对重要性。使用上述特征开发的树表明,CART 的 F1 评分、准确性和召回率在测试数据集分别为 0.8681、0.7824 和 0.955,在总数据集分别为 0.8667、0.7834 和 0.9385,表现最佳。CART 的 AUC 值为 79.5%。

结论

在包括治疗和立即决策在内的所有阶段,找到一个高度准确和合格的模型来解释被认为具有临床意义的反应分类是至关重要的。在这项研究中,CART 能够以可解释的格式准确诊断和分类 COVID-19 患者的死亡率,并优先考虑重要的人口统计学、临床和实验室发现,确定 COVID-19 患者死亡率的预后风险因素,并为卫生专业人员和医生提供即时和适当的决策。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b366/9310461/dd4703814ab9/12911_2022_1939_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b366/9310461/33bd5a43511b/12911_2022_1939_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b366/9310461/1ca9f2c6e7dc/12911_2022_1939_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b366/9310461/f1da6d0982ee/12911_2022_1939_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b366/9310461/a2645edd6c89/12911_2022_1939_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b366/9310461/dd4703814ab9/12911_2022_1939_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b366/9310461/33bd5a43511b/12911_2022_1939_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b366/9310461/1ca9f2c6e7dc/12911_2022_1939_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b366/9310461/f1da6d0982ee/12911_2022_1939_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b366/9310461/a2645edd6c89/12911_2022_1939_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b366/9310461/dd4703814ab9/12911_2022_1939_Fig5_HTML.jpg

相似文献

1
Application of machine learning models based on decision trees in classifying the factors affecting mortality of COVID-19 patients in Hamadan, Iran.基于决策树的机器学习模型在伊朗哈马丹 COVID-19 患者死亡率影响因素分类中的应用。
BMC Med Inform Decis Mak. 2022 Jul 24;22(1):192. doi: 10.1186/s12911-022-01939-x.
2
Interpretable generalized neural additive models for mortality prediction of COVID-19 hospitalized patients in Hamadan, Iran.伊朗哈马丹 COVID-19 住院患者死亡率预测的可解释广义神经加法模型。
BMC Med Res Methodol. 2022 Dec 31;22(1):339. doi: 10.1186/s12874-022-01827-y.
3
[Constructing a predictive model for the death risk of patients with septic shock based on supervised machine learning algorithms].基于监督机器学习算法构建脓毒症休克患者死亡风险预测模型
Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2024 Apr;36(4):345-352. doi: 10.3760/cma.j.cn121430-20230930-00832.
4
[Construction of a predictive model for in-hospital mortality of sepsis patients in intensive care unit based on machine learning].基于机器学习构建重症监护病房脓毒症患者院内死亡率预测模型
Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2023 Jul;35(7):696-701. doi: 10.3760/cma.j.cn121430-20221219-01104.
5
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
6
A new hybrid ensemble machine-learning model for severity risk assessment and post-COVID prediction system.一种新的混合集成机器学习模型,用于严重程度风险评估和 COVID 后预测系统。
Math Biosci Eng. 2022 Apr 13;19(6):6102-6123. doi: 10.3934/mbe.2022285.
7
Exploratory Data Mining Techniques (Decision Tree Models) for Examining the Impact of Internet-Based Cognitive Behavioral Therapy for Tinnitus: Machine Learning Approach.探索性数据挖掘技术(决策树模型)在研究基于互联网的认知行为疗法对耳鸣影响中的应用:机器学习方法。
J Med Internet Res. 2021 Nov 2;23(11):e28999. doi: 10.2196/28999.
8
GIS-based groundwater potential mapping using boosted regression tree, classification and regression tree, and random forest machine learning models in Iran.基于地理信息系统(GIS),运用提升回归树、分类与回归树以及随机森林机器学习模型在伊朗进行地下水潜力制图。
Environ Monit Assess. 2016 Jan;188(1):44. doi: 10.1007/s10661-015-5049-6. Epub 2015 Dec 19.
9
Machine learning algorithms for predicting COVID-19 mortality in Ethiopia.用于预测埃塞俄比亚 COVID-19 死亡率的机器学习算法。
BMC Public Health. 2024 Jun 28;24(1):1728. doi: 10.1186/s12889-024-19196-0.
10

引用本文的文献

1
Predicting Risk for Patent Ductus Arteriosus in the Neonate: A Machine Learning Analysis.预测新生儿动脉导管未闭的风险:一项机器学习分析。
Medicina (Kaunas). 2025 Mar 26;61(4):603. doi: 10.3390/medicina61040603.
2
Prediction of primary Hypertension in Primary Health Care Settings in Coastal Karnataka Using Artificial Neural Network.使用人工神经网络预测卡纳塔克邦沿海地区初级卫生保健机构中的原发性高血压
Curr Hypertens Rev. 2025;21(2):82-93. doi: 10.2174/0115734021329874250222053144.
3
In vivo electrophysiology recordings and computational modeling can predict octopus arm movement.

本文引用的文献

1
Predicting the COVID-19 Patients Status Using Chest CT Scan Findings: A Risk Assessment Model Based on Decision Tree Analysis.基于决策树分析的胸部 CT 扫描结果预测 COVID-19 患者病情:风险评估模型。
Adv Exp Med Biol. 2023;1412:237-250. doi: 10.1007/978-3-031-28012-2_13.
2
Severity and mortality prediction models to triage Indian COVID-19 patients.用于对印度新冠肺炎患者进行分诊的严重程度和死亡率预测模型。
PLOS Digit Health. 2022 Mar 9;1(3):e0000020. doi: 10.1371/journal.pdig.0000020. eCollection 2022 Mar.
3
COVID-19 pneumonia level detection using deep learning algorithm and transfer learning.
体内电生理记录和计算模型可以预测章鱼手臂的运动。
Bioelectron Med. 2025 Feb 14;11(1):4. doi: 10.1186/s42234-025-00166-9.
4
A Systematic Review of the Outcomes of Utilization of Artificial Intelligence Within the Healthcare Systems of the Middle East: A Thematic Analysis of Findings.中东医疗系统中人工智能应用成果的系统评价:研究结果的主题分析
Health Sci Rep. 2024 Dec 24;7(12):e70300. doi: 10.1002/hsr2.70300. eCollection 2024 Dec.
5
Single unit electrophysiology recordings and computational modeling can predict octopus arm movement.单细胞电生理记录和计算模型可以预测章鱼手臂的运动。
bioRxiv. 2024 Sep 19:2024.09.13.612676. doi: 10.1101/2024.09.13.612676.
6
Machine learning-based evaluation of prognostic factors for mortality and relapse in patients with acute lymphoblastic leukemia: a comparative simulation study.基于机器学习的急性淋巴细胞白血病患者死亡率和复发率预后因素评估:一项比较模拟研究。
BMC Med Inform Decis Mak. 2024 Sep 16;24(1):261. doi: 10.1186/s12911-024-02645-6.
7
Breaking barriers: a statistical and machine learning-based hybrid system for predicting dementia.突破障碍:一种基于统计和机器学习的痴呆预测混合系统。
Front Bioeng Biotechnol. 2024 Jan 8;11:1336255. doi: 10.3389/fbioe.2023.1336255. eCollection 2023.
8
Environmental and geographical factors influence the occurrence and abundance of the southern house mosquito, Culex quinquefasciatus, in Hawai'i.环境和地理因素影响夏威夷南部家蚊(Culex quinquefasciatus)的发生和丰度。
Sci Rep. 2024 Jan 5;14(1):604. doi: 10.1038/s41598-023-49793-9.
9
Machine Learning and COVID-19: Lessons from SARS-CoV-2.机器学习与 COVID-19:SARS-CoV-2 的经验教训。
Adv Exp Med Biol. 2023;1412:311-335. doi: 10.1007/978-3-031-28012-2_17.
10
Complete Breast Cancer Detection and Monitoring System by Using Microwave Textile Based Antenna Sensors.利用微波纺织基天线传感器实现完整的乳腺癌检测和监测系统。
Biosensors (Basel). 2023 Jan 4;13(1):87. doi: 10.3390/bios13010087.
使用深度学习算法和迁移学习进行新冠肺炎肺炎水平检测。
Evol Intell. 2022 Sep 10:1-12. doi: 10.1007/s12065-022-00777-0.
4
Case fatality and mortality rates, socio-demographic profile, and clinical features of COVID-19 in the elderly population: A population-based registry study in Iran.老年人中 COVID-19 的病死率和死亡率、社会人口统计学特征和临床特征:伊朗一项基于人群的登记研究。
J Med Virol. 2022 May;94(5):2126-2132. doi: 10.1002/jmv.27594. Epub 2022 Jan 28.
5
Comparing machine learning algorithms for predicting COVID-19 mortality.比较用于预测 COVID-19 死亡率的机器学习算法。
BMC Med Inform Decis Mak. 2022 Jan 4;22(1):2. doi: 10.1186/s12911-021-01742-0.
6
Prediction of global spread of COVID-19 pandemic: a review and research challenges.2019年冠状病毒病大流行全球传播的预测:综述与研究挑战
Artif Intell Rev. 2022;55(3):1607-1628. doi: 10.1007/s10462-021-09988-w. Epub 2021 Jul 16.
7
Sociodemographic determinants and clinical risk factors associated with COVID-19 severity: a cross-sectional analysis of over 200,000 patients in Tehran, Iran.与 COVID-19 严重程度相关的社会人口学决定因素和临床危险因素:对伊朗德黑兰 20 多万患者的横断面分析。
BMC Infect Dis. 2021 May 25;21(1):474. doi: 10.1186/s12879-021-06179-4.
8
The Role of Immunological and Clinical Biomarkers to Predict Clinical COVID-19 Severity and Response to Therapy-A Prospective Longitudinal Study.免疫和临床生物标志物在预测 COVID-19 临床严重程度和治疗反应中的作用:一项前瞻性纵向研究。
Front Immunol. 2021 Mar 17;12:646095. doi: 10.3389/fimmu.2021.646095. eCollection 2021.
9
Explaining machine learning based diagnosis of COVID-19 from routine blood tests with decision trees and criteria graphs.用决策树和准则图解释基于常规血液检测的 COVID-19 机器学习诊断。
Comput Biol Med. 2021 May;132:104335. doi: 10.1016/j.compbiomed.2021.104335. Epub 2021 Mar 16.
10
Role of hematological parameters in the stratification of COVID-19 disease severity.血液学参数在新冠病毒疾病严重程度分层中的作用。
Ann Med Surg (Lond). 2021 Feb;62:68-72. doi: 10.1016/j.amsu.2020.12.035. Epub 2021 Jan 8.