• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于决策树、AdaBoost和支持向量机三种数据挖掘模型,与COVID-19疾病死亡相关的最重要变量:一项横断面研究。

The most important variables associated with death due to COVID-19 disease, based on three data mining models Decision Tree, AdaBoost, and Support Vector Machine: A cross-sectional study.

作者信息

Gharehhasani Bita Shokri, Rezaei Mansour, Naghipour Armin, Sayad Nazanine, Mostafaei Shayan, Alimohammadi Ehsan

机构信息

Taleghani Hospital Kermanshah University of Medical Sciences Kermanshah Iran.

Social Development and Health Promotion Research Center Kermanshah University of Medical Sciences Kermanshah Iran.

出版信息

Health Sci Rep. 2024 Jul 25;7(7):e2266. doi: 10.1002/hsr2.2266. eCollection 2024 Jul.

DOI:10.1002/hsr2.2266
PMID:39055612
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11269761/
Abstract

INTRODUCTION

Death due to covid-19 is one of the biggest health challenges in the world. There are many models that can predict death due to COVID-19. This study aimed to fit and compare Decision Tree (DT), Support Vector Machine (SVM), and AdaBoost models to predict death due to COVID-19.

METHODS

To describe the variables, mean (SD) and frequency (%) were reported. To determine the relationship between the variables and the death caused by COVID-19, chi-square test was performed with a significance level of 0.05. To compare DT, SVM and AdaBoost models for predicting death due to COVID-19 from sensitivity, specificity, accuracy and the area under the rock curve under R software using psych, caTools, random over-sampling examples, rpart, rpartplot packages was done.

RESULTS

Out of the total of 23,054 patients studied, 10,935 cases (46.5%) were women, and 12,569 cases (53.5%) were men. Additionally, the mean age of the patients was 54.9 ± 21.0 years. There is a statistically significant relationship between gender, fever, cough, muscle pain, smell and taste, abdominal pain, nausea and vomiting, diarrhea, anorexia, dizziness, chest pain, intubation, cancer, diabetes, chronic blood disease, Violation of immunity, pregnancy, Dialysis, chronic lung disease with the death of covid-19 patients showed ( < 0.05). The results showed that the sensitivity, specificity, accuracy and the area under the receiver operating characteristic curve were respectively 0.60, 0.68, 0.71, and 0.75 in the DT model, 0.54, 0.62, 0.63, and 0.71 in the SVM model, and 0.59, 0.65, 0.69 and 0.74 in the AdaBoost model.

CONCLUSION

The results showed that DT had a high predictive power compared to other data mining models. Therefore, it is suggested to researchers in different fields to use DT to predict the studied variables. Also, it is suggested to use other approaches such as random forest or XGBoost to improve the accuracy in future studies.

摘要

引言

新冠病毒肺炎(COVID-19)导致的死亡是全球最大的健康挑战之一。有许多模型可用于预测COVID-19导致的死亡。本研究旨在拟合和比较决策树(DT)、支持向量机(SVM)和AdaBoost模型来预测COVID-19导致的死亡。

方法

为描述变量,报告了均值(标准差)和频率(%)。为确定变量与COVID-19导致的死亡之间的关系,进行了显著性水平为0.05的卡方检验。为在R软件中使用psych、caTools、随机过采样示例、rpart、rpartplot包,从敏感性、特异性、准确性和roc曲线下面积方面比较DT、SVM和AdaBoost模型对COVID-19导致的死亡进行预测。

结果

在总共研究的23054例患者中,10935例(46.5%)为女性,12569例(53.5%)为男性。此外,患者的平均年龄为54.9±21.0岁。性别、发热、咳嗽、肌肉疼痛、嗅觉和味觉、腹痛、恶心和呕吐、腹泻、厌食、头晕、胸痛、插管、癌症、糖尿病、慢性血液疾病、免疫功能受损、妊娠、透析、慢性肺病与COVID-19患者死亡之间存在统计学显著关系(P<0.05)。结果显示,DT模型的敏感性、特异性、准确性和受试者工作特征曲线下面积分别为0.60、0.68、0.71和0.75,SVM模型分别为0.54、0.62、0.63和0.71,AdaBoost模型分别为0.59、0.65、0.69和0.74。

结论

结果表明,与其他数据挖掘模型相比,DT具有较高的预测能力。因此,建议不同领域的研究人员使用DT来预测所研究的变量。此外,建议在未来研究中使用其他方法,如随机森林或XGBoost来提高准确性。

相似文献

1
The most important variables associated with death due to COVID-19 disease, based on three data mining models Decision Tree, AdaBoost, and Support Vector Machine: A cross-sectional study.基于决策树、AdaBoost和支持向量机三种数据挖掘模型,与COVID-19疾病死亡相关的最重要变量:一项横断面研究。
Health Sci Rep. 2024 Jul 25;7(7):e2266. doi: 10.1002/hsr2.2266. eCollection 2024 Jul.
2
Exploratory Data Mining Techniques (Decision Tree Models) for Examining the Impact of Internet-Based Cognitive Behavioral Therapy for Tinnitus: Machine Learning Approach.探索性数据挖掘技术(决策树模型)在研究基于互联网的认知行为疗法对耳鸣影响中的应用:机器学习方法。
J Med Internet Res. 2021 Nov 2;23(11):e28999. doi: 10.2196/28999.
3
Machine learning-based COVID-19 diagnosis by demographic characteristics and clinical data.基于人口统计学特征和临床数据的机器学习辅助COVID-19诊断
Adv Respir Med. 2022 Feb 1. doi: 10.5603/ARM.a2022.0021.
4
Seminal quality prediction using data mining methods.使用数据挖掘方法进行精液质量预测。
Technol Health Care. 2014;22(4):531-45. doi: 10.3233/THC-140816.
5
Derivation and validation of different machine-learning models in mortality prediction of trauma in motorcycle riders: a cross-sectional retrospective study in southern Taiwan.不同机器学习模型在摩托车骑士创伤死亡率预测中的推导与验证:台湾南部的一项横断面回顾性研究
BMJ Open. 2018 Jan 5;8(1):e018252. doi: 10.1136/bmjopen-2017-018252.
6
Predicting the postoperative blood coagulation state of children with congenital heart disease by machine learning based on real-world data.基于真实世界数据,通过机器学习预测先天性心脏病患儿术后凝血状态。
Transl Pediatr. 2021 Jan;10(1):33-43. doi: 10.21037/tp-20-238.
7
[Constructing a predictive model for the death risk of patients with septic shock based on supervised machine learning algorithms].基于监督机器学习算法构建脓毒症休克患者死亡风险预测模型
Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2024 Apr;36(4):345-352. doi: 10.3760/cma.j.cn121430-20230930-00832.
8
Machine learning models for predicting critical illness risk in hospitalized patients with COVID-19 pneumonia.用于预测新冠肺炎肺炎住院患者危重症风险的机器学习模型
J Thorac Dis. 2021 Feb;13(2):1215-1229. doi: 10.21037/jtd-20-2580.
9
Predictive model and risk analysis for peripheral vascular disease in type 2 diabetes mellitus patients using machine learning and shapley additive explanation.基于机器学习和 Shapley 加法解释的 2 型糖尿病患者外周血管疾病预测模型和风险分析。
Front Endocrinol (Lausanne). 2024 Feb 28;15:1320335. doi: 10.3389/fendo.2024.1320335. eCollection 2024.
10
Application of machine learning approaches for osteoporosis risk prediction in postmenopausal women.机器学习方法在绝经后妇女骨质疏松症风险预测中的应用。
Arch Osteoporos. 2020 Oct 23;15(1):169. doi: 10.1007/s11657-020-00802-8.

引用本文的文献

1
Identification Exploring the Mechanism and Clinical Validation of Mitochondrial Dynamics-Related Genes in Membranous Nephropathy Based on Mendelian Randomization Study and Bioinformatics Analysis.基于孟德尔随机化研究和生物信息学分析探索膜性肾病中线粒体动力学相关基因的机制及临床验证
Biomedicines. 2025 Jun 17;13(6):1489. doi: 10.3390/biomedicines13061489.
2
AI-driven techniques for detection and mitigation of SARS-CoV-2 spread: a review, taxonomy, and trends.用于检测和缓解新冠病毒传播的人工智能驱动技术:综述、分类及趋势
Clin Exp Med. 2025 Jun 14;25(1):204. doi: 10.1007/s10238-025-01753-5.

本文引用的文献

1
Predictors of care burden among caregivers of patients with COVID-19.COVID-19 患者照顾者的照护负担预测因素。
Nurs Open. 2023 Dec;10(12):7603-7610. doi: 10.1002/nop2.1999. Epub 2023 Sep 24.
2
Red Cell Distribution Width as a Prognostic Indicator for Mortality and ICU Admission in Patients with COVID-19.红细胞分布宽度作为 COVID-19 患者死亡率和 ICU 入院的预后指标。
Adv Exp Med Biol. 2023;1412:225-235. doi: 10.1007/978-3-031-28012-2_12.
3
Prevalence and reasons for self-medication for prevention of COVID-19 among the adult population in Kermanshah-Iran.伊朗克尔曼沙阿省成年人预防 COVID-19 自我用药的流行率和原因。
Nurs Open. 2023 Jun;10(6):3946-3953. doi: 10.1002/nop2.1653. Epub 2023 Feb 22.
4
Machine learning for detecting COVID-19 from cough sounds: An ensemble-based MCDM method.基于集成的多准则决策方法的机器学习用于从咳嗽声音中检测 COVID-19。
Comput Biol Med. 2022 Jun;145:105405. doi: 10.1016/j.compbiomed.2022.105405. Epub 2022 Mar 17.
5
Statin Use in COVID-19 Hospitalized Patients and Outcomes: A Retrospective Study.他汀类药物在新冠肺炎住院患者中的应用及其结局:一项回顾性研究。
Front Cardiovasc Med. 2022 Feb 24;9:820260. doi: 10.3389/fcvm.2022.820260. eCollection 2022.
6
Deep Learning-Based Decision-Tree Classifier for COVID-19 Diagnosis From Chest X-ray Imaging.基于深度学习的用于从胸部X光影像诊断新冠肺炎的决策树分类器
Front Med (Lausanne). 2020 Jul 14;7:427. doi: 10.3389/fmed.2020.00427. eCollection 2020.
7
Application of data mining techniques to explore predictors of HCC in Egyptian patients with HCV-related chronic liver disease.应用数据挖掘技术探索埃及丙型肝炎病毒相关慢性肝病患者肝细胞癌的预测因素。
Asian Pac J Cancer Prev. 2015;16(1):381-5. doi: 10.7314/apjcp.2015.16.1.381.