• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

COVID-19 从症状到预测:一种统计和机器学习方法。

COVID-19 from symptoms to prediction: A statistical and machine learning approach.

机构信息

Department of Information System, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, 21589, Saudi Arabia.

School of Built Environment, Engineering, and Computing, Leeds Beckett University, Leeds, LS6 3QR, UK.

出版信息

Comput Biol Med. 2024 Nov;182:109211. doi: 10.1016/j.compbiomed.2024.109211. Epub 2024 Sep 28.

DOI:10.1016/j.compbiomed.2024.109211
PMID:39342677
Abstract

During the COVID-19 pandemic, the analysis of patient data has become a cornerstone for developing effective public health strategies. This study leverages a dataset comprising over 10,000 anonymized patient records from various leading medical institutions to predict COVID-19 patient age groups using a suite of statistical and machine learning techniques. Initially, extensive statistical tests including ANOVA and t-tests were utilized to assess relationships among demographic and symptomatic variables. The study then employed machine learning models such as Decision Tree, Naïve Bayes, KNN, Gradient Boosted Trees, Support Vector Machine, and Random Forest, with rigorous data preprocessing to enhance model accuracy. Further improvements were sought through ensemble methods; bagging, boosting, and stacking. Our findings indicate strong associations between key symptoms and patient age groups, with ensemble methods significantly enhancing model accuracy. Specifically, stacking applied with random forest as a meta leaner exhibited the highest accuracy (0.7054). In addition, the implementation of stacking techniques notably improved the performance of K-Nearest Neighbors (from 0.529 to 0.63) and Naïve Bayes (from 0.554 to 0.622) and demonstrated the most successful prediction method. The study aimed to understand the number of symptoms identified in COVID-19 patients and their association with different age groups. The results can assist doctors and higher authorities in improving treatment strategies. Additionally, several decision-making techniques can be applied during pandemic, tailored to specific age groups, such as resource allocation, medicine availability, vaccine development, and treatment strategies. The integration of these predictive models into clinical settings could support real-time public health responses and targeted intervention strategies.

摘要

在 COVID-19 大流行期间,对患者数据的分析已成为制定有效公共卫生策略的基石。本研究利用来自多家领先医疗机构的超过 10000 份匿名患者记录数据集,使用一系列统计和机器学习技术预测 COVID-19 患者的年龄组。最初,我们进行了广泛的统计测试,包括 ANOVA 和 t 检验,以评估人口统计学和症状变量之间的关系。然后,我们使用了机器学习模型,如决策树、朴素贝叶斯、KNN、梯度提升树、支持向量机和随机森林,并进行了严格的数据预处理以提高模型的准确性。通过集成方法,如袋装法、提升法和堆叠法进一步提高模型的性能。我们的研究结果表明,关键症状与患者年龄组之间存在很强的关联,而集成方法显著提高了模型的准确性。具体来说,随机森林作为元学习者的堆叠方法表现出最高的准确性(0.7054)。此外,堆叠技术的实施显著提高了 K-近邻(从 0.529 到 0.63)和朴素贝叶斯(从 0.554 到 0.622)的性能,并展示了最成功的预测方法。本研究旨在了解 COVID-19 患者中识别出的症状数量及其与不同年龄组的关联。研究结果可以帮助医生和上级部门改进治疗策略。此外,在大流行期间可以应用几种决策技术,针对特定年龄组进行定制,如资源分配、药物供应、疫苗开发和治疗策略。将这些预测模型集成到临床环境中,可以支持实时公共卫生响应和有针对性的干预策略。

相似文献

1
COVID-19 from symptoms to prediction: A statistical and machine learning approach.COVID-19 从症状到预测:一种统计和机器学习方法。
Comput Biol Med. 2024 Nov;182:109211. doi: 10.1016/j.compbiomed.2024.109211. Epub 2024 Sep 28.
2
Machine learning algorithms for predicting COVID-19 mortality in Ethiopia.用于预测埃塞俄比亚 COVID-19 死亡率的机器学习算法。
BMC Public Health. 2024 Jun 28;24(1):1728. doi: 10.1186/s12889-024-19196-0.
3
A new hybrid ensemble machine-learning model for severity risk assessment and post-COVID prediction system.一种新的混合集成机器学习模型,用于严重程度风险评估和 COVID 后预测系统。
Math Biosci Eng. 2022 Apr 13;19(6):6102-6123. doi: 10.3934/mbe.2022285.
4
Machine learning-based prediction models for home discharge in patients with COVID-19: Development and evaluation using electronic health records.基于机器学习的 COVID-19 患者出院预测模型:利用电子健康记录进行开发和评估。
PLoS One. 2023 Oct 20;18(10):e0292888. doi: 10.1371/journal.pone.0292888. eCollection 2023.
5
Machine Learning Hybrid Model for the Prediction of Chronic Kidney Disease.机器学习混合模型预测慢性肾脏病。
Comput Intell Neurosci. 2023 Mar 14;2023:9266889. doi: 10.1155/2023/9266889. eCollection 2023.
6
Predicting stroke occurrences: a stacked machine learning approach with feature selection and data preprocessing.预测中风发生:具有特征选择和数据预处理的堆叠机器学习方法。
BMC Bioinformatics. 2024 Oct 15;25(1):329. doi: 10.1186/s12859-024-05866-8.
7
Using Automated Machine Learning to Predict the Mortality of Patients With COVID-19: Prediction Model Development Study.利用自动化机器学习预测 COVID-19 患者的死亡率:预测模型开发研究。
J Med Internet Res. 2021 Feb 26;23(2):e23458. doi: 10.2196/23458.
8
A machine learning analysis of COVID-19 mental health data.机器学习分析 COVID-19 心理健康数据。
Sci Rep. 2022 Sep 2;12(1):14965. doi: 10.1038/s41598-022-19314-1.
9
A GA-stacking ensemble approach for forecasting energy consumption in a smart household: A comparative study of ensemble methods.基于 GA 堆叠的智能家居能耗预测集成方法研究:集成方法比较
J Environ Manage. 2024 Jul;364:121264. doi: 10.1016/j.jenvman.2024.121264. Epub 2024 Jun 12.
10
Classification of imbalanced data using machine learning algorithms to predict the risk of renal graft failures in Ethiopia.使用机器学习算法对不平衡数据进行分类,以预测埃塞俄比亚肾移植失败的风险。
BMC Med Inform Decis Mak. 2023 May 22;23(1):98. doi: 10.1186/s12911-023-02185-5.

引用本文的文献

1
SympCoughNet: symptom assisted audio-based COVID-19 detection.SympCoughNet:基于症状辅助音频的新冠病毒检测
Front Digit Health. 2025 Mar 12;7:1551298. doi: 10.3389/fdgth.2025.1551298. eCollection 2025.