Individual Factors Associated With COVID-19 Infection: A Machine Learning Study.

Affiliations

Cátedras Conacyt, National Council on Science and Technology, Mexico City, Mexico.

Center for Research in Geospatial Information Sciences, Mexico City, Mexico.

Publication information

Front Public Health. 2022 Jun 30;10:912099. doi: 10.3389/fpubh.2022.912099. eCollection 2022.

DOI: 10.3389/fpubh.2022.912099

PMID: 35844896

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9279686/

Abstract

The rapid, exponential growth of COVID-19 infections and their catastrophic effects on patients' health have created a need for tools that help health systems diagnose and prognose the disease quickly and efficiently. In this context, the present study aims to identify potential factors associated with COVID-19 infection using machine learning techniques. Random forest, chi-squared, XGBoost, and rpart were used for feature selection, and ROSE and SMOTE were used as resampling methods because of class imbalance. Machine learning and deep learning algorithms such as support vector machines, C4.5, random forest, rpart, and deep neural networks were then explored during the train/test phase to select the best prediction model. The dataset used in this study contains clinical data, anthropometric measurements, and other health parameters related to smoking habits, alcohol consumption, quality of sleep, physical activity, and health status during the confinement caused by the COVID-19 pandemic. The results showed that XGBoost selected the features most strongly associated with COVID-19 infection, and that random forest yielded the best predictive model, with a balanced accuracy of 90.41% using SMOTE as the resampling technique. The best-performing model provides a tool that can help prevent SARS-CoV-2 infection, since it identifies the variables carrying the highest risk, some of which are, to a certain extent, controllable.
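The abstract names two ideas that are easy to misread without an example: SMOTE resampling and balanced accuracy. The block below is a minimal sketch of both in plain Python, not the authors' code. The function names `smote` and `balanced_accuracy`, the parameters `k` and `seed`, and the toy data are assumptions made for illustration, and this simplified SMOTE variant picks base samples at random rather than following any particular library's implementation.

```python
import random
from math import dist


def smote(minority, n_new, k=3, seed=0):
    """Create n_new synthetic minority samples (simplified SMOTE sketch).

    Each synthetic point lies on the segment between a randomly chosen
    minority sample and one of its k nearest minority neighbours.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest minority neighbours of base, excluding base itself
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: dist(base, p),
        )[:k]
        other = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> other
        synthetic.append(tuple(b + gap * (o - b) for b, o in zip(base, other)))
    return synthetic


def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall; for two classes, the mean of
    sensitivity and specificity."""
    recalls = []
    for c in sorted(set(y_true)):
        idx = [i for i, y in enumerate(y_true) if y == c]
        recalls.append(sum(y_pred[i] == c for i in idx) / len(idx))
    return sum(recalls) / len(recalls)


# Toy imbalanced labels (hypothetical): 2 positives, 8 negatives.
y_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
y_pred = [0] * 10  # a model that always predicts the majority class

# Plain accuracy would be 0.8 here, but balanced accuracy exposes the failure.
print(balanced_accuracy(y_true, y_pred))  # 0.5

# Oversample a toy minority class with 6 synthetic points.
minority = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
print(len(smote(minority, n_new=6)))  # 6
```

The always-negative predictor shows why the study reports balanced accuracy rather than plain accuracy: on imbalanced data, a model can score well on accuracy while missing every case in the minority class.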

Similar articles

A new approach for determining SARS-CoV-2 epitopes using machine learning-based in silico methods.

Comput Biol Chem. 2022 Jun;98:107688. doi: 10.1016/j.compbiolchem.2022.107688. Epub 2022 Apr 30.

A new hybrid ensemble machine-learning model for severity risk assessment and post-COVID prediction system.

Math Biosci Eng. 2022 Apr 13;19(6):6102-6123. doi: 10.3934/mbe.2022285.

Prediction of death status on the course of treatment in SARS-COV-2 patients with deep learning and machine learning methods.

Comput Methods Programs Biomed. 2021 Apr;201:105951. doi: 10.1016/j.cmpb.2021.105951. Epub 2021 Jan 22.

Machine learning algorithms for predicting COVID-19 mortality in Ethiopia.

BMC Public Health. 2024 Jun 28;24(1):1728. doi: 10.1186/s12889-024-19196-0.

Clinical Predictive Models for COVID-19: Systematic Study.

J Med Internet Res. 2020 Oct 6;22(10):e21439. doi: 10.2196/21439.

Predicting mortality in SARS-COV-2 (COVID-19) positive patients in the inpatient setting using a novel deep neural network.

Int J Med Inform. 2021 Oct;154:104556. doi: 10.1016/j.ijmedinf.2021.104556. Epub 2021 Aug 21.

Use of Machine Learning and Artificial Intelligence to predict SARS-CoV-2 infection from Full Blood Counts in a population.

Int Immunopharmacol. 2020 Sep;86:106705. doi: 10.1016/j.intimp.2020.106705. Epub 2020 Jun 16.

A machine learning approach to personalized predictors of dyslipidemia: a cohort study.

Front Public Health. 2023 Sep 20;11:1213926. doi: 10.3389/fpubh.2023.1213926. eCollection 2023.

Developing machine learning-based models to predict intrauterine insemination (IUI) success by address modeling challenges in imbalanced data and providing modification solutions for them.

BMC Med Inform Decis Mak. 2022 Sep 1;22(1):228. doi: 10.1186/s12911-022-01974-8.

Cited by

COVID-19 Reinfections in the City of São Paulo, Brazil: Prevalence and Socioeconomic Factors.

Open Forum Infect Dis. 2025 Apr 16;12(4):ofaf181. doi: 10.1093/ofid/ofaf181. eCollection 2025 Apr.

AutoML-Driven Insights into Patient Outcomes and Emergency Care During Romania's First Wave of COVID-19.

Bioengineering (Basel). 2024 Dec 15;11(12):1272. doi: 10.3390/bioengineering11121272.

Artificial intelligence in triage of COVID-19 patients.

Front Artif Intell. 2024 Dec 18;7:1495074. doi: 10.3389/frai.2024.1495074. eCollection 2024.

Superspreading of SARS-CoV-2: a systematic review and meta-analysis of event attack rates and individual transmission patterns.

Epidemiol Infect. 2024 Oct 8;152:e121. doi: 10.1017/S0950268824000955.

An adaptive data-driven architecture for mental health care applications.

PeerJ. 2024 Mar 29;12:e17133. doi: 10.7717/peerj.17133. eCollection 2024.

A large-scale machine learning study of sociodemographic factors contributing to COVID-19 severity.

Front Big Data. 2023 Mar 24;6:1038283. doi: 10.3389/fdata.2023.1038283. eCollection 2023.
