Country-level pandemic risk and preparedness classification based on COVID-19 data: A machine learning approach.

Affiliations

Aston Robotics, Vision, and Intelligent Systems Lab (ARVIS), School of Engineering and Applied Science, Aston University, Birmingham, United Kingdom.

Department of Electrical and Computer Engineering, Institute of Systems and Robotics, University of Coimbra, Coimbra, Portugal.

Publication information

PLoS One. 2020 Oct 28;15(10):e0241332. doi: 10.1371/journal.pone.0241332. eCollection 2020.

DOI:10.1371/journal.pone.0241332

PMID:33112931

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC7592809/

Abstract

In this work we present a three-stage Machine Learning strategy for country-level risk classification based on countries that are reporting COVID-19 information. A K% binning discretisation (K = 25) is used to create four risk groups of countries based on the risk of transmission (coronavirus cases per million population), risk of mortality (coronavirus deaths per million population), and risk of inability to test (coronavirus tests per million population). The four risk groups produced by K% binning are labelled as 'low', 'medium-low', 'medium-high', and 'high'. Coronavirus-related data are then removed, and the attributes used to predict the three types of risk are the geopolitical and demographic data describing each country. Thus, the class labels are calculated from coronavirus data, but the input attributes are country-level information independent of coronavirus data. The three four-class classification problems are then explored and benchmarked through leave-one-country-out cross validation to find the strongest model, producing a Stack of Gradient Boosting and Decision Tree algorithms for risk of transmission, a Stack of Support Vector Machine and Extra Trees for risk of mortality, and a Gradient Boosting algorithm for the risk of inability to test. It is noted that a high risk of inability to test is often coupled with low risks of transmission and mortality; the risk of inability to test should therefore be interpreted first, before consideration is given to the predicted transmission and mortality risks. Finally, the approach is applied to more recent risk levels derived from September 2020 data, and weaker results are noted. This is attributed to the growth of international collaboration, which reduces the useful knowledge carried by country-level attributes, and it suggests that similar machine learning approaches are most useful early on, before a situation has fully unfolded.
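The labelling step described in the abstract can be illustrated with a minimal sketch. It assumes that K% binning with K = 25 means rank-based, equal-frequency bins that each hold about a quarter of the countries; the abstract does not spell out the exact procedure. The function name `k_percent_bins`, the `higher_is_riskier` flag, and all country names and figures below are hypothetical and made up for illustration. In particular, treating fewer tests per million as a higher risk of inability to test is an assumption of this sketch.

```python
LABELS = ["low", "medium-low", "medium-high", "high"]


def k_percent_bins(values, k=25, labels=LABELS, higher_is_riskier=True):
    """Label each country by a rank-based bin holding roughly k% of countries.

    Hypothetical helper: assumes equal-frequency binning, which may differ
    from the exact discretisation used in the paper.
    """
    n_bins = 100 // k
    if n_bins != len(labels):
        raise ValueError("need exactly one label per bin")
    # Order countries from lowest risk to highest risk.
    ordered = sorted(values, key=values.get, reverse=not higher_is_riskier)
    n = len(ordered)
    # Map each rank to a bin index; clamp so the last country stays in range.
    return {
        country: labels[min(rank * n_bins // n, n_bins - 1)]
        for rank, country in enumerate(ordered)
    }


# Made-up figures for eight hypothetical countries.
cases_per_million = {"A": 120.0, "B": 4500.0, "C": 900.0, "D": 15000.0,
                     "E": 60.0, "F": 2300.0, "G": 7800.0, "H": 300.0}
tests_per_million = {"A": 500.0, "B": 90000.0, "C": 12000.0, "D": 150000.0,
                     "E": 200.0, "F": 40000.0, "G": 70000.0, "H": 3000.0}

transmission_risk = k_percent_bins(cases_per_million)
# Assumption: fewer tests per million means a higher risk of inability to test.
testing_risk = k_percent_bins(tests_per_million, higher_is_riskier=False)

for country in sorted(cases_per_million):
    print(country, transmission_risk[country], testing_risk[country])
```

In this toy data, the countries with the fewest tests also report the fewest cases, which mirrors the abstract's caution that the risk of inability to test should be read before the transmission and mortality predictions.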

Similar articles

Machine Learning to Predict Mortality and Critical Events in a Cohort of Patients With COVID-19 in New York City: Model Development and Validation.

J Med Internet Res. 2020 Nov 6;22(11):e24018. doi: 10.2196/24018.

Geographic risk assessment of COVID-19 transmission using recent data: An observational study.

Medicine (Baltimore). 2020 Jun 12;99(24):e20774. doi: 10.1097/MD.0000000000020774.

Predicting Coronavirus Disease 2019 Infection Risk and Related Risk Drivers in Nursing Homes: A Machine Learning Approach.

J Am Med Dir Assoc. 2020 Nov;21(11):1533-1538.e6. doi: 10.1016/j.jamda.2020.08.030. Epub 2020 Aug 27.

Clinical Predictive Models for COVID-19: Systematic Study.

J Med Internet Res. 2020 Oct 6;22(10):e21439. doi: 10.2196/21439.

Using a simple open-source automated machine learning algorithm to forecast COVID-19 spread: A modelling study.

Adv Respir Med. 2020;88(5):400-405. doi: 10.5603/ARM.a2020.0156.

Why Is Modeling Coronavirus Disease 2019 So Difficult?

Chest. 2020 Nov;158(5):1829-1830. doi: 10.1016/j.chest.2020.06.014. Epub 2020 Jun 19.

Can We Test Our Way Out of the COVID-19 Pandemic?

J Clin Microbiol. 2020 Oct 21;58(11). doi: 10.1128/JCM.02225-20.

Coronavirus Disease 2019 (COVID-19) diagnostic technologies: A country-based retrospective analysis of screening and containment procedures during the first wave of the pandemic.

Clin Imaging. 2020 Nov;67:219-225. doi: 10.1016/j.clinimag.2020.08.014. Epub 2020 Aug 26.

Machine learning based early warning system enables accurate mortality risk prediction for COVID-19.

Nat Commun. 2020 Oct 6;11(1):5033. doi: 10.1038/s41467-020-18684-2.

Cited by

Machine learning algorithms for predicting COVID-19 mortality in Ethiopia.

BMC Public Health. 2024 Jun 28;24(1):1728. doi: 10.1186/s12889-024-19196-0.

COVID-19 Pandemic Risk Assessment: Systematic Review.

Risk Manag Healthc Policy. 2024 Apr 11;17:903-925. doi: 10.2147/RMHP.S444494. eCollection 2024.

Neo-epidemiological machine learning based method for COVID-19 related estimations.

PLoS One. 2023 Mar 24;18(3):e0263991. doi: 10.1371/journal.pone.0263991. eCollection 2023.

Deep Convolutional Neural Network Mechanism Assessment of COVID-19 Severity.

Biomed Res Int. 2022 Aug 23;2022:1289221. doi: 10.1155/2022/1289221. eCollection 2022.

Predicting COVID-19 county-level case number trend by combining demographic characteristics and social distancing policies.

JAMIA Open. 2022 Jun 25;5(3):ooac056. doi: 10.1093/jamiaopen/ooac056. eCollection 2022 Oct.

Priority and age specific vaccination algorithm for the pandemic diseases: a comprehensive parametric prediction model.

BMC Med Inform Decis Mak. 2022 Jan 6;22(1):4. doi: 10.1186/s12911-021-01720-6.

A machine learning based exploration of COVID-19 mortality risk.

PLoS One. 2021 Jul 2;16(7):e0252384. doi: 10.1371/journal.pone.0252384. eCollection 2021.

AI in Fighting Covid-19: Pandemic Management.

Procedia Comput Sci. 2021;185:380-386. doi: 10.1016/j.procs.2021.05.039. Epub 2021 Jun 10.

SOM-LWL method for identification of COVID-19 on chest X-rays.

PLoS One. 2021 Feb 24;16(2):e0247176. doi: 10.1371/journal.pone.0247176. eCollection 2021.

What Can COVID-19 Teach Us about Using AI in Pandemics?

Healthcare (Basel). 2020 Dec 1;8(4):527. doi: 10.3390/healthcare8040527.
