Muhammad L J, Algehyne Ebrahem A, Usman Sani Sharif, Ahmad Abdulkadir, Chakraborty Chinmay, Mohammed I A
Department of Mathematics and Computer Science, Faculty of Science, Federal University of Kashere, P.M.B. 0182, Gombe, Nigeria.
Department of Mathematics, University of Tabuk, Tabuk, 71491 Saudi Arabia.
SN Comput Sci. 2021;2(1):11. doi: 10.1007/s42979-020-00394-7. Epub 2020 Nov 27.
COVID-19 or 2019-nCoV is no longer pandemic but rather endemic, with more than 651,247 people around world having lost their lives after contracting the disease. Currently, there is no specific treatment or cure for COVID-19, and thus living with the disease and its symptoms is inevitable. This reality has placed a massive burden on limited healthcare systems worldwide especially in the developing nations. Although neither an effective, clinically proven antiviral agents' strategy nor an approved vaccine exist to eradicate the COVID-19 pandemic, there are alternatives that may reduce the huge burden on not only limited healthcare systems but also the economic sector; the most promising include harnessing non-clinical techniques such as machine learning, data mining, deep learning and other artificial intelligence. These alternatives would facilitate diagnosis and prognosis for 2019-nCoV pandemic patients. Supervised machine learning models for COVID-19 infection were developed in this work with learning algorithms which include logistic regression, decision tree, support vector machine, naive Bayes, and artificial neutral network using epidemiology labeled dataset for positive and negative COVID-19 cases of Mexico. The correlation coefficient analysis between various dependent and independent features was carried out to determine a strength relationship between each dependent feature and independent feature of the dataset prior to developing the models. The 80% of the training dataset were used for training the models while the remaining 20% were used for testing the models. The result of the performance evaluation of the models showed that decision tree model has the highest accuracy of 94.99% while the Support Vector Machine Model has the highest sensitivity of 93.34% and Naïve Bayes Model has the highest specificity of 94.30%.
新冠病毒(COVID-19 或 2019 - nCoV)已不再是大流行疾病,而是地方性流行病,全球有超过 651,247 人感染该疾病后丧生。目前,尚无针对 COVID-19 的特效治疗方法或治愈手段,因此与该疾病及其症状共存不可避免。这一现实给全球有限的医疗系统带来了巨大负担,尤其是在发展中国家。尽管既没有有效的、经临床验证的抗病毒药物策略,也没有获批的疫苗来根除 COVID-19 大流行,但存在一些替代方案,这些方案不仅可以减轻有限医疗系统的巨大负担,还能减轻经济部门的负担;最有前景的方案包括利用机器学习、数据挖掘、深度学习和其他人工智能等非临床技术。这些替代方案将有助于对 2019 - nCoV 大流行患者进行诊断和预后评估。在这项工作中,使用包括逻辑回归、决策树、支持向量机、朴素贝叶斯和人工神经网络在内的学习算法,针对墨西哥 COVID-19 阳性和阴性病例的流行病学标记数据集,开发了用于 COVID-19 感染的监督机器学习模型。在开发模型之前,对各种因变量和自变量特征进行了相关系数分析,以确定数据集中每个因变量特征和自变量特征之间的强度关系。80%的训练数据集用于训练模型,其余 20%用于测试模型。模型性能评估结果表明,决策树模型的准确率最高,为 94.99%,支持向量机模型的灵敏度最高,为 93.34%,朴素贝叶斯模型的特异性最高,为 94.30%。