Almustafa Khaled Mohamad
Department of Information Systems, College of Computer and Information Systems, Prince Sultan University, Riyadh, Kingdom of Saudi Arabia.
Concurr Comput. 2022 Feb 15;34(4):e6675. doi: 10.1002/cpe.6675. Epub 2021 Oct 16.
The coronavirus disease (Covid19) pandemic has had a great effect on human health worldwide since the disease was first detected in late 2019. A clear understanding of the structure of the available Covid19 datasets might help healthcare providers identify some cases at an early stage. In this article, we examine a Covid19 Mexican Patients' Dataset (Covid19MPD) and apply a number of machine learning algorithms to it in order to select the best classification algorithm for death and survival cases in Mexico. We then study how feature selection affects the performance of the selected classifiers, with the aim of predicting severe and/or fatal cases from the available dataset. Results show that the J48 classifier gives the best classification accuracy, 94.41%, with RMSE = 0.2028 and ROC area = 0.919, compared to the other classifiers. With feature selection, the J48 classifier can predict a surviving Covid19MPD case with 94.88% accuracy using only 10 of the 19 features.
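The workflow described in the abstract (train a decision tree on all 19 features, then retrain on a selected subset of 10 and compare accuracy, RMSE, and ROC area) can be sketched roughly as follows. This is a minimal illustration under several assumptions, not the article's actual pipeline: scikit-learn's `DecisionTreeClassifier` stands in for J48 (Weka's C4.5 implementation, which is a different algorithm), `SelectKBest` with `f_classif` stands in for the unspecified feature selection method, the data is synthetic rather than Covid19MPD, and RMSE is computed on predicted probabilities. The helper name `evaluate` and all parameter values are hypothetical.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.metrics import accuracy_score, roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.tree import DecisionTreeClassifier


def evaluate(model, X_train, X_test, y_train, y_test):
    """Fit the model and return accuracy, RMSE, and ROC area on the test split."""
    model.fit(X_train, y_train)
    pred = model.predict(X_test)
    proba = model.predict_proba(X_test)[:, 1]  # probability of class 1
    return {
        "accuracy": float(accuracy_score(y_test, pred)),
        # Assumption: RMSE between the 0/1 label and the predicted probability.
        "rmse": float(np.sqrt(np.mean((y_test - proba) ** 2))),
        "roc_auc": float(roc_auc_score(y_test, proba)),
    }


# Synthetic stand-in for the dataset: 19 features, binary outcome.
# (Assumption: the real Covid19MPD data is not used here.)
X, y = make_classification(
    n_samples=2000, n_features=19, n_informative=8, n_redundant=2, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

# Decision tree on all 19 features (a stand-in for J48, not J48 itself).
full_model = DecisionTreeClassifier(
    criterion="entropy", min_samples_leaf=5, random_state=0
)
full_scores = evaluate(full_model, X_train, X_test, y_train, y_test)

# Same tree, preceded by a filter that keeps the 10 highest-scoring features.
selected_model = make_pipeline(
    SelectKBest(score_func=f_classif, k=10),
    DecisionTreeClassifier(criterion="entropy", min_samples_leaf=5, random_state=0),
)
selected_scores = evaluate(selected_model, X_train, X_test, y_train, y_test)

# Number of features the selection step actually kept.
n_selected = int(selected_model.named_steps["selectkbest"].get_support().sum())

print("all 19 features:", full_scores)
print(f"{n_selected} selected features:", selected_scores)
```

Placing the selector inside the pipeline means features are chosen from the training split only, which keeps the test split from leaking into the selection step. The scores printed on synthetic data say nothing about the figures reported in the abstract.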