Suma L S, Anand H S, Vinod Chandra S S
Department of Computational Biology and Bioinformatics, University of Kerala, Trivandrum, India.
Department of Computer Science and Engineering, Muthoot Institute of Technology and Science, Kochi, India.
J Ambient Intell Humaniz Comput. 2023;14(3):1699-1711. doi: 10.1007/s12652-021-03389-1. Epub 2021 Jul 31.
The spread rate of COVID-19 is expected to be high in the wake of the virus's mutated strain found recently in a few countries. Fast diagnosis of the disease and knowing its severity are the two significant concerns of all physicians. Even though positive or negative diagnosis can be obtained through the RT-PCR test, an automatic model that predicts severity and the diagnosis will help medical practitioners to a great extend for affirming medication. Machine learning is an efficient tool that can process vast volume of data deposited in various formats, including clinical symptoms. In this work, we have developed machine learning models for analysing a clinical data set comprising 65000 records of patients, consisting of 26 features. An optimum set of features was derived from this data set by the proposed variant of artificial bee colony optimization algorithm. By making use of these features, a binary classifier is modelled with support vector machine for the screening of COVID-19 patients. Different models were tested for this purpose and the support vector machine has showcased the highest accuracy of 96%. Successively, severity prediction in COVID positive patients was also performed successfully by the logistic regression model. The model managed to predict three severity status viz mild, moderate, and severe. The confusion matrix and the precision-recall values (0.96 and 0.97) of the binary classifier indicate the classifier's efficiency in predicting positive cases correctly. The receiver operating curve generated for the severity predicting model shows the highest accuracy, 96.0% for class 1 and 85.0% for class 2 patients. Doctors can infer these results to finalize the type of treatment/care/facilities that need to be given to the patients from time to time.
最近在一些国家发现新冠病毒的变异毒株后,预计新冠病毒的传播速度会很高。疾病的快速诊断及其严重程度是所有医生的两大重要关切。尽管通过逆转录聚合酶链反应(RT-PCR)检测可以获得阳性或阴性诊断,但一个能够预测严重程度和诊断结果的自动模型将在很大程度上帮助医生确定治疗方案。机器学习是一种高效工具,能够处理以各种格式存储的大量数据,包括临床症状。在这项工作中,我们开发了机器学习模型来分析一个包含65000条患者记录的临床数据集,该数据集由26个特征组成。通过提出的人工蜂群优化算法变体从该数据集中导出了一组最优特征。利用这些特征,使用支持向量机建立了一个二元分类器,用于筛查新冠患者。为此测试了不同的模型,支持向量机展现出了96%的最高准确率。随后,逻辑回归模型也成功地对新冠阳性患者的严重程度进行了预测。该模型成功预测了三种严重程度状态,即轻度、中度和重度。二元分类器的混淆矩阵和精确率-召回率值(0.96和0.97)表明该分类器在正确预测阳性病例方面的效率。为严重程度预测模型生成的受试者工作特征曲线显示出最高准确率,1类患者为96.0%,2类患者为85.0%。医生可以参考这些结果来确定需要不时给予患者的治疗/护理/设施类型。