Department of Computer Science, Umm Al-Qura University, Makkah 24243, Saudi Arabia.
REsearch Groups in Intelligent Machines (REGIM-Lab), National Engineering School of Sfax, University of Sfax, Sfax 3038, Tunisia.
Comput Intell Neurosci. 2022 Oct 27;2022:4535541. doi: 10.1155/2022/4535541. eCollection 2022.
The spread of COVID-19 has affected more than 200 countries and has caused serious public health concerns. The infected cases are on the increase despite the effectiveness of the vaccines. An efficient and quick surveillance system for COVID-19 can help healthcare decision-makers to contain the virus spread. In this study, we developed a novel framework using machine learning (ML) models capable of detecting COVID-19 accurately at an early stage. To estimate the risks, many models use social networking sites (SNSs) in tracking the disease outbreak. Twitter is one of the SNSs that is widely used to create an efficient resource for disease real-time analysis and can provide an early warning for health officials. We introduced a pipeline framework of outbreak prediction that incorporates a first-step hybrid method of word embedding for tweet classification. In the second step, we considered the classified tweets with external features such as vaccine rate associated with infected cases passed to machine learning algorithms for daily predictions. Thus, we applied different machine learning models such as the SVM, RF, and LR for classification and the LSTM, Prophet, and SVR for prediction. For the hybrid word embedding techniques, we applied TF-IDF, FastText, and Glove and a combination of the three features to enhance the classification. Furthermore, to improve the forecast performance, we incorporated vaccine data as input together with tweets and confirmed cases. The models' performance is more than 80% accurate, which shows the reliability of the proposed study.
COVID-19 的传播已经影响了 200 多个国家,引起了严重的公共卫生关注。尽管疫苗有效,但感染病例仍在增加。一个高效、快速的 COVID-19 监测系统可以帮助医疗保健决策者控制病毒的传播。在这项研究中,我们开发了一个使用机器学习(ML)模型的新框架,能够在早期准确地检测 COVID-19。为了估计风险,许多模型使用社交网络(SNS)来跟踪疾病爆发。Twitter 是一种广泛用于创建疾病实时分析有效资源并为卫生官员提供早期预警的 SNS。我们引入了一种爆发预测的管道框架,该框架包含了用于 tweet 分类的第一步混合词嵌入方法。在第二步中,我们考虑了与感染病例相关的疫苗接种率等外部特征的分类推文,并将其传递给机器学习算法进行日常预测。因此,我们应用了不同的机器学习模型,如 SVM、RF 和 LR 进行分类,以及 LSTM、Prophet 和 SVR 进行预测。对于混合词嵌入技术,我们应用了 TF-IDF、FastText 和 Glove 并结合了这三个特征来提高分类的准确性。此外,为了提高预测性能,我们将疫苗数据与推文和确诊病例一起作为输入。模型的性能准确率超过 80%,这表明了所提出的研究的可靠性。