Saad Eysha, Sadiq Saima, Jamil Ramish, Rustam Furqan, Mehmood Arif, Choi Gyu Sang, Ashraf Imran
Department of Computer Science, Khawaja Fareed University of Engineering and Information Technology, Rahim Yar Khan, Pakistan.
Department of Software Engineering, University of Management and Technology, Lahore, Pakistan.
Digit Health. 2022 Jul 21;8:20552076221109530. doi: 10.1177/20552076221109530. eCollection 2022 Jan-Dec.
Vaccination for the COVID-19 pandemic has raised serious concerns among the public and various rumours are spread regarding the resulting illness, adverse reactions, and death. Such rumours can damage the campaign against the COVID-19 and should be dealt with accordingly. One prospective solution is to use machine learning-based models to predict the death risk for vaccinated people by utilizing the available data. This study focuses on the prognosis of three significant events including 'not survived', 'recovered', and 'not recovered' based on the adverse events followed by the second dose of the COVID-19 vaccine. Extensive experiments are performed to analyse the efficacy of the proposed Extreme Regression- Voting Classifier model in comparison with machine learning models with Term Frequency-Inverse Document Frequency, Bag of Words, and Global Vectors, and deep learning models like Convolutional Neural Network, Long Short Term Memory, and Bidirectional Long Short Term Memory. Experiments are carried out on the original, as well as, a balanced dataset using Synthetic Minority Oversampling Approach. Results reveal that the proposed voting classifier in combination with TF-IDF outperforms with a 0.85 accuracy score on the SMOTE-balanced dataset. In line with this, the validation of the proposed voting classifier on binary classification shows state-of-the-art results with a 0.98 accuracy.
新冠疫情疫苗接种引发了公众的严重担忧,关于接种后产生的疾病、不良反应及死亡的各种谣言四处传播。此类谣言会破坏新冠疫情防控工作,应予以相应处理。一个可能的解决方案是利用机器学习模型,通过现有数据预测接种者的死亡风险。本研究基于第二剂新冠疫苗接种后的不良事件,聚焦于“未存活”“康复”和“未康复”这三个重大事件的预后情况。开展了大量实验,以分析所提出的极端回归投票分类器模型与采用词频 - 逆文档频率、词袋模型和全局向量的机器学习模型,以及卷积神经网络、长短期记忆网络和双向长短期记忆网络等深度学习模型相比的效果。使用合成少数过采样技术在原始数据集以及平衡数据集上进行实验。结果表明,所提出的投票分类器与词频 - 逆文档频率相结合,在SMOTE平衡数据集上的准确率达到0.85,表现出色。与此相符的是,所提出的投票分类器在二分类上的验证显示,其准确率达到0.98,取得了领先的成果。