Kaur Harleen, Ahsaan Shafqat Ul, Alankar Bhavya, Chang Victor
Department of Computer Science and Engineering, School of Engineering Sciences and Technology, Jamia Hamdard, New Delhi, India.
Artificial Intelligence and Information Systems Research Group, School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, UK.
Inf Syst Front. 2021;23(6):1417-1429. doi: 10.1007/s10796-021-10135-7. Epub 2021 Apr 20.
With the rise in cases of COVID-19, a bizarre situation of pressure was mounted on each country to make arrangements to control the population and utilize the available resources appropriately. The swiftly rising of positive cases globally created panic, anxiety and depression among people. The effect of this deadly disease was found to be directly proportional to the physical and mental health of the population. As of 28 October 2020, more than 40 million people are tested positive and more than 1 million deaths have been recorded. The most dominant tool that disturbed human life during this time is social media. The tweets regarding COVID-19, whether it was a number of positive cases or deaths, induced a wave of fear and anxiety among people living in different parts of the world. Nobody can deny the truth that social media is everywhere and everybody is connected with it directly or indirectly. This offers an opportunity for researchers and data scientists to access the data for academic and research use. The social media data contains many data that relate to real-life events like COVID-19. In this paper, an analysis of Twitter data has been done through the R programming language. We have collected the Twitter data based on hashtag keywords, including COVID-19, coronavirus, deaths, new case, recovered. In this study, we have designed an algorithm called Hybrid Heterogeneous Support Vector Machine (H-SVM) and performed the sentiment classification and classified them positive, negative and neutral sentiment scores. We have also compared the performance of the proposed algorithm on certain parameters like precision, recall, F1 score and accuracy with Recurrent Neural Network (RNN) and Support Vector Machine (SVM).
随着新冠病毒病例的增加,每个国家都面临着一种奇怪的压力局面,需要做出安排来控制人口并合理利用现有资源。全球确诊病例的迅速增加在人们中引发了恐慌、焦虑和抑郁。人们发现这种致命疾病的影响与人口的身心健康直接相关。截至2020年10月28日,已有超过4000万人检测呈阳性,记录在案的死亡人数超过100万。在此期间扰乱人类生活的最主要工具是社交媒体。关于新冠病毒的推文,无论是确诊病例数还是死亡人数,都在世界各地的人们中引发了一波恐惧和焦虑。没有人能否认社交媒体无处不在,每个人都直接或间接地与之相连这一事实。这为研究人员和数据科学家提供了获取数据用于学术和研究的机会。社交媒体数据包含许多与新冠病毒等现实生活事件相关的数据。在本文中,我们通过R编程语言对推特数据进行了分析。我们基于包括新冠病毒、冠状病毒、死亡、新病例、康复等主题标签关键词收集了推特数据。在这项研究中,我们设计了一种名为混合异构支持向量机(H-SVM)的算法,并进行了情感分类,将它们分为积极、消极和中性情感得分。我们还将该算法在精度、召回率、F1分数和准确率等参数方面的性能与递归神经网络(RNN)和支持向量机(SVM)进行了比较。