Da Ting, Yang Liang
Xi'an Microelectronics Technology Institute Xi'an 710065 China.
IEEE Access. 2020 Nov 10;8:204684-204694. doi: 10.1109/ACCESS.2020.3037248. eCollection 2020.
Unexpected but exceedingly consequential, the COVID-19 outbreak has undermined livelihoods, disrupted the economy, induced upheavals, and posed challenges to government decision-makers. Under various behavioural regulations, such as social distancing and transport limitations, social media has become the central platform on which people from all regions, regardless of local COVID-19 severity, share their feelings and exchange thoughts. Our study illustrates the evolution of moods expressed on social media regarding COVID-19-related issues and empirically confirms the hypothesis that the severity of the pandemic substantially correlates with these sentiments by analysing tweets on Sina Weibo (China's central social media platform). Methodologically, we leveraged Sentiment Knowledge Enhanced Pre-training, the most state-of-the-art natural language processing pre-trained sentiment-related multipurpose model, to label Sina Weibo tweets during the most distressed period in 2020. Given that the model itself does not provide a feature explanation, we utilize a random forest and linear probit model with the labelled sample to demonstrate how each word plays a role in the prediction. Finally, we demonstrate a strong negative linear relationship between the local severity of COVID-19 and the local sentiment response by incorporating miscellaneous geo-economic control variables. In short, our study reveals how pandemics affect local sentiment and, in a broader sense, provides an easy-to-implement and explanatory pipeline to classify sentiments and resolve related socioeconomic issues.
新冠疫情的爆发出乎意料却影响深远,它破坏了人们的生计,扰乱了经济,引发了动荡,并给政府决策者带来了挑战。在诸如社交距离和交通限制等各种行为规范下,社交媒体已成为一个核心平台,来自各个地区的人们,无论当地新冠疫情的严重程度如何,都在这个平台上分享感受、交流思想。我们的研究阐述了社交媒体上关于新冠疫情相关问题所表达情绪的演变,并通过分析中国核心社交媒体平台新浪微博上的推文,实证证实了疫情严重程度与这些情绪显著相关的假设。在方法上,我们利用了情感知识增强预训练模型,这是最先进的自然语言处理预训练情感相关多用途模型,对2020年最艰难时期的新浪微博推文进行标注。鉴于该模型本身不提供特征解释,我们使用带有标注样本的随机森林和线性概率模型来展示每个词在预测中是如何发挥作用的。最后,通过纳入各种地理经济控制变量,我们证明了新冠疫情的局部严重程度与局部情绪反应之间存在强烈的负线性关系。简而言之,我们的研究揭示了疫情如何影响局部情绪,从更广泛的意义上说,提供了一个易于实施且具有解释性的流程来对情绪进行分类并解决相关的社会经济问题。