Chen Yuan, Zhang Zhisheng
School of Mechanical Engineering, Southeast University, Nanjing, China.
Inf Process Manag. 2022 Nov;59(6):103073. doi: 10.1016/j.ipm.2022.103073. Epub 2022 Aug 29.
With the onset of COVID-19, the pandemic has aroused huge discussions on social media like Twitter, followed by many social media analyses concerning it. Despite such an abundance of studies, however, little work has been done on reactions from the public and officials on social networks and their associations, especially during the early outbreak stage. In this paper, a total of 9,259,861 COVID-19-related English tweets published from 31 December 2019 to 11 March 2020 are accumulated for exploring the participatory dynamics of public attention and news coverage during the early stage of the pandemic. An easy numeric data augmentation (ENDA) technique is proposed for generating new samples while preserving label validity. It attains superior performance on text classification tasks with deep models (BERT) than an easier data augmentation method. To demonstrate the efficacy of ENDA further, experiments and ablation studies have also been implemented on other benchmark datasets. The classification results of COVID-19 tweets show tweets peaks trigged by momentous events and a strong positive correlation between the daily number of personal narratives and news reports. We argue that there were three periods divided by the turning points on January 20 and February 23 and the low level of news coverage suggests the missed windows for government response in early January and February. Our study not only contributes to a deeper understanding of the dynamic patterns and relationships of public attention and news coverage on social media during the pandemic but also sheds light on early emergency management and government response on social media during global health crises.
随着新冠疫情的爆发,这场大流行在推特等社交媒体上引发了大量讨论,随后又出现了许多相关的社交媒体分析。然而,尽管有如此丰富的研究,但对于社交网络上公众和官员的反应及其关联,尤其是在疫情早期爆发阶段,却很少有研究。在本文中,我们收集了2019年12月31日至2020年3月11日发布的总计9259861条与新冠疫情相关的英文推文,以探究疫情早期公众关注和新闻报道的参与动态。我们提出了一种简单的数值数据增强(ENDA)技术,用于生成新样本同时保持标签有效性。在使用深度模型(BERT)的文本分类任务中,它比一种更简单的数据增强方法表现更优。为了进一步证明ENDA的有效性,我们还在其他基准数据集上进行了实验和消融研究。新冠疫情推文的分类结果显示,重大事件引发了推文高峰,个人叙述的每日数量与新闻报道之间存在很强的正相关。我们认为,以1月20日和2月23日的转折点为界可分为三个时期,新闻报道的低水平表明1月和2月初政府错过了应对窗口。我们的研究不仅有助于更深入地理解疫情期间社交媒体上公众关注和新闻报道的动态模式及关系,还为全球卫生危机期间社交媒体上的早期应急管理和政府应对提供了启示。