Suppr超能文献

一种用于早期新冠疫情推文的简单数字数据增强方法,以探索公众关注和新闻报道的参与动态。

An easy numeric data augmentation method for early-stage COVID-19 tweets exploration of participatory dynamics of public attention and news coverage.

作者信息

Chen Yuan, Zhang Zhisheng

机构信息

School of Mechanical Engineering, Southeast University, Nanjing, China.

出版信息

Inf Process Manag. 2022 Nov;59(6):103073. doi: 10.1016/j.ipm.2022.103073. Epub 2022 Aug 29.

Abstract

With the onset of COVID-19, the pandemic has aroused huge discussions on social media like Twitter, followed by many social media analyses concerning it. Despite such an abundance of studies, however, little work has been done on reactions from the public and officials on social networks and their associations, especially during the early outbreak stage. In this paper, a total of 9,259,861 COVID-19-related English tweets published from 31 December 2019 to 11 March 2020 are accumulated for exploring the participatory dynamics of public attention and news coverage during the early stage of the pandemic. An easy numeric data augmentation (ENDA) technique is proposed for generating new samples while preserving label validity. It attains superior performance on text classification tasks with deep models (BERT) than an easier data augmentation method. To demonstrate the efficacy of ENDA further, experiments and ablation studies have also been implemented on other benchmark datasets. The classification results of COVID-19 tweets show tweets peaks trigged by momentous events and a strong positive correlation between the daily number of personal narratives and news reports. We argue that there were three periods divided by the turning points on January 20 and February 23 and the low level of news coverage suggests the missed windows for government response in early January and February. Our study not only contributes to a deeper understanding of the dynamic patterns and relationships of public attention and news coverage on social media during the pandemic but also sheds light on early emergency management and government response on social media during global health crises.

摘要

随着新冠疫情的爆发,这场大流行在推特等社交媒体上引发了大量讨论,随后又出现了许多相关的社交媒体分析。然而,尽管有如此丰富的研究,但对于社交网络上公众和官员的反应及其关联,尤其是在疫情早期爆发阶段,却很少有研究。在本文中,我们收集了2019年12月31日至2020年3月11日发布的总计9259861条与新冠疫情相关的英文推文,以探究疫情早期公众关注和新闻报道的参与动态。我们提出了一种简单的数值数据增强(ENDA)技术,用于生成新样本同时保持标签有效性。在使用深度模型(BERT)的文本分类任务中,它比一种更简单的数据增强方法表现更优。为了进一步证明ENDA的有效性,我们还在其他基准数据集上进行了实验和消融研究。新冠疫情推文的分类结果显示,重大事件引发了推文高峰,个人叙述的每日数量与新闻报道之间存在很强的正相关。我们认为,以1月20日和2月23日的转折点为界可分为三个时期,新闻报道的低水平表明1月和2月初政府错过了应对窗口。我们的研究不仅有助于更深入地理解疫情期间社交媒体上公众关注和新闻报道的动态模式及关系,还为全球卫生危机期间社交媒体上的早期应急管理和政府应对提供了启示。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b884/9420706/50c14ad73626/gr1_lrg.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验