Chandrasekaran Ranganathan, Mehta Vikalp, Valkunde Tejali, Moustakas Evangelos
Department of Information and Decision Sciences, University of Illinois at Chicago, Chicago, IL, United States.
Middlesex University Dubai, Dubai, United Arab Emirates.
J Med Internet Res. 2020 Oct 23;22(10):e22624. doi: 10.2196/22624.
With restrictions on movement and stay-at-home orders in place due to the COVID-19 pandemic, social media platforms such as Twitter have become an outlet for users to express their concerns, opinions, and feelings about the pandemic. Individuals, health agencies, and governments are using Twitter to communicate about COVID-19.
The aims of this study were to examine key themes and topics of English-language COVID-19-related tweets posted by individuals and to explore the trends and variations in how the COVID-19-related tweets, key topics, and associated sentiments changed over a period of time from before to after the disease was declared a pandemic.
Building on the emergent stream of studies examining COVID-19-related tweets in English, we performed a temporal assessment covering the time period from January 1 to May 9, 2020, and examined variations in tweet topics and sentiment scores to uncover key trends. Combining data from two publicly available COVID-19 tweet data sets with those obtained in our own search, we compiled a data set of 13.9 million English-language COVID-19-related tweets posted by individuals. We use guided latent Dirichlet allocation (LDA) to infer themes and topics underlying the tweets, and we used VADER (Valence Aware Dictionary and sEntiment Reasoner) sentiment analysis to compute sentiment scores and examine weekly trends for 17 weeks.
Topic modeling yielded 26 topics, which were grouped into 10 broader themes underlying the COVID-19-related tweets. Of the 13,937,906 examined tweets, 2,858,316 (20.51%) were about the impact of COVID-19 on the economy and markets, followed by spread and growth in cases (2,154,065, 15.45%), treatment and recovery (1,831,339, 13.14%), impact on the health care sector (1,588,499, 11.40%), and governments response (1,559,591, 11.19%). Average compound sentiment scores were found to be negative throughout the examined time period for the topics of spread and growth of cases, symptoms, racism, source of the outbreak, and political impact of COVID-19. In contrast, we saw a reversal of sentiments from negative to positive for prevention, impact on the economy and markets, government response, impact on the health care industry, and treatment and recovery.
Identification of dominant themes, topics, sentiments, and changing trends in tweets about the COVID-19 pandemic can help governments, health care agencies, and policy makers frame appropriate responses to prevent and control the spread of the pandemic.
由于新冠疫情大流行,行动受限且实施居家令,推特等社交媒体平台已成为用户表达对疫情的担忧、看法和感受的渠道。个人、卫生机构和政府都在利用推特交流新冠疫情相关信息。
本研究旨在探讨个人发布的与新冠疫情相关的英文推文的关键主题和话题,并探究从该疾病被宣布为大流行之前到之后的一段时间内,与新冠疫情相关的推文、关键话题及相关情绪的变化趋势和差异。
基于对英文新冠疫情相关推文的新兴研究潮流,我们进行了一项时间跨度为2020年1月1日至5月9日的时间评估,研究推文话题和情绪得分的变化以揭示关键趋势。我们将两个公开可用的新冠疫情推文数据集的数据与我们自己搜索获得的数据相结合,编制了一个包含1,390万条个人发布的与新冠疫情相关的英文推文的数据集。我们使用引导式潜在狄利克雷分配(LDA)来推断推文背后的主题和话题,并使用VADER(基于情感词典和情感推理器)情感分析来计算情感得分并研究17周的每周趋势。
主题建模产生了26个话题,这些话题被归为与新冠疫情相关推文背后的10个更广泛的主题。在1,393,7906条被检查的推文中,2,858,316条(20.51%)是关于新冠疫情对经济和市场的影响,其次是病例传播和增长(2,154,065条,15.45%)、治疗和康复(1,831,339条,13.14%)、对医疗保健部门的影响(1,588,499条,11.40%)以及政府应对措施(1,559,591条,11.19%)。在整个检查时间段内,发现病例传播和增长、症状、种族主义、疫情源头以及新冠疫情的政治影响等话题的平均复合情感得分均为负面。相比之下,我们看到预防、对经济和市场的影响、政府应对措施、对医疗保健行业的影响以及治疗和康复等话题的情绪从负面转为正面。
识别关于新冠疫情大流行的推文中的主导主题、话题、情绪和变化趋势,有助于政府、卫生保健机构和政策制定者制定适当的应对措施,以预防和控制疫情的传播。