Department of Computer Science and Engineering, University of South Florida, Tampa, FL, USA.
Department of Mental Health Law and Policy, University of South Florida, Tampa, FL, USA.
Sci Rep. 2023 Jul 2;13(1):10694. doi: 10.1038/s41598-023-37592-1.
According to data from the U.S. Center for Disease Control and Prevention, as of June 2020, a significant number of African Americans had been infected with the coronavirus disease, experiencing disproportionately higher death rates compared to other demographic groups. These disparities highlight the urgent need to examine the experiences, behaviors, and opinions of the African American population in relation to the COVID-19 pandemic. By understanding their unique challenges in navigating matters of health and well-being, we can work towards promoting health equity, eliminating disparities, and addressing persistent barriers to care. Since Twitter data has shown significant promise as a representation of human behavior and for opinion mining, this study leverages Twitter data published in 2020 to characterize the pandemic-related experiences of the United States' African American population using aspect-based sentiment analysis. Sentiment analysis is a common task in natural language processing that identifies the emotional tone (i.e., positive, negative, or neutral) of a text sample. Aspect-based sentiment analysis increases the granularity of sentiment analysis by also extracting the aspect for which sentiment is expressed. We developed a machine learning pipeline consisting of image and language-based classification models to filter out tweets not related to COVID-19 and those unlikely published by African American Twitter subscribers, leading to an analysis of nearly 4 million tweets. Overall, our results show that the majority of tweets had a negative tone, and that the days with larger numbers of published tweets often coincided with major U.S. events related to the pandemic as suggested by major news headlines (e.g., vaccine rollout). We also show how word usage evolved throughout the year (e.g., outbreak to pandemic and coronavirus to covid). This work also points to important issues like food insecurity and vaccine hesitation, along with exposing semantic relationships between words, such as covid and exhausted. As such, this work furthers understanding of how the nationwide progression of the pandemic may have impacted the narratives of African American Twitter users.
根据美国疾病控制与预防中心的数据,截至 2020 年 6 月,大量非裔美国人感染了冠状病毒病,与其他人群相比,他们的死亡率不成比例地更高。这些差异突出表明,迫切需要研究非裔美国人在 COVID-19 大流行期间的经历、行为和意见。通过了解他们在应对健康和福祉问题方面的独特挑战,我们可以努力促进健康公平、消除差异,并解决持续存在的护理障碍。由于 Twitter 数据已显示出作为代表人类行为和意见挖掘的重要潜力,因此本研究利用 2020 年发布的 Twitter 数据,使用基于方面的情感分析来描述美国非裔人口与大流行相关的经历。情感分析是自然语言处理中的一项常见任务,用于识别文本样本的情感基调(即积极、消极或中性)。基于方面的情感分析通过提取表达情感的方面,增加了情感分析的粒度。我们开发了一个由图像和语言分类模型组成的机器学习管道,用于过滤与 COVID-19 无关的推文和不太可能由非裔美国 Twitter 用户发布的推文,从而对近 400 万条推文进行了分析。总的来说,我们的结果表明,大多数推文的基调都是负面的,而且发布推文数量较多的日子往往与大流行相关的美国重大事件相吻合,这可以从主要新闻头条(例如疫苗推出)中得到证实。我们还展示了整个一年中词汇用法的演变(例如,outbreak 到 pandemic 和 coronavirus 到 covid)。这项工作还指出了一些重要问题,例如粮食不安全和疫苗犹豫,以及揭示了单词之间的语义关系,例如 covid 和 exhausted。因此,这项工作进一步了解了全国范围内大流行的进展如何影响非裔美国 Twitter 用户的叙述。