Galgoczy Michael C, Phatak Atharva, Vinson Danielle, Mago Vijay K, Giabbanelli Philippe J
Department of Computer Science & Software Engineering, Miami University, Oxford, OH, United States.
Department of Computer Science, Lakehead University, Thunder Bay, ON, Canada.
PeerJ Comput Sci. 2022 Apr 15;8:e947. doi: 10.7717/peerj-cs.947. eCollection 2022.
Influencing and framing debates on Twitter provides power to shape public opinion. Bots have become essential tools of 'computational propaganda' on social media such as Twitter, often contributing to a large fraction of the tweets regarding political events such as elections. Although analyses have been conducted regarding the first impeachment of former president Donald Trump, they have been focused on either a manual examination of relatively few tweets to emphasize rhetoric, or the use of Natural Language Processing (NLP) of a much larger with respect to common metrics such as sentiment. In this paper, we complement existing analyses by examining the role of bots in the first impeachment with respect to three questions as follows. (Q1) Are bots actively involved in the debate? (Q2) Do bots target one political affiliation more than another? (Q3) Which sources are used by bots to support their arguments? Our methods start with collecting over 13M tweets on six key dates, from October 6th 2019 to January 21st 2020. We used machine learning to evaluate the sentiment of the tweets ( BERT) and whether it originates from a bot. We then examined these sentiments with respect to a balanced sample of Democrats and Republicans directly relevant to the impeachment, such as House Speaker Nancy Pelosi, senator Mitch McConnell, and (then former Vice President) Joe Biden. The content of posts from bots was further analyzed with respect to the sources used (with bias ratings from AllSides and Ad Fontes) and themes. Our first finding is that bots have played a significant role in contributing to the overall negative tone of the debate (Q1). Bots were targeting Democrats more than Republicans (Q2), as evidenced both by a difference in ratio (bots had more negative-to-positive tweets on Democrats than Republicans) and in composition (use of derogatory nicknames). Finally, the sources provided by bots were almost twice as likely to be from the right than the left, with a noticeable use of hyper-partisan right and most extreme right sources (Q3). Bots were thus purposely used to promote a misleading version of events. Overall, this suggests an intentional use of bots as part of a strategy, thus providing further confirmation that computational propaganda is involved in defining political events in the United States. As any empirical analysis, our work has several limitations. For example, Trump's rhetoric on Twitter has previously been characterized by an overly negative tone, thus tweets detected as negative may be echoing his message rather than acting against him. Previous works show that this possibility is limited, and its existence would only strengthen our conclusions. As our analysis is based on NLP, we focus on processing a large volume of tweets rather than manually reading all of them, thus future studies may complement our approach by using qualitative methods to assess the specific arguments used by bots.
在推特上影响和构建辩论能够塑造公众舆论。机器人已成为推特等社交媒体上“计算宣传”的重要工具,常常在有关选举等政治事件的大量推文发布中发挥作用。尽管针对前总统唐纳德·特朗普的首次弹劾进行了分析,但这些分析要么侧重于人工检查相对较少的推文以强调言辞,要么侧重于对大量推文使用自然语言处理(NLP)来分析诸如情感等常见指标。在本文中,我们通过研究机器人在首次弹劾中在以下三个问题上所起的作用来补充现有分析。(问题1)机器人是否积极参与辩论?(问题2)机器人是否更倾向于针对某一政治派别而非另一派别?(问题3)机器人使用哪些来源来支持其论点?我们的方法首先是在2019年10月6日至2020年1月21日这六个关键日期收集超过1300万条推文。我们使用机器学习来评估推文的情感(BERT)以及它是否源自机器人。然后,我们针对与弹劾直接相关的民主党人和共和党人的均衡样本,如众议院议长南希·佩洛西、参议员米奇·麦康奈尔以及(当时的前副总统)乔·拜登,来研究这些情感。对于机器人发布内容所使用的来源(根据AllSides和Ad Fontes的偏见评级)和主题进行了进一步分析。我们的第一个发现是,机器人在导致辩论的整体负面基调方面发挥了重要作用(问题1)。机器人针对民主党人的程度超过共和党人(问题2),这在比率差异(机器人针对民主党人的负面推文与正面推文的比例高于共和党人)和构成(使用贬损性昵称)两方面都得到了证明。最后,机器人提供的来源来自右翼的可能性几乎是左翼的两倍,明显使用了极端党派化的右翼和极右翼来源(问题3)。因此,机器人被故意用于传播具有误导性的事件版本。总体而言,这表明故意将机器人用作策略的一部分,从而进一步证实了计算宣传在美国政治事件定义过程中所起的作用。与任何实证分析一样,我们的工作存在一些局限性。例如,特朗普此前在推特上的言辞基调一直过于负面,因此被检测为负面的推文可能是在呼应他的信息而非反对他。先前的研究表明这种可能性有限,而且它的存在只会强化我们的结论。由于我们的分析基于自然语言处理,我们专注于处理大量推文而非人工阅读所有推文,因此未来的研究可以通过使用定性方法来评估机器人所使用的具体论点来补充我们的方法。