Wan Minyu, Su Qi, Xiang Rong, Huang Chu-Ren
Department of Chinese and Bilingual Studies, The Hong Kong Polytechnic University, Hong Kong, China.
School of Foreign Languages, Peking University, Beijing, China.
Int J Data Sci Anal. 2023;15(3):313-327. doi: 10.1007/s41060-022-00339-8. Epub 2022 Jun 14.
The rampant of COVID-19 infodemic has almost been simultaneous with the outbreak of the pandemic. Many concerted efforts are made to mitigate its negative effect to information credibility and data legitimacy. Existing work mainly focuses on fact-checking algorithms or multi-class labeling models that are less aware of the intrinsic characteristics of the language. Nor is it discussed how such representations can account for the common psycho-socio-behavior of the information consumers. This work takes a data-driven analytical approach to (1) describe the prominent lexical and grammatical features of COVID-19 misinformation; (2) interpret the underlying (psycho-)linguistic triggers in terms of sentiment, power and activity based on the affective control theory; (3) study the feature indexing for anti-infodemic modeling. The results show distinct language generalization patterns of misinformation of favoring evaluative terms and multimedia devices in delivering a negative sentiment. Such appeals are effective to arouse people's sympathy toward the vulnerable community and foment their spreading behavior.
新冠疫情信息疫情的肆虐几乎与疫情大流行同时发生。人们做出了许多协同努力来减轻其对信息可信度和数据合法性的负面影响。现有工作主要集中在事实核查算法或多类标签模型上,这些模型对语言的内在特征了解较少。也没有讨论这些表征如何能够解释信息消费者的常见心理社会行为。这项工作采用数据驱动的分析方法来:(1)描述新冠疫情错误信息突出的词汇和语法特征;(2)基于情感控制理论从情感、权力和活跃度方面解释潜在的(心理)语言触发因素;(3)研究抗信息疫情建模的特征索引。结果显示,错误信息有明显的语言概括模式,倾向于使用评价性词语和多媒体手段来传递负面情绪。这种诉求有效地唤起了人们对弱势群体的同情,并助长了他们的传播行为。