Zhou Di, Zhang Yinxian
Department of Sociology, New York University, New York, NY, 10003, USA.
Department of Sociology, Queens College, City University of New York (CUNY), New York, NY, 11367, USA.
Sci Rep. 2024 Oct 23;14(1):25048. doi: 10.1038/s41598-024-76395-w.
The growing popularity of ChatGPT and other large language models (LLMs) has led to many studies investigating their susceptibility to mistakes and biases. However, most studies have focused on models trained exclusively on English texts. This is one of the first studies that investigates cross-language political biases and inconsistencies in LLMs, specifically GPT models. Using two languages, English and simplified Chinese, we asked GPT the same questions about political issues in the United States (U.S.) and China. We found that the bilingual models' political knowledge and attitude were significantly more inconsistent regarding political issues in China than those in the U.S. The Chinese model was the least negative toward China's problems, whereas the English model was the most critical of China. This disparity cannot be explained by GPT model robustness. Instead, it suggests that political factors such as censorship and geopolitical tensions may have influenced LLM performance. Moreover, both the Chinese and English models tended to be less critical of the issues of their "own country," represented by the language used, than of the issues of "the other country." This suggests that multilingual GPT models could develop an "in-group bias" based on their training language. We discuss the implications of our findings for information transmission in an increasingly divided world.
ChatGPT和其他大语言模型(LLM)越来越受欢迎,这引发了许多关于它们对错误和偏差敏感性的研究。然而,大多数研究都集中在仅以英文文本训练的模型上。这是首批研究大语言模型(特别是GPT模型)中跨语言政治偏见和不一致性的研究之一。我们使用英语和简体中文两种语言,就美国和中国的政治问题向GPT提出相同的问题。我们发现,双语模型在中国政治问题上的政治知识和态度比在美国政治问题上的政治知识和态度明显更不一致。中文模型对中国问题的负面评价最少,而英文模型对中国的批评最为严厉。这种差异无法用GPT模型的稳健性来解释。相反,这表明审查制度和地缘政治紧张局势等政治因素可能影响了大语言模型的表现。此外,中文和英文模型对以所用语言代表的“本国”问题的批评往往少于对“他国”问题的批评。这表明多语言GPT模型可能会基于其训练语言形成“群体内偏见”。我们讨论了我们的研究结果对在日益分化的世界中信息传播的影响。