Gu Chanyuan, Nastase Samuel A, Zada Zaid, Li Ping
Department of Language Science and Technology, The Hong Kong Polytechnic University, Hong Kong SAR, China.
Princeton Neuroscience Institute and Department of Psychology, Princeton University, Princeton, NJ, USA.
NPJ Sci Learn. 2025 Jul 10;10(1):46. doi: 10.1038/s41539-025-00337-y.
While evidence has accumulated to support the argument of shared computational mechanisms underlying language comprehension between humans and large language models (LLMs), few studies have examined this argument beyond native-speaker populations. This study examines whether and how alignment between LLMs and human brains captures the homogeneity and heterogeneity in both first-language (L1) and second-language (L2) readers. We recorded brain responses of L1 and L2 English readers of texts and assessed reading performance against individual difference factors. At the group level, the two groups displayed comparable model-brain alignment in widespread regions, with similar unique contributions from contextual embeddings. At the individual level, multiple regression models revealed the effects of linguistic abilities on alignment for both groups, but effects of attentional ability and language dominance status for L2 readers only. These findings provide evidence that LLMs serve as cognitively plausible models in characterizing homogeneity and heterogeneity in reading across human populations.
虽然已有证据支持人类与大语言模型(LLM)之间在语言理解方面存在共享计算机制的观点,但很少有研究在以母语为非英语的人群中检验这一观点。本研究考察了大语言模型与人类大脑之间的一致性是否以及如何体现了第一语言(L1)和第二语言(L2)读者的同质性和异质性。我们记录了L1和L2英语读者阅读文本时的大脑反应,并根据个体差异因素评估了阅读表现。在群体层面,两组在广泛区域表现出相当的模型-大脑一致性,上下文嵌入的独特贡献相似。在个体层面,多元回归模型揭示了语言能力对两组一致性的影响,但仅揭示了注意力能力和语言优势地位对L2读者一致性的影响。这些发现提供了证据,表明大语言模型在刻画人类群体阅读中的同质性和异质性方面是具有认知合理性的模型。