Leibniz Research Centre for Working Environment and Human Factors at the Technical University of Dortmund, Dortmund, Germany.
Institute of Behavioral Science and Technology, University of St. Gallen, St. Gallen, Switzerland.
Sci Rep. 2023 Mar 28;13(1):5035. doi: 10.1038/s41598-023-32248-6.
Modern large language models generate texts that are virtually indistinguishable from those written by humans and achieve near-human performance in comprehension and reasoning tests. Yet, their complexity makes it difficult to explain and predict their functioning. We examined a state-of-the-art language model (GPT-3) using lexical decision tasks widely used to study the structure of semantic memory in humans. The results of four analyses showed that GPT-3's patterns of semantic activation are broadly similar to those observed in humans, showing significantly higher semantic activation in related (e.g., "lime-lemon") word pairs than in other-related (e.g., "sour-lemon") or unrelated (e.g., "tourist-lemon") word pairs. However, there are also significant differences between GPT-3 and humans. GPT-3's semantic activation is better predicted by similarity in words' meaning (i.e., semantic similarity) rather than their co-occurrence in the language (i.e., associative similarity). This suggests that GPT-3's semantic network is organized around word meaning rather than their co-occurrence in text.
现代大型语言模型生成的文本几乎与人类撰写的文本无法区分,并且在理解和推理测试中达到了接近人类的水平。然而,它们的复杂性使得解释和预测其功能变得困难。我们使用广泛用于研究人类语义记忆结构的词汇判断任务来研究一种最先进的语言模型(GPT-3)。四项分析的结果表明,GPT-3 的语义激活模式与人类观察到的模式大致相似,在相关(例如,“lime-lemon”)词对中的语义激活显著高于其他相关(例如,“sour-lemon”)或不相关(例如,“tourist-lemon”)词对。然而,GPT-3 和人类之间也存在显著差异。GPT-3 的语义激活可以更好地通过单词含义(即语义相似性)的相似性来预测,而不是通过它们在语言中的共现(即联想相似性)来预测。这表明 GPT-3 的语义网络是围绕单词的含义而不是它们在文本中的共现组织的。