Acciai Alessandro, Guerrisi Lucia, Perconti Pietro, Plebe Alessio, Suriano Rossella, Velardi Andrea
Department of Cognitive Science, University of Messina, Messina, Italy.
Department of Humanities Motor Sciences and Education, University Niccolò Cusano, Rome, Italy.
Front Psychol. 2025 Apr 1;16:1572076. doi: 10.3389/fpsyg.2025.1572076. eCollection 2025.
Neural language models, although at first approximation they may be simply described as predictors of the next token in a given sequence, surprisingly exhibit linguistic behaviors akin to human ones. This suggests the existence of an underlying sophisticated cognitive system in language production. This intriguing circumstance has inspired the adoption of psychological theories as investigative tools and the present research falls within this line of inquiry. What we aim to establish is the potential existence of a core of coherent integration in language production, metaphorically parallel to a human speaker's personal identity. To investigate this, we employed a well-established psychological theory on narrative coherence in autobiographical stories. This theory offers the theoretical advantage of a strong correlation between narrative coherence and a high integrative level of the personal knowledge system. It also provides the empirical advantage of methodologies for quantifying coherence and its characteristic dimensions through the analysis of autobiographical texts. The same methodology was applied to 2010 autobiographical stories generated by GPT-3.5 and an equal number from GPT-4, elicited by asking the models to assume roles that included a variety of variables such as gender, mood, and age. The large number of stories ensures adequate sampling given the stochastic nature of the models, and was made possible thanks to the adoption of an automated coherence evaluation procedure. We initially asked the models to generate 192 autobiographical stories, which were then analyzed by a team of professional psychologists. Based on this sample, we constructed a training set for the fine-tuning of GPT-3.5 as an automatic evaluator. Our results from the 4020 autobiographical stories overall show a level of narrative coherence in the models fully in line with data on human subjects, with slightly higher values in the case of GPT-4. These results suggest a high level of knowledge unification in the models, comparable to the integration of the self in human beings.
神经语言模型,尽管乍一看它们可能仅仅被描述为给定序列中下一个 tokens 的预测器,但令人惊讶的是,它们表现出与人类相似的语言行为。这表明在语言生成中存在一个潜在的复杂认知系统。这种有趣的情况激发了人们采用心理学理论作为研究工具,而本研究就属于这一探究范畴。我们旨在确定的是,在语言生成中是否存在一个连贯整合的核心,打个比方,它类似于人类说话者的个人身份。为了对此进行研究,我们采用了一种关于自传故事中叙事连贯性的成熟心理学理论。该理论具有理论优势,即叙事连贯性与个人知识系统的高整合水平之间存在很强的相关性。它还具有实证优势,即通过对自传文本的分析,提供了量化连贯性及其特征维度的方法。同样的方法被应用于 GPT - 3.5 生成的 2010 篇自传故事以及数量相等的 GPT - 4 生成的自传故事,这些故事是通过要求模型扮演包含性别、情绪和年龄等各种变量的角色而产生的。鉴于模型的随机性,大量的故事确保了足够的样本量,并且由于采用了自动连贯性评估程序才得以实现。我们最初要求模型生成 192 篇自传故事,然后由一组专业心理学家进行分析。基于这个样本,我们构建了一个训练集,用于对 GPT - 3.5 进行微调,使其成为一个自动评估器。我们从这 4020 篇自传故事中得到的总体结果表明,模型中的叙事连贯水平与人类受试者的数据完全一致,GPT - 4 的情况略高一些。这些结果表明模型中存在高度的知识统一,类似于人类自我的整合。