Britton James, Cong Yan, Hsu Yu-Yin, Chersoni Emmanuele, Blache Philippe
Department of Chinese and Bilingual Studies, The Hong Kong Polytechnic University, Hong Kong, China.
School of Languages and Cultures, Purdue University, West Lafayette, IN, United States.
Front Hum Neurosci. 2024 Sep 30;18:1363120. doi: 10.3389/fnhum.2024.1363120. eCollection 2024.
Psycholinguistic literature has consistently shown that humans rely on a rich and organized understanding of event knowledge to predict the forthcoming linguistic input during online sentence comprehension. We, the authors, expect sentences to maintain coherence with the preceding context, making congruent sentence sequences easier to process than incongruent ones. It is widely known that discourse relations between sentences (e.g., temporal, contingency, comparison) are generally made explicit through specific particles, known as , (e.g., ). However, some relations that are easily accessible to the speakers, given their event knowledge, can also be left implicit. The goal of this paper is to investigate the importance of discourse connectives in the prediction of events in human language processing and pretrained language models, with a specific focus on concessives and contrastives, which signal to comprehenders that their event-related predictions have to be . Inspired by previous work, we built a comprehensive set of story stimuli in Italian and Mandarin Chinese that differ in the plausibility and coherence of the situation being described and the presence or absence of a discourse connective. We collected plausibility judgments and reading times from native speakers for the stimuli. Moreover, we correlated the results of the experiments with the predictions given by computational modeling, using Surprisal scores obtained via Transformer-based language models. The human judgements were collected using a seven-point Likert scale and analyzed using cumulative link mixed modeling (CLMM), while the human reading times and language model surprisal scores were analyzed using linear mixed effects regression (LMER). We found that Chinese NLMs are sensitive to plausibility and connectives, although they struggle to reproduce expectation reversal effects due to a connective changing the plausibility of a given scenario; Italian results are even less aligned with human data, with no effects of either plausibility and connectives on Surprisal.
心理语言学文献一直表明,人类在在线句子理解过程中依靠对事件知识丰富且有组织的理解来预测即将到来的语言输入。我们这些作者期望句子与前文语境保持连贯,使一致的句子序列比不一致的句子序列更易于处理。众所周知,句子之间的语篇关系(如时间、偶然性、比较)通常通过特定的小品词明确表达,这些小品词被称为(例如,)。然而,鉴于说话者的事件知识,一些对他们来说容易理解的关系也可能是隐含的。本文的目的是研究语篇连接词在人类语言处理和预训练语言模型中事件预测的重要性,特别关注让步词和对比词,它们向理解者表明与事件相关的预测必须是。受先前工作的启发,我们用意大利语和汉语构建了一套全面的故事刺激材料,这些材料在描述的情境的合理性和连贯性以及是否存在语篇连接词方面存在差异。我们收集了以母语为母语的人对这些刺激材料的合理性判断和阅读时间。此外,我们将实验结果与计算建模给出的预测进行了关联,使用通过基于Transformer的语言模型获得的惊奇分数。人类判断使用七点李克特量表收集,并使用累积链接混合建模(CLMM)进行分析,而人类阅读时间和语言模型惊奇分数则使用线性混合效应回归(LMER)进行分析。我们发现,中文的神经语言模型对合理性和连接词敏感,尽管由于连接词改变了给定场景的合理性,它们难以再现期望反转效应;意大利语的结果与人类数据的一致性更低,合理性和连接词对惊奇分数均无影响。