On the influence of discourse connectives on the predictions of humans and language models.

Author information

Britton James, Cong Yan, Hsu Yu-Yin, Chersoni Emmanuele, Blache Philippe

Affiliations

Department of Chinese and Bilingual Studies, The Hong Kong Polytechnic University, Hong Kong, China.

School of Languages and Cultures, Purdue University, West Lafayette, IN, United States.

Publication information

Front Hum Neurosci. 2024 Sep 30;18:1363120. doi: 10.3389/fnhum.2024.1363120. eCollection 2024.

DOI:10.3389/fnhum.2024.1363120

PMID:39403701

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC11471541/

Abstract

Psycholinguistic literature has consistently shown that humans rely on rich, organized event knowledge to predict forthcoming linguistic input during online sentence comprehension. Comprehenders expect sentences to remain coherent with the preceding context, which makes congruent sentence sequences easier to process than incongruent ones. Discourse relations between sentences (e.g., temporal, contingency, comparison) are generally made explicit through specific particles, known as discourse connectives. However, some relations that are easily accessible to speakers, given their event knowledge, can also be left implicit. The goal of this paper is to investigate the importance of discourse connectives in the prediction of events in human language processing and in pretrained language models, with a specific focus on concessives and contrastives, which signal to comprehenders that their event-related predictions have to be reversed. Inspired by previous work, we built a comprehensive set of story stimuli in Italian and Mandarin Chinese that differ in the plausibility and coherence of the situation being described and in the presence or absence of a discourse connective. We collected plausibility judgments and reading times from native speakers for the stimuli. Moreover, we correlated the results of the experiments with the predictions given by computational modeling, using Surprisal scores obtained via Transformer-based language models. The human judgments were collected using a seven-point Likert scale and analyzed using cumulative link mixed modeling (CLMM), while the human reading times and language model Surprisal scores were analyzed using linear mixed effects regression (LMER). We found that Chinese neural language models (NLMs) are sensitive to plausibility and connectives, although they struggle to reproduce the expectation reversal effects that arise when a connective changes the plausibility of a given scenario; the Italian results are even less aligned with human data, with no effects of either plausibility or connectives on Surprisal.
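As a rough illustration of the Surprisal measure mentioned in the abstract: the surprisal of a token is commonly defined as the negative log probability the model assigns to it given the preceding context. The sketch below is not the authors' pipeline. It assumes base-2 logarithms, assumes that a region's score is the sum of its token surprisals, and uses made-up probabilities in place of real Transformer outputs; the function names are hypothetical.

```python
import math


def surprisal(prob: float) -> float:
    """Surprisal in bits of a token with conditional probability `prob`."""
    if not 0.0 < prob <= 1.0:
        raise ValueError("probability must be in (0, 1]")
    return -math.log2(prob)


def region_surprisal(token_probs):
    """Sum token-level surprisals over a target region (an assumed aggregation)."""
    return sum(surprisal(p) for p in token_probs)


# Hypothetical conditional probabilities for the tokens of a target region,
# standing in for what a language model would assign in context.
plausible_region = [0.5, 0.25]
implausible_region = [0.125, 0.0625]

print(region_surprisal(plausible_region))    # 1 + 2 bits
print(region_surprisal(implausible_region))  # 3 + 4 bits
```

Under these assumptions, a less expected continuation receives a higher score, which is the quantity that would then be entered into a regression analysis alongside the reading times.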

Similar articles

How Robust Is Discourse Processing for Native Readers? The Role of Connectives and the Coherence Relations They Convey.

Front Psychol. 2022 Feb 15;13:822151. doi: 10.3389/fpsyg.2022.822151. eCollection 2022.

Lexical and Structural Cues to Discourse Processing in First and Second Language.

Front Psychol. 2021 Jul 1;12:685491. doi: 10.3389/fpsyg.2021.685491. eCollection 2021.

Understanding Temporal Relations in Mandarin Chinese: An ERP Investigation.

Brain Sci. 2022 Apr 3;12(4):474. doi: 10.3390/brainsci12040474.

The effects of discourse coherence on the persistence of sentence structures.

J Exp Psychol Learn Mem Cogn. 2024 Jan;50(1):137-160. doi: 10.1037/xlm0001295. Epub 2023 Oct 26.

Discourse coherence modulates use of predictive processing during sentence comprehension.

Cognition. 2024 Jan;242:105637. doi: 10.1016/j.cognition.2023.105637. Epub 2023 Oct 17.

Predicting "When" in Discourse Engages the Human Dorsal Auditory Stream: An fMRI Study Using Naturalistic Stories.

J Neurosci. 2016 Nov 30;36(48):12180-12191. doi: 10.1523/JNEUROSCI.4100-15.2016.

[Function of connectives in text-understanding].

Shinrigaku Kenkyu. 1988 Oct;59(4):241-7. doi: 10.4992/jjpsy.59.241.

Causal sentence production in children with language impairments.

Int J Lang Commun Disord. 2007 Mar-Apr;42(2):155-86. doi: 10.1080/13682820600822281.

Anticipatory looks reveal expectations about discourse relations.

Cognition. 2014 Dec;133(3):667-91. doi: 10.1016/j.cognition.2014.08.012. Epub 2014 Sep 20.
