强预测：语言模型意外值解释多种N400效应。

Strong Prediction: Language Model Surprisal Explains Multiple N400 Effects.

作者信息

Michaelov James A, Bardolph Megan D, Van Petten Cyma K, Bergen Benjamin K, Coulson Seana

机构信息

Department of Cognitive Science, University of California, San Diego, La Jolla, CA, USA.

Department of Psychology, Binghamton University, State University of New York, Binghamton, NY, USA.

出版信息

Neurobiol Lang (Camb). 2024 Apr 1;5(1):107-135. doi: 10.1162/nol_a_00105. eCollection 2024.

DOI:10.1162/nol_a_00105

PMID:38645623

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11025652/

Abstract

Theoretical accounts of the N400 are divided as to whether the amplitude of the N400 response to a stimulus reflects the extent to which the stimulus was predicted, the extent to which the stimulus is semantically similar to its preceding context, or both. We use state-of-the-art machine learning tools to investigate which of these three accounts is best supported by the evidence. GPT-3, a neural language model trained to compute the conditional probability of any word based on the words that precede it, was used to operationalize contextual predictability. In particular, we used an information-theoretic construct known as surprisal (the negative logarithm of the conditional probability). Contextual semantic similarity was operationalized by using two high-quality co-occurrence-derived vector-based meaning representations for words: GloVe and fastText. The cosine between the vector representation of the sentence frame and final word was used to derive contextual cosine similarity estimates. A series of regression models were constructed, where these variables, along with cloze probability and plausibility ratings, were used to predict single trial N400 amplitudes recorded from healthy adults as they read sentences whose final word varied in its predictability, plausibility, and semantic relationship to the likeliest sentence completion. Statistical model comparison indicated GPT-3 surprisal provided the best account of N400 amplitude and suggested that apparently disparate N400 effects of expectancy, plausibility, and contextual semantic similarity can be reduced to variation in the predictability of words. The results are argued to support predictive coding in the human language network.

摘要

关于N400的理论解释存在分歧，即N400对刺激的反应幅度反映的是刺激被预测的程度、刺激与前文语境在语义上的相似程度，还是两者兼而有之。我们使用最先进的机器学习工具来研究这三种解释中哪一种最有证据支持。GPT-3是一种经过训练以根据前文单词计算任何单词的条件概率的神经语言模型，用于实现语境可预测性。具体而言，我们使用了一种称为意外性（条件概率的负对数）的信息论结构。通过使用两种基于高质量共现的单词向量表示法（GloVe和fastText）来实现语境语义相似性。句子框架和最后一个单词的向量表示之间的余弦用于得出语境余弦相似性估计值。构建了一系列回归模型，其中这些变量，连同完形填空概率和合理性评级，用于预测健康成年人阅读句子时记录的单次试验N400幅度，这些句子的最后一个单词在可预测性、合理性以及与最可能的句子完成形式的语义关系方面各不相同。统计模型比较表明，GPT-3意外性对N400幅度的解释最佳，并表明预期、合理性和语境语义相似性方面明显不同的N400效应可以归结为单词可预测性的变化。这些结果被认为支持人类语言网络中的预测编码。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d92a/11025652/dcfcc72cce81/nol-5-1-107-g001.jpg

相似文献

Strong Prediction: Language Model Surprisal Explains Multiple N400 Effects.

Neurobiol Lang (Camb). 2024 Apr 1;5(1):107-135. doi: 10.1162/nol_a_00105. eCollection 2024.

Prior Context and Individual Alpha Frequency Influence Predictive Processing during Language Comprehension.

J Cogn Neurosci. 2024 Sep 1;36(9):1898-1936. doi: 10.1162/jocn_a_02196.

Tracking Lexical and Semantic Prediction Error Underlying the N400 Using Artificial Neural Network Models of Sentence Processing.

Neurobiol Lang (Camb). 2024 Apr 1;5(1):136-166. doi: 10.1162/nol_a_00134. eCollection 2024.

Seeing words in context: the interaction of lexical and sentence level information during reading.

Brain Res Cogn Brain Res. 2004 Mar;19(1):59-73. doi: 10.1016/j.cogbrainres.2003.10.022.

Deep Artificial Neural Networks Reveal a Distributed Cortical Network Encoding Propositional Sentence-Level Meaning.

J Neurosci. 2021 May 5;41(18):4100-4119. doi: 10.1523/JNEUROSCI.1152-20.2021. Epub 2021 Mar 22.

On the Mathematical Relationship Between Contextual Probability and N400 Amplitude.

Open Mind (Camb). 2024 Jun 28;8:859-897. doi: 10.1162/opmi_a_00150. eCollection 2024.

The effects of processing requirements on neurophysiological responses to spoken sentences.

Brain Lang. 1990 Aug;39(2):302-18. doi: 10.1016/0093-934x(90)90016-a.

Hemispheric asymmetry in interpreting novel literal language: an event-related potential study.

Neuropsychologia. 2013 Apr;51(5):907-21. doi: 10.1016/j.neuropsychologia.2013.01.018. Epub 2013 Feb 1.

Predictability and novelty in literal language comprehension: an ERP study.

Brain Res. 2011 Oct 18;1418:70-82. doi: 10.1016/j.brainres.2011.07.039. Epub 2011 Jul 23.

Dissociable effects of prediction and integration during language comprehension: evidence from a large-scale study using brain potentials.

Philos Trans R Soc Lond B Biol Sci. 2020 Feb 3;375(1791):20180522. doi: 10.1098/rstb.2018.0522. Epub 2019 Dec 16.

引用本文的文献

Exploring the relationship between features calculated from contextual embeddings and EEG band power during sentence reading in Chinese.

Front Neurosci. 2025 Jul 30;19:1656519. doi: 10.3389/fnins.2025.1656519. eCollection 2025.

Sentence processing by humans and machines: Large language models as a tool to better understand human reading.

Psychon Bull Rev. 2025 Aug 13. doi: 10.3758/s13423-025-02756-9.

What's Surprising About Surprisal.

Comput Brain Behav. 2025;8(2):233-248. doi: 10.1007/s42113-025-00237-9. Epub 2025 Feb 21.

Turing Jest: Distributional Semantics and One-Line Jokes.

Cogn Sci. 2025 May;49(5):e70066. doi: 10.1111/cogs.70066.

The sociolinguistic foundations of language modeling.

Front Artif Intell. 2025 Jan 13;7:1472411. doi: 10.3389/frai.2024.1472411. eCollection 2024.

On the Mathematical Relationship Between Contextual Probability and N400 Amplitude.

Open Mind (Camb). 2024 Jun 28;8:859-897. doi: 10.1162/opmi_a_00150. eCollection 2024.

Clinical efficacy of pre-trained large language models through the lens of aphasia.

Sci Rep. 2024 Jul 6;14(1):15573. doi: 10.1038/s41598-024-66576-y.

Prediction during language comprehension: what is next?

Trends Cogn Sci. 2023 Nov;27(11):1032-1052. doi: 10.1016/j.tics.2023.08.003. Epub 2023 Sep 11.

Driving and suppressing the human language network using large language models.

bioRxiv. 2023 Oct 30:2023.04.16.537080. doi: 10.1101/2023.04.16.537080.

本文引用的文献

A Model of Online Temporal-Spatial Integration for Immediacy and Overrule in Discourse Comprehension.

Neurobiol Lang (Camb). 2021 Jan 1;2(1):83-105. doi: 10.1162/nol_a_00026. eCollection 2021.

Context-based facilitation of semantic access follows both logarithmic and linear functions of stimulus probability.

J Mem Lang. 2022 Apr;123. doi: 10.1016/j.jml.2021.104311. Epub 2021 Dec 20.

Comprehending surprising sentences: sensitivity of post-N400 positivities to contextual congruity and semantic relatedness.

Lang Cogn Neurosci. 2020;35(8):1044-1063. doi: 10.1080/23273798.2019.1708960. Epub 2020 Jan 6.

Using natural language processing to understand people and culture.

Am Psychol. 2022 May-Jun;77(4):525-537. doi: 10.1037/amp0000882. Epub 2021 Dec 16.

The N400 ERP component reflects an error-based implicit learning signal during language comprehension.

Eur J Neurosci. 2021 Nov;54(9):7125-7140. doi: 10.1111/ejn.15462. Epub 2021 Oct 4.

Connecting and considering: Electrophysiology provides insights into comprehension.

Psychophysiology. 2022 Jan;59(1):e13940. doi: 10.1111/psyp.13940. Epub 2021 Sep 14.

Shared understanding of color among sighted and blind adults.

Proc Natl Acad Sci U S A. 2021 Aug 17;118(33). doi: 10.1073/pnas.2020192118.

Word predictability effects are linear, not logarithmic: Implications for probabilistic models of sentence comprehension.

J Mem Lang. 2021 Feb;116. doi: 10.1016/j.jml.2020.104174. Epub 2020 Sep 18.

Tea With Milk? A Hierarchical Generative Framework of Sequential Event Comprehension.

Top Cogn Sci. 2021 Jan;13(1):256-298. doi: 10.1111/tops.12518. Epub 2020 Oct 6.

An exploratory data analysis of word form prediction during word-by-word reading.

Proc Natl Acad Sci U S A. 2020 Aug 25;117(34):20483-20494. doi: 10.1073/pnas.1922028117. Epub 2020 Aug 11.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

强预测：语言模型意外值解释多种N400效应。

Strong Prediction: Language Model Surprisal Explains Multiple N400 Effects.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献