Dimensionality and Ramping: Signatures of Sentence Integration in the Dynamics of Brains and Deep Language Models.

Affiliations

Meta AI Research, Paris 75002, France; and Cognitive Neuroimaging Unit, NeuroSpin center, 91191 Gif-sur-Yvette, France.

Cognitive Neuroimaging Unit, NeuroSpin center, 91191 Gif-sur-Yvette, France.

Publication information

J Neurosci. 2023 Jul 19;43(29):5350-5364. doi: 10.1523/JNEUROSCI.1163-22.2023. Epub 2023 May 22.

Abstract

A sentence is more than the sum of its words: its meaning depends on how they combine with one another. The brain mechanisms underlying such semantic composition remain poorly understood. To shed light on the neural vector code underlying semantic composition, we introduce two hypotheses: (1) the intrinsic dimensionality of the space of neural representations should increase as a sentence unfolds, paralleling the growing complexity of its semantic representation; and (2) this progressive integration should be reflected in ramping and sentence-final signals. To test these predictions, we designed a dataset of closely matched normal and jabberwocky sentences (composed of meaningless pseudo words) and displayed them to deep language models and to 11 human participants (5 men and 6 women) monitored with simultaneous MEG and intracranial EEG. In both deep language models and electrophysiological data, we found that representational dimensionality was higher for meaningful sentences than jabberwocky. Furthermore, multivariate decoding of normal versus jabberwocky confirmed three dynamic patterns: (1) a phasic pattern following each word, peaking in temporal and parietal areas; (2) a ramping pattern, characteristic of bilateral inferior and middle frontal gyri; and (3) a sentence-final pattern in left superior frontal gyrus and right orbitofrontal cortex. These results provide a first glimpse into the neural geometry of semantic integration and constrain the search for a neural code of linguistic composition.

SIGNIFICANCE STATEMENT Starting from general linguistic concepts, we make two sets of predictions in neural signals evoked by reading multiword sentences. First, the intrinsic dimensionality of the representation should grow with additional meaningful words. Second, the neural dynamics should exhibit signatures of encoding, maintaining, and resolving semantic composition. We successfully validated these hypotheses in deep neural language models, artificial neural networks trained on text and performing very well on many natural language processing tasks. Then, using a unique combination of MEG and intracranial electrodes, we recorded high-resolution brain data from human participants while they read a controlled set of sentences. Time-resolved dimensionality analysis showed increasing dimensionality with meaning, and multivariate decoding allowed us to isolate the three dynamical patterns we had hypothesized.
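Hypothesis (1) can be made concrete against a public language model. The sketch below is a minimal illustration, not the paper's pipeline: it assumes a Hugging Face GPT-2 model, a handful of invented normal/pseudo-word sentence pairs, and the participation ratio of the covariance eigenvalue spectrum as the intrinsic-dimensionality estimate (one common estimator among several; the study's own estimator may differ).

```python
# Sketch: compare the intrinsic dimensionality of sentence representations
# for meaningful vs jabberwocky sentences in GPT-2. All stimuli below are
# invented placeholders, not the paper's materials.
import numpy as np
import torch
from transformers import GPT2Model, GPT2Tokenizer

normal = [
    "the tired lawyer lost the final case",
    "a young chef cooked the evening meal",
    "the small child drew a bright picture",
    "an old sailor told a strange story",
]
jabber = [
    "the tibed lawner lost the fimal cose",
    "a yonng chep cooged the ovening mial",
    "the smoll chint drew a brish pinture",
    "an old soiler tolk a strunge stary",
]

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2Model.from_pretrained("gpt2").eval()

def sentence_states(sentences, layer=6):
    """One vector per sentence: the layer-`layer` hidden state at the last token."""
    vecs = []
    for s in sentences:
        inputs = tokenizer(s, return_tensors="pt")
        with torch.no_grad():
            out = model(**inputs, output_hidden_states=True)
        vecs.append(out.hidden_states[layer][0, -1].numpy())
    return np.stack(vecs)

def participation_ratio(X):
    """Dimensionality as (sum lambda)^2 / sum(lambda^2) over covariance eigenvalues."""
    lam = np.clip(np.linalg.eigvalsh(np.cov(X.T)), 0.0, None)
    return lam.sum() ** 2 / (lam ** 2).sum()

print("normal      PR:", participation_ratio(sentence_states(normal)))
print("jabberwocky PR:", participation_ratio(sentence_states(jabber)))
```

With only a few invented sentences the estimate is trivially capped by the sample size; a real analysis would use hundreds of matched sentences, and repeating the computation on growing sentence prefixes would probe the predicted word-by-word increase in dimensionality.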

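The three dynamic patterns were identified by decoding normal versus jabberwocky from the neural signals over time. As a hedged illustration of that style of analysis, the sketch below trains a cross-validated logistic-regression decoder independently at each time sample; the epochs are synthetic stand-ins (trials x sensors x time) with an injected ramping effect, and with real MEG recordings one would typically use MNE-Python's mne.decoding.SlidingEstimator rather than this hand-rolled loop.

```python
# Sketch: time-resolved decoding of condition (normal vs jabberwocky)
# from synthetic MEG-like epochs.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
n_trials, n_sensors, n_times = 200, 64, 50

# Synthetic epochs: trials x sensors x time; label 1 = normal, 0 = jabberwocky.
X = rng.standard_normal((n_trials, n_sensors, n_times))
y = rng.integers(0, 2, size=n_trials)

# Inject a ramping condition effect into a subset of sensors for normal trials.
ramp = np.linspace(0.0, 0.8, n_times)
X[y == 1, :8, :] += ramp

# Decode condition separately at every time sample (5-fold cross-validated AUC).
clf = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
auc = np.array([
    cross_val_score(clf, X[:, :, t], y, cv=5, scoring="roc_auc").mean()
    for t in range(n_times)
])
print(f"chance ~0.50; peak AUC {auc.max():.2f} at sample {auc.argmax()}")
```

In such a decoding time course, a phasic effect appears as a peak after each word, a ramping effect as accuracy that climbs across the sentence, and a sentence-final effect as a peak after the last word.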

