Lane Lewis, Xaq Pitkow, Leila Wehbe
Neuroscience Institute, Carnegie Mellon University.
Machine Learning Dept., Carnegie Mellon University.
bioRxiv. 2025 Jun 3:2025.06.02.657514. doi: 10.1101/2025.06.02.657514.
How does the brain process language over time? Research suggests that natural human language is processed hierarchically across brain regions over time. However, attempts to characterize this computation have so far been limited to tightly controlled experimental settings that capture only a coarse picture of the brain dynamics underlying natural language comprehension. The recent emergence of LLM encoding models promises a new avenue for discovering and characterizing rich semantic information in the brain, yet interpretable methods for linking information in LLMs to language processing over time remain limited. In this work, we develop a low-rank tensor regression method that decomposes LLM encoding models into interpretable components of semantics, time, and brain-region activation, and apply it to a magnetoencephalography (MEG) dataset in which subjects listened to narrative stories. With only a few components, the method outperforms a standard ridge regression encoding model, suggesting that low-rank models provide a good inductive bias for language encoding. In addition, our method discovers a diverse spectrum of interpretable response components that are sensitive to a rich set of low-level and semantic language features, showing that it can separate distinct language-processing features in neural signals. After controlling for low-level auditory and sentence features, the method also captures semantic features better. By decomposing neural responses to language features, low-rank tensor encoding models yield improved encoding performance and interpretable processing components, making our method a useful tool for uncovering language processing in naturalistic settings.
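The decomposition described in the abstract can be illustrated with a small sketch. This is a toy example, not the authors' implementation: it assumes a rank-R CP factorization of the encoding weight tensor over stimulus-feature, time-lag, and sensor axes, fit here by alternating least squares on simulated data. All dimensions, variable names, and the fitting procedure are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy dimensions: samples, stimulus features, time lags,
# MEG sensors, and decomposition rank. A real MEG dataset is far larger.
N, F, T, S, R = 200, 10, 5, 8, 2

# Simulated lagged stimulus features and a ground-truth rank-R weight tensor
X = rng.standard_normal((N, F, T))
A0 = rng.standard_normal((F, R))   # "semantic" (feature) factors
B0 = rng.standard_normal((T, R))   # temporal-response factors
C0 = rng.standard_normal((S, R))   # sensor-topography factors
W0 = np.einsum('fr,tr,sr->fts', A0, B0, C0)
Y = np.einsum('nft,fts->ns', X, W0) + 0.1 * rng.standard_normal((N, S))

# Alternating least squares: with two factor matrices fixed, the prediction
# is linear in the third, so each update reduces to an ordinary
# least-squares solve.
A = rng.standard_normal((F, R))
B = rng.standard_normal((T, R))
C = rng.standard_normal((S, R))
for sweep in range(10):
    # Solve for the feature factors A with B, C fixed
    M = np.einsum('nft,tr,sr->nsfr', X, B, C).reshape(N * S, F * R)
    A = np.linalg.lstsq(M, Y.reshape(N * S), rcond=None)[0].reshape(F, R)
    # Solve for the temporal factors B with A, C fixed
    M = np.einsum('nft,fr,sr->nstr', X, A, C).reshape(N * S, T * R)
    B = np.linalg.lstsq(M, Y.reshape(N * S), rcond=None)[0].reshape(T, R)
    # Solve for the sensor factors C with A, B fixed
    Z = np.einsum('nft,fr,tr->nr', X, A, B)        # per-component drive, (N, R)
    C = np.linalg.lstsq(Z, Y, rcond=None)[0].T     # (S, R)

W = np.einsum('fr,tr,sr->fts', A, B, C)
mse = np.mean((np.einsum('nft,fts->ns', X, W) - Y) ** 2)
print(f"residual MSE: {mse:.4f}")
print(f"full weight tensor: {F*T*S} params, rank-{R} model: {R*(F+T+S)} params")
```

The parameter-count comparison in the last line shows the inductive bias the abstract alludes to: a full encoding weight tensor needs F·T·S parameters per model, while a rank-R decomposition needs only R·(F+T+S), and each learned component separates into a feature profile, a temporal response, and a sensor topography that can be inspected independently.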