• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

探索汉语句子阅读过程中从上下文嵌入计算出的特征与脑电频段功率之间的关系。

Exploring the relationship between features calculated from contextual embeddings and EEG band power during sentence reading in Chinese.

作者信息

Wang Yao, Xue Tiantian, Yang Xingyu

机构信息

Cognitive Science and Allied Health School, Beijing Language and Culture University, Beijing, China.

Institute of Life and Health Sciences, Beijing Language and Culture University, Beijing, China.

出版信息

Front Neurosci. 2025 Jul 30;19:1656519. doi: 10.3389/fnins.2025.1656519. eCollection 2025.

DOI:10.3389/fnins.2025.1656519
PMID:40809397
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12343585/
Abstract

INTRODUCTION

Contextual embeddings-a core component of large language models (LLMs) that generate dynamic vector representations capturing words' semantic properties-have demonstrated structural similarities to brain activity patterns at the single-word level. This alignment supports the theoretical framework proposing vector-based neural coding for natural language processing in the brain, where linguistic units may be represented as context-sensitive vectors analogous to LLM-derived embeddings. Building on this framework, we hypothesize that cumulative distance metrics between contextual embeddings of adjacent linguistic units (words/Chinese characters) in sentence contexts may quantitatively reflect neural activation intensity during reading comprehension.

METHODS

Using large-scale EEG datasets collected during reading tasks, we systematically investigated the relationship between these computationally derived distance features and frequency-specific band power measures associated with neural activity.

RESULTS

In conclusion, gamma-band power exhibited associations with various NLP features in the ChineseEEG dataset, whereas no comparable gamma-specific effects were observed in the ZuCo1.0 dataset. Additionally, significant effects were found in other frequency bands for both datasets.

DISCUSSION

The mixed yet intriguing results invite a deeper discussion of the directional associations (positive/negative) observed in Gamma and other frequency bands, their cognitive implications, and the potential influence of textual characteristics on these findings. While observed effects may be somehow text- or dataset- dependent, our analyses revealed associations between various distance metrics and neural responses, consistent with predictions derived from the vector-based neural coding framework.

摘要

引言

上下文嵌入——大语言模型(LLMs)的核心组成部分,它生成捕捉单词语义属性的动态向量表示——已在单字层面展现出与大脑活动模式的结构相似性。这种一致性支持了为大脑中的自然语言处理提出基于向量的神经编码的理论框架,在该框架中,语言单元可表示为类似于基于大语言模型得出的嵌入的上下文敏感向量。基于此框架,我们假设句子语境中相邻语言单元(单词/汉字)的上下文嵌入之间的累积距离度量可能定量反映阅读理解过程中的神经激活强度。

方法

利用在阅读任务期间收集的大规模脑电图数据集,我们系统地研究了这些通过计算得出的距离特征与与神经活动相关的特定频率带功率测量值之间的关系。

结果

总之,在中文脑电图数据集中,伽马波段功率与各种自然语言处理特征存在关联,而在ZuCo1.0数据集中未观察到类似的特定于伽马的效应。此外,在两个数据集的其他频段也发现了显著效应。

讨论

这些复杂而有趣的结果引发了对在伽马和其他频段观察到的方向性关联(正/负)、它们的认知意义以及文本特征对这些发现的潜在影响的更深入讨论。虽然观察到的效应可能在某种程度上依赖于文本或数据集,但我们的分析揭示了各种距离度量与神经反应之间的关联,这与基于向量的神经编码框架得出的预测一致。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10ac/12343585/016fc290dd7d/fnins-19-1656519-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10ac/12343585/eb881a1b38a1/fnins-19-1656519-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10ac/12343585/368547ae7949/fnins-19-1656519-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10ac/12343585/6bcf943a16c6/fnins-19-1656519-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10ac/12343585/016fc290dd7d/fnins-19-1656519-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10ac/12343585/eb881a1b38a1/fnins-19-1656519-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10ac/12343585/368547ae7949/fnins-19-1656519-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10ac/12343585/6bcf943a16c6/fnins-19-1656519-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/10ac/12343585/016fc290dd7d/fnins-19-1656519-g004.jpg

相似文献

1
Exploring the relationship between features calculated from contextual embeddings and EEG band power during sentence reading in Chinese.探索汉语句子阅读过程中从上下文嵌入计算出的特征与脑电频段功率之间的关系。
Front Neurosci. 2025 Jul 30;19:1656519. doi: 10.3389/fnins.2025.1656519. eCollection 2025.
2
Short-Term Memory Impairment短期记忆障碍
3
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
4
Psychometric Evaluation of Large Language Model Embeddings for Personality Trait Prediction.用于人格特质预测的大语言模型嵌入的心理测量评估
J Med Internet Res. 2025 Jul 8;27:e75347. doi: 10.2196/75347.
5
Algorithmic Classification of Psychiatric Disorder-Related Spontaneous Communication Using Large Language Model Embeddings: Algorithm Development and Validation.使用大语言模型嵌入对精神障碍相关自发交流进行算法分类:算法开发与验证
JMIR AI. 2025 May 30;4:e67369. doi: 10.2196/67369.
6
Coherence and comprehensibility: Large language models predict lay understanding of health-related content.连贯性与可理解性:大型语言模型预测公众对健康相关内容的理解。
J Biomed Inform. 2025 Jan;161:104758. doi: 10.1016/j.jbi.2024.104758. Epub 2024 Dec 9.
7
Classification of finger movements through optimal EEG channel and feature selection.通过最优脑电图通道和特征选择对手指运动进行分类。
Front Hum Neurosci. 2025 Jul 16;19:1633910. doi: 10.3389/fnhum.2025.1633910. eCollection 2025.
8
Sexual Harassment and Prevention Training性骚扰与预防培训
9
Gender differences in the context of interventions for improving health literacy in migrants: a qualitative evidence synthesis.移民健康素养提升干预措施背景下的性别差异:一项定性证据综合分析
Cochrane Database Syst Rev. 2024 Dec 12;12(12):CD013302. doi: 10.1002/14651858.CD013302.pub2.
10
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.

本文引用的文献

1
ChineseEEG: A Chinese Linguistic Corpora EEG Dataset for Semantic Alignment and Neural Decoding.中文 EEG:用于语义对齐和神经解码的中文语言语料库 EEG 数据集。
Sci Data. 2024 May 29;11(1):550. doi: 10.1038/s41597-024-03398-7.
2
Strong Prediction: Language Model Surprisal Explains Multiple N400 Effects.强预测:语言模型意外值解释多种N400效应。
Neurobiol Lang (Camb). 2024 Apr 1;5(1):107-135. doi: 10.1162/nol_a_00105. eCollection 2024.
3
Alignment of brain embeddings and artificial contextual embeddings in natural language points to common geometric patterns.
脑嵌入和自然语言中人工上下文嵌入的对齐指向共同的几何模式。
Nat Commun. 2024 Mar 30;15(1):2768. doi: 10.1038/s41467-024-46631-y.
4
Entorhinal cortical delta oscillations drive memory consolidation.内嗅皮层的三角波震荡驱动记忆巩固。
Cell Rep. 2023 Oct 31;42(10):113267. doi: 10.1016/j.celrep.2023.113267. Epub 2023 Oct 14.
5
Hippocampal Theta and Episodic Memory.海马θ节律与情景记忆。
J Neurosci. 2023 Jan 25;43(4):613-620. doi: 10.1523/JNEUROSCI.1045-22.2022. Epub 2022 Dec 8.
6
A hierarchy of linguistic predictions during natural language comprehension.自然语言理解过程中的语言预测层次。
Proc Natl Acad Sci U S A. 2022 Aug 9;119(32):e2201968119. doi: 10.1073/pnas.2201968119. Epub 2022 Aug 3.
7
Using word embeddings to investigate cultural biases.利用词嵌入来研究文化偏见。
Br J Soc Psychol. 2023 Jan;62(1):617-629. doi: 10.1111/bjso.12560. Epub 2022 Jul 23.
8
Cortical beta-band power modulates with uncertainty in effector selection during motor planning.皮层β波段功率在运动规划中随效应器选择的不确定性而变化。
J Neurophysiol. 2021 Dec 1;126(6):1891-1902. doi: 10.1152/jn.00198.2021. Epub 2021 Nov 3.
9
Beta-band power modulation in the human hippocampus during a reaching task.人类海马体在伸展任务中的β波段功率调制。
J Neural Eng. 2020 Jun 12;17(3):036022. doi: 10.1088/1741-2552/ab937f.
10
ZuCo, a simultaneous EEG and eye-tracking resource for natural sentence reading.ZuCo,一个用于自然句阅读的同时 EEG 和眼动追踪资源。
Sci Data. 2018 Dec 11;5:180291. doi: 10.1038/sdata.2018.291.