A deep hierarchy of predictions enables online meaning extraction in a computational model of human speech comprehension.

Affiliations

Department of Fundamental Neuroscience, Faculty of Medicine, University of Geneva, Geneva, Switzerland.

Swiss National Centre of Competence in Research "Evolving Language" (NCCR EvolvingLanguage), Geneva, Switzerland.

Publication information

PLoS Biol. 2023 Mar 22;21(3):e3002046. doi: 10.1371/journal.pbio.3002046. eCollection 2023 Mar.

DOI:10.1371/journal.pbio.3002046
PMID:36947552
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10079236/
Abstract

Understanding speech requires mapping fleeting and often ambiguous soundwaves to meaning. While humans are known to exploit their capacity to contextualize to facilitate this process, how internal knowledge is deployed online remains an open question. Here, we present a model that extracts multiple levels of information from continuous speech online. The model applies linguistic and nonlinguistic knowledge to speech processing, by periodically generating top-down predictions and incorporating bottom-up incoming evidence in a nested temporal hierarchy. We show that a nonlinguistic context level provides semantic predictions informed by sensory inputs, which are crucial for disambiguating among multiple meanings of the same word. The explicit knowledge hierarchy of the model enables a more holistic account of the neurophysiological responses to speech compared to using lexical predictions generated by a neural network language model (GPT-2). We also show that hierarchical predictions reduce peripheral processing via minimizing uncertainty and prediction error. With this proof-of-concept model, we demonstrate that the deployment of hierarchical predictions is a possible strategy for the brain to dynamically utilize structured knowledge and make sense of the speech input.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfd8/10079236/873a3f1a6049/pbio.3002046.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfd8/10079236/bf1b8e504438/pbio.3002046.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfd8/10079236/5a0c5b6b77e8/pbio.3002046.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfd8/10079236/de418310bb30/pbio.3002046.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfd8/10079236/3c971b350b2f/pbio.3002046.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfd8/10079236/08306245c817/pbio.3002046.g006.jpg

Similar articles

1
A deep hierarchy of predictions enables online meaning extraction in a computational model of human speech comprehension.
PLoS Biol. 2023 Mar 22;21(3):e3002046. doi: 10.1371/journal.pbio.3002046. eCollection 2023 Mar.
2
A hierarchy of linguistic predictions during natural language comprehension.
Proc Natl Acad Sci U S A. 2022 Aug 9;119(32):e2201968119. doi: 10.1073/pnas.2201968119. Epub 2022 Aug 3.
3
Semantic Context Enhances the Early Auditory Encoding of Natural Speech.
J Neurosci. 2019 Sep 18;39(38):7564-7575. doi: 10.1523/JNEUROSCI.0584-19.2019. Epub 2019 Aug 1.
4
Balancing Prediction and Sensory Input in Speech Comprehension: The Spatiotemporal Dynamics of Word Recognition in Context.
J Neurosci. 2019 Jan 16;39(3):519-527. doi: 10.1523/JNEUROSCI.3573-17.2018. Epub 2018 Nov 20.
5
Linguistic Structure and Meaning Organize Neural Oscillations into a Content-Specific Hierarchy.
J Neurosci. 2020 Dec 2;40(49):9467-9475. doi: 10.1523/JNEUROSCI.0302-20.2020. Epub 2020 Oct 23.
6
A Heteromodal Word-Meaning Binding Site in the Visual Word Form Area under Top-Down Frontoparietal Control.
J Neurosci. 2021 Apr 28;41(17):3854-3869. doi: 10.1523/JNEUROSCI.2771-20.2021. Epub 2021 Mar 9.
7
Predictive Brain Mechanisms in Sound-to-Meaning Mapping during Speech Processing.
J Neurosci. 2016 Oct 19;36(42):10813-10822. doi: 10.1523/JNEUROSCI.0583-16.2016.
8
Dissociable electrophysiological measures of natural language processing reveal differences in speech comprehension strategy in healthy ageing.
Sci Rep. 2021 Mar 2;11(1):4963. doi: 10.1038/s41598-021-84597-9.
9
The Neural Time Course of Semantic Ambiguity Resolution in Speech Comprehension.
J Cogn Neurosci. 2020 Mar;32(3):403-425. doi: 10.1162/jocn_a_01493. Epub 2019 Nov 4.
10
An fMRI study investigating effects of conceptually related sentences on the perception of degraded speech.
Cortex. 2016 Jun;79:57-74. doi: 10.1016/j.cortex.2016.03.014. Epub 2016 Mar 25.

Cited by

1
A primate grammar enabling incremental processing.
iScience. 2025 Mar 20;28(4):112229. doi: 10.1016/j.isci.2025.112229. eCollection 2025 Apr 18.
2
"What" and "When" Predictions Jointly Modulate Speech Processing.
J Neurosci. 2025 May 14;45(20):e1049242025. doi: 10.1523/JNEUROSCI.1049-24.2025.
3
Dog-human vocal interactions match dogs' sensory-motor tuning.
