• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

语音识别中的预测、贝叶斯推理与反馈

Prediction, Bayesian inference and feedback in speech recognition.

作者信息

Norris Dennis, McQueen James M, Cutler Anne

机构信息

MRC Cognition and Brain Sciences Unit , Cambridge , UK.

Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands; Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands.

出版信息

Lang Cogn Neurosci. 2016 Jan 2;31(1):4-18. doi: 10.1080/23273798.2015.1081703. Epub 2015 Sep 4.

DOI:10.1080/23273798.2015.1081703
PMID:26740960
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4685608/
Abstract

Speech perception involves prediction, but how is that prediction implemented? In cognitive models prediction has often been taken to imply that there is feedback of activation from lexical to pre-lexical processes as implemented in interactive-activation models (IAMs). We show that simple activation feedback does not actually improve speech recognition. However, other forms of feedback can be beneficial. In particular, feedback can enable the listener to adapt to changing input, and can potentially help the listener to recognise unusual input, or recognise speech in the presence of competing sounds. The common feature of these helpful forms of feedback is that they are all ways of optimising the performance of speech recognition using Bayesian inference. That is, listeners make predictions about speech because speech recognition is optimal in the sense captured in Bayesian models.

摘要

语音感知涉及预测,但这种预测是如何实现的呢?在认知模型中,预测通常被认为意味着存在从词汇层面到词汇前处理过程的激活反馈,就像在交互式激活模型(IAMs)中那样。我们表明,简单的激活反馈实际上并不能提高语音识别能力。然而,其他形式的反馈可能是有益的。特别是,反馈可以使听者适应不断变化的输入,并且有可能帮助听者识别异常输入,或在存在竞争声音的情况下识别语音。这些有用的反馈形式的共同特征是,它们都是使用贝叶斯推理来优化语音识别性能的方法。也就是说,听者对语音进行预测是因为语音识别在贝叶斯模型所捕捉的意义上是最优的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d70/4685608/a33d4d04bc0e/plcp_a_1081703_f0001_b.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d70/4685608/a33d4d04bc0e/plcp_a_1081703_f0001_b.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0d70/4685608/a33d4d04bc0e/plcp_a_1081703_f0001_b.jpg

相似文献

1
Prediction, Bayesian inference and feedback in speech recognition.语音识别中的预测、贝叶斯推理与反馈
Lang Cogn Neurosci. 2016 Jan 2;31(1):4-18. doi: 10.1080/23273798.2015.1081703. Epub 2015 Sep 4.
2
Predictive Neural Computations Support Spoken Word Recognition: Evidence from MEG and Competitor Priming.预测性神经计算支持口语识别:来自 MEG 和竞争启动的证据。
J Neurosci. 2021 Aug 11;41(32):6919-6932. doi: 10.1523/JNEUROSCI.1685-20.2021. Epub 2021 Jul 1.
3
Why might there be lexical-prelexical feedback in speech recognition?为什么在语音识别中可能存在词汇前词汇反馈?
Cognition. 2025 Feb;255:106025. doi: 10.1016/j.cognition.2024.106025. Epub 2024 Nov 30.
4
Shortlist B: a Bayesian model of continuous speech recognition.入围名单B:连续语音识别的贝叶斯模型。
Psychol Rev. 2008 Apr;115(2):357-95. doi: 10.1037/0033-295X.115.2.357.
5
The Optimal Speech-to-Background Ratio for Balancing Speech Recognition With Environmental Sound Recognition.在平衡语音识别和环境声音识别时的最佳语音与背景噪声比。
Ear Hear. 2024;45(6):1444-1460. doi: 10.1097/AUD.0000000000001532. Epub 2024 May 31.
6
Examination of the neighborhood activation theory in normal and hearing-impaired listeners.对正常和听力受损听众的邻域激活理论的检验。
Ear Hear. 2001 Feb;22(1):1-13. doi: 10.1097/00003446-200102000-00001.
7
Perceptual learning in speech.言语中的知觉学习。
Cogn Psychol. 2003 Sep;47(2):204-38. doi: 10.1016/s0010-0285(03)00006-9.
8
Lexical Access Changes Based on Listener Needs: Real-Time Word Recognition in Continuous Speech in Cochlear Implant Users.基于听者需求的词汇通达变化:人工耳蜗使用者连续言语中的实时单词识别。
Ear Hear. 2022;43(5):1487-1501. doi: 10.1097/AUD.0000000000001203. Epub 2022 Jan 21.
9
The Recognition of Whispered Speech in Real-Time.实时低声语音识别
Ear Hear. 2022 Mar/Apr;43(2):554-562. doi: 10.1097/AUD.0000000000001114.
10
Interaction in Spoken Word Recognition Models: Feedback Helps.口语单词识别模型中的交互作用:反馈有帮助。
Front Psychol. 2018 Apr 3;9:369. doi: 10.3389/fpsyg.2018.00369. eCollection 2018.

引用本文的文献

1
Timecourse of bottom-up and top-down language processing during a picture-based semantic priming task.基于图片的语义启动任务中自下而上和自上而下语言加工的时间进程。
Lang Cogn Neurosci. 2025;40(1):122-144. doi: 10.1080/23273798.2024.2409136. Epub 2024 Oct 7.
2
Expectation-driven sensory adaptations support enhanced acuity during categorical perception.期望驱动的感官适应有助于在分类感知过程中提高敏锐度。
Nat Neurosci. 2025 Apr;28(4):861-872. doi: 10.1038/s41593-025-01899-1. Epub 2025 Mar 13.
3
Individual Differences in the Recognition of Spectrally Degraded Speech: Associations With Neurocognitive Functions in Adult Cochlear Implant Users and With Noise-Vocoded Simulations.

本文引用的文献

1
Predictive coding.预测编码。
Wiley Interdiscip Rev Cogn Sci. 2011 Sep;2(5):580-593. doi: 10.1002/wcs.142. Epub 2011 Mar 24.
2
Robust speech perception: recognize the familiar, generalize to the similar, and adapt to the novel.强大的语音感知:识别熟悉的内容,将其推广到相似的内容,并适应新的内容。
Psychol Rev. 2015 Apr;122(2):148-203. doi: 10.1037/a0038695.
3
Interactive activation and mutual constraint satisfaction in perception and cognition.感知与认知中的交互激活与相互约束满足
频谱退化语音识别中的个体差异:与成人人工耳蜗使用者的神经认知功能及噪声声码模拟的关联
Trends Hear. 2025 Jan-Dec;29:23312165241312449. doi: 10.1177/23312165241312449.
4
Decoding contextual influences on auditory perception from primary auditory cortex.从初级听觉皮层解码对听觉感知的情境影响。
Elife. 2024 Dec 9;13:RP94296. doi: 10.7554/eLife.94296.
5
Convergent neural signatures of speech prediction error are a biological marker for spoken word recognition.语音预测误差的会聚神经特征是口语识别的生物学标记。
Nat Commun. 2024 Nov 18;15(1):9984. doi: 10.1038/s41467-024-53782-5.
6
Simple Recurrent Networks are Interactive.简单循环网络具有交互性。
Psychon Bull Rev. 2025 Jun;32(3):1032-1040. doi: 10.3758/s13423-024-02608-y. Epub 2024 Nov 13.
7
Do They Know It's Christmash? Lexical Knowledge Directly Impacts Speech Perception.他们知道这是圣诞节吗?词汇知识直接影响言语感知。
Cogn Sci. 2024 May;48(5):e13449. doi: 10.1111/cogs.13449.
8
Lexical Feedback in the Time-Invariant String Kernel (TISK) Model of Spoken Word Recognition.口语单词识别的时不变字符串核(TISK)模型中的词汇反馈。
J Cogn. 2024 Apr 26;7(1):38. doi: 10.5334/joc.362. eCollection 2024.
9
Contra assertions, feedback improves word recognition: How feedback and lateral inhibition sharpen signals over noise.与断言相反,反馈有助于提高单词识别能力:反馈和侧抑制如何在噪声中增强信号。
Cognition. 2024 Jan;242:105661. doi: 10.1016/j.cognition.2023.105661. Epub 2023 Nov 7.
10
How adults understand what young children say.成人如何理解幼儿的话语。
Nat Hum Behav. 2023 Dec;7(12):2111-2125. doi: 10.1038/s41562-023-01698-3. Epub 2023 Oct 26.
Cogn Sci. 2014 Aug;38(6):1139-89. doi: 10.1111/cogs.12146. Epub 2014 Aug 7.
4
From birdsong to human speech recognition: bayesian inference on a hierarchy of nonlinear dynamical systems.从鸟鸣到人类语音识别:基于非线性动力系统层次结构的贝叶斯推断。
PLoS Comput Biol. 2013;9(9):e1003219. doi: 10.1371/journal.pcbi.1003219. Epub 2013 Sep 12.
5
Swinging at a cocktail party: voice familiarity aids speech perception in the presence of a competing voice.在鸡尾酒会上摇摆:在存在竞争声音的情况下,语音熟悉度有助于语音感知。
Psychol Sci. 2013 Oct;24(10):1995-2004. doi: 10.1177/0956797613482467. Epub 2013 Aug 28.
6
Integrating probabilistic models of perception and interactive neural networks: a historical and tutorial review.整合感知概率模型和交互式神经网络:历史与教程综述。
Front Psychol. 2013 Aug 20;4:503. doi: 10.3389/fpsyg.2013.00503. eCollection 2013.
7
Deformable templates for face recognition.用于人脸识别的可变形模板。
J Cogn Neurosci. 1991 Winter;3(1):59-70. doi: 10.1162/jocn.1991.3.1.59.
8
Top-down influences of written text on perceived clarity of degraded speech.书面文本对退化语音感知清晰度的自上而下影响。
J Exp Psychol Hum Percept Perform. 2014 Feb;40(1):186-99. doi: 10.1037/a0033206. Epub 2013 Jun 10.
9
The speakers' accent shapes the listeners' phonological predictions during speech perception.演讲者的口音会影响听众在语音感知过程中的语音预测。
Brain Lang. 2013 Apr;125(1):82-93. doi: 10.1016/j.bandl.2013.01.007. Epub 2013 Feb 26.
10
Effects of prior information on decoding degraded speech: an fMRI study.先前信息对解码退化语音的影响:一项 fMRI 研究。
Hum Brain Mapp. 2014 Jan;35(1):61-74. doi: 10.1002/hbm.22151. Epub 2012 Aug 30.