Rapid computations of spectrotemporal prediction error support perception of degraded speech.

Authors

Sohoglu Ediz, Davis Matthew H

Affiliations

School of Psychology, University of Sussex, Brighton, United Kingdom.

MRC Cognition and Brain Sciences Unit, Cambridge, United Kingdom.

Publication information

Elife. 2020 Nov 4;9:e58077. doi: 10.7554/eLife.58077.

DOI:10.7554/eLife.58077

PMID:33147138

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7641582/

Abstract

Human speech perception can be described as Bayesian perceptual inference but how are these Bayesian computations instantiated neurally? We used magnetoencephalographic recordings of brain responses to degraded spoken words and experimentally manipulated signal quality and prior knowledge. We first demonstrate that spectrotemporal modulations in speech are more strongly represented in neural responses than alternative speech representations (e.g. spectrogram or articulatory features). Critically, we found an interaction between speech signal quality and expectations from prior written text on the quality of neural representations; increased signal quality enhanced neural representations of speech that mismatched with prior expectations, but led to greater suppression of speech that matched prior expectations. This interaction is a unique neural signature of prediction error computations and is apparent in neural responses within 100 ms of speech input. Our findings contribute to the detailed specification of a computational model of speech perception based on predictive coding frameworks.
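The interaction described in the abstract can be illustrated with a toy simulation. This is a minimal sketch under stated assumptions, not the authors' model or analysis: speech is a random feature vector, the heard input is that vector scaled by a hypothetical `quality` parameter plus Gaussian noise, the prior is either the same vector (match) or an unrelated one (mismatch), and prediction error is simply input minus prediction. The squared correlation between the error signal and the speech features stands in for "quality of neural representation". All names and parameter values are illustrative.

```python
import random


def squared_correlation(a, b):
    """Squared Pearson correlation: a sign-agnostic index of how well a reflects b."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov * cov / (var_a * var_b)


def simulate(quality, prior_matches, n_features=5000, noise_sd=0.5, seed=0):
    """Return how strongly a toy prediction error signal represents the speech features.

    Assumptions (illustrative only): features are standard normal, degradation
    scales the signal by `quality`, and the prediction is a full-strength copy
    of either the spoken word (match) or an unrelated word (mismatch).
    """
    rng = random.Random(seed)
    speech = [rng.gauss(0, 1) for _ in range(n_features)]
    other_word = [rng.gauss(0, 1) for _ in range(n_features)]
    prediction = speech if prior_matches else other_word
    heard = [quality * s + rng.gauss(0, noise_sd) for s in speech]
    error = [h - p for h, p in zip(heard, prediction)]  # prediction error
    return squared_correlation(error, speech)


# Cross prior knowledge (match/mismatch) with signal quality (low/high).
results = {
    (prior, quality): simulate(quality, prior == "match")
    for prior in ("match", "mismatch")
    for quality in (0.3, 0.9)
}

for (prior, quality), value in sorted(results.items()):
    print(f"{prior:8s} quality={quality:.1f} representation={value:.3f}")
```

In this sketch, raising `quality` increases the representation index when the prior mismatches and decreases it when the prior matches, because a well-predicted clear input leaves little speech information in the error signal. That crossover is the qualitative pattern the abstract describes as a signature of prediction error computations.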

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a0a1/7641582/eec54cdfe4d2/elife-58077-fig1.jpg

Similar articles

Predictive Neural Computations Support Spoken Word Recognition: Evidence from MEG and Competitor Priming.

J Neurosci. 2021 Aug 11;41(32):6919-6932. doi: 10.1523/JNEUROSCI.1685-20.2021. Epub 2021 Jul 1.

Perceptual learning of degraded speech by minimizing prediction error.

Proc Natl Acad Sci U S A. 2016 Mar 22;113(12):E1747-56. doi: 10.1073/pnas.1523266113. Epub 2016 Mar 8.

Neural Prediction Errors Distinguish Perception and Misperception of Speech.

J Neurosci. 2018 Jul 4;38(27):6076-6089. doi: 10.1523/JNEUROSCI.3258-17.2018. Epub 2018 Jun 11.

Convergent neural signatures of speech prediction error are a biological marker for spoken word recognition.

Nat Commun. 2024 Nov 18;15(1):9984. doi: 10.1038/s41467-024-53782-5.

Predictive top-down integration of prior knowledge during speech perception.

J Neurosci. 2012 Jun 20;32(25):8443-53. doi: 10.1523/JNEUROSCI.5069-11.2012.

Prediction Errors but Not Sharpened Signals Simulate Multivoxel fMRI Patterns during Speech Perception.

PLoS Biol. 2016 Nov 15;14(11):e1002577. doi: 10.1371/journal.pbio.1002577. eCollection 2016 Nov.

Dynamic Time-Locking Mechanism in the Cortical Representation of Spoken Words.

eNeuro. 2020 Aug 31;7(4). doi: 10.1523/ENEURO.0475-19.2020. Print 2020 Jul/Aug.

Prior Expectations of Motion Direction Modulate Early Sensory Processing.

J Neurosci. 2020 Aug 12;40(33):6389-6397. doi: 10.1523/JNEUROSCI.0537-20.2020. Epub 2020 Jul 8.

Temporal predictive codes for spoken words in auditory cortex.

Curr Biol. 2012 Apr 10;22(7):615-21. doi: 10.1016/j.cub.2012.02.015. Epub 2012 Mar 15.

Cited by

Can prediction error explain predictability effects on the N1 during picture-word verification?

Imaging Neurosci (Camb). 2024 Apr 8;2. doi: 10.1162/imag_a_00131. eCollection 2024.

Fast frequency modulation is encoded according to the listener expectations in the human subcortical auditory pathway.

Imaging Neurosci (Camb). 2024 Sep 19;2. doi: 10.1162/imag_a_00292. eCollection 2024.

An implemented predictive coding model of lexico-semantic processing explains the dynamics of univariate and multivariate activity within the left ventromedial temporal lobe during reading comprehension.

Neuroimage. 2025 Mar;308:120977. doi: 10.1016/j.neuroimage.2024.120977. Epub 2024 Dec 16.

Convergent neural signatures of speech prediction error are a biological marker for spoken word recognition.

Nat Commun. 2024 Nov 18;15(1):9984. doi: 10.1038/s41467-024-53782-5.

Deep-learning models reveal how context and listener attention shape electrophysiological correlates of speech-to-language transformation.

PLoS Comput Biol. 2024 Nov 11;20(11):e1012537. doi: 10.1371/journal.pcbi.1012537. eCollection 2024 Nov.

Linguistic feedback supports rapid adaptation to acoustically degraded speech.

iScience. 2024 May 22;27(6):110055. doi: 10.1016/j.isci.2024.110055. eCollection 2024 Jun 21.

Perceiving and misperceiving speech: lexical and sublexical processing in the superior temporal lobes.

Cereb Cortex. 2024 Mar 1;34(3). doi: 10.1093/cercor/bhae087.

A predictive coding model of the N400.

Cognition. 2024 May;246:105755. doi: 10.1016/j.cognition.2024.105755. Epub 2024 Feb 29.

Eelbrain, a Python toolkit for time-continuous analysis with temporal response functions.

Elife. 2023 Nov 29;12:e85012. doi: 10.7554/eLife.85012.

Contra assertions, feedback improves word recognition: How feedback and lateral inhibition sharpen signals over noise.

Cognition. 2024 Jan;242:105661. doi: 10.1016/j.cognition.2023.105661. Epub 2023 Nov 7.
