在语音感知的贝叶斯感知运动模型中评估听觉和运动信息的互补作用。

The complementary roles of auditory and motor information evaluated in a Bayesian perceptuo-motor model of speech perception.

作者信息

Laurent Raphaël, Barnaud Marie-Lou, Schwartz Jean-Luc, Bessière Pierre, Diard Julien

机构信息

GIPSALab, Université Grenoble Alpes.

Institut des Systèmes Intelligents et de Robotique, Sorbonne Universités, Université Pierre et Marie Curie.

出版信息

Psychol Rev. 2017 Oct;124(5):572-602. doi: 10.1037/rev0000069. Epub 2017 May 4.

DOI:10.1037/rev0000069

PMID:28471206

Abstract

There is a consensus concerning the view that both auditory and motor representations intervene in the perceptual processing of speech units. However, the question of the functional role of each of these systems remains seldom addressed and poorly understood. We capitalized on the formal framework of Bayesian Programming to develop COSMO (Communicating Objects using Sensory-Motor Operations), an integrative model that allows principled comparisons of purely motor or purely auditory implementations of a speech perception task and tests the gain of efficiency provided by their Bayesian fusion. Here, we show 3 main results: (a) In a set of precisely defined "perfect conditions," auditory and motor theories of speech perception are indistinguishable; (b) When a learning process that mimics speech development is introduced into COSMO, it departs from these perfect conditions. Then auditory recognition becomes more efficient than motor recognition in dealing with learned stimuli, while motor recognition is more efficient in adverse conditions. We interpret this result as a general "auditory-narrowband versus motor-wideband" property; and (c) Simulations of plosive-vowel syllable recognition reveal possible cues from motor recognition for the invariant specification of the place of plosive articulation in context that are lacking in the auditory pathway. This provides COSMO with a second property, where auditory cues would be more efficient for vowel decoding and motor cues for plosive articulation decoding. These simulations provide several predictions, which are in good agreement with experimental data and suggest that there is natural complementarity between auditory and motor processing within a perceptuo-motor theory of speech perception. (PsycINFO Database Record

摘要

关于听觉和运动表征都参与语音单元的感知处理这一观点，存在共识。然而，这些系统各自的功能作用问题仍然很少被探讨且理解不足。我们利用贝叶斯编程的形式框架开发了COSMO（使用感觉运动操作进行对象通信），这是一个整合模型，它允许对语音感知任务的纯运动或纯听觉实现进行有原则的比较，并测试其贝叶斯融合所提供的效率提升。在此，我们展示了3个主要结果：（a）在一组精确定义的“完美条件”下，语音感知的听觉和运动理论无法区分；（b）当将模拟语音发展的学习过程引入COSMO时，它偏离了这些完美条件。此时，在处理学习到的刺激时，听觉识别比运动识别更有效，而在不利条件下运动识别更有效。我们将这一结果解释为一种普遍的“听觉窄带与运动宽带”特性；（c）爆破元音音节识别的模拟揭示了运动识别中可能存在的线索，用于在语境中对爆破音发音位置进行不变性指定，而听觉通路中缺乏这些线索。这为COSMO提供了第二个特性，即听觉线索在元音解码方面更有效，而运动线索在爆破音发音解码方面更有效。这些模拟提供了几个预测，与实验数据高度吻合，并表明在语音感知的感知运动理论中，听觉和运动处理之间存在自然的互补性。（PsycINFO数据库记录）

相似文献

The complementary roles of auditory and motor information evaluated in a Bayesian perceptuo-motor model of speech perception.

Psychol Rev. 2017 Oct;124(5):572-602. doi: 10.1037/rev0000069. Epub 2017 May 4.

Computer simulations of coupled idiosyncrasies in speech perception and speech production with COSMO, a perceptuo-motor Bayesian model of speech communication.

PLoS One. 2019 Jan 11;14(1):e0210302. doi: 10.1371/journal.pone.0210302. eCollection 2019.

Reanalyzing neurocognitive data on the role of the motor system in speech perception within COSMO, a Bayesian perceptuo-motor model of speech communication.

Brain Lang. 2018 Dec;187:19-32. doi: 10.1016/j.bandl.2017.12.003. Epub 2017 Dec 12.

What drives the perceptual change resulting from speech motor adaptation? Evaluation of hypotheses in a Bayesian modeling framework.

PLoS Comput Biol. 2018 Jan 22;14(1):e1005942. doi: 10.1371/journal.pcbi.1005942. eCollection 2018 Jan.

Task-modulated Sensitivity to Vocal Pitch in the Dorsal Premotor Cortex during Multitalker Speech Recognition.

J Cogn Neurosci. 2022 Oct 1;34(11):2189-2214. doi: 10.1162/jocn_a_01907.

Prediction and constraint in audiovisual speech perception.

Cortex. 2015 Jul;68:169-81. doi: 10.1016/j.cortex.2015.03.006. Epub 2015 Mar 20.

Preferred auditory temporal processing regimes and auditory-motor synchronization.

Psychon Bull Rev. 2021 Dec;28(6):1860-1873. doi: 10.3758/s13423-021-01933-w. Epub 2021 Jun 7.

Efficient Neural Coding in Auditory and Speech Perception.

Trends Neurosci. 2019 Jan;42(1):56-65. doi: 10.1016/j.tins.2018.09.004. Epub 2018 Oct 5.

Silent articulation modulates auditory and audiovisual speech perception.

Exp Brain Res. 2013 Jun;227(2):275-88. doi: 10.1007/s00221-013-3510-8. Epub 2013 Apr 17.

Degradation of labial information modifies audiovisual speech perception in cochlear-implanted children.

Ear Hear. 2013 Jan-Feb;34(1):110-21. doi: 10.1097/AUD.0b013e3182670993.

引用本文的文献

Monolingual and bilingual infants' attention to talking faces: evidence from eye-tracking and Bayesian modeling.

Front Psychol. 2024 Mar 14;15:1373191. doi: 10.3389/fpsyg.2024.1373191. eCollection 2024.

How the conception of control influences our understanding of actions.

Nat Rev Neurosci. 2023 May;24(5):313-329. doi: 10.1038/s41583-023-00691-z. Epub 2023 Mar 30.

COSMO-Onset: A Neurally-Inspired Computational Model of Spoken Word Recognition, Combining Top-Down Prediction and Bottom-Up Detection of Syllabic Onsets.

Front Syst Neurosci. 2021 Aug 4;15:653975. doi: 10.3389/fnsys.2021.653975. eCollection 2021.

Brain-inspired model for early vocal learning and correspondence matching using free-energy optimization.

PLoS Comput Biol. 2021 Feb 18;17(2):e1008566. doi: 10.1371/journal.pcbi.1008566. eCollection 2021 Feb.

Modeling Sensory Preference in Speech Motor Planning: A Bayesian Modeling Framework.

Front Psychol. 2019 Oct 25;10:2339. doi: 10.3389/fpsyg.2019.02339. eCollection 2019.

The motor system's [modest] contribution to speech perception.

Psychon Bull Rev. 2019 Aug;26(4):1354-1366. doi: 10.3758/s13423-019-01580-2.

Formant Space Reconstruction From Brain Activity in Frontal and Temporal Regions Coding for Heard Vowels.

Front Hum Neurosci. 2019 Feb 8;13:32. doi: 10.3389/fnhum.2019.00032. eCollection 2019.

Computer simulations of coupled idiosyncrasies in speech perception and speech production with COSMO, a perceptuo-motor Bayesian model of speech communication.

PLoS One. 2019 Jan 11;14(1):e0210302. doi: 10.1371/journal.pone.0210302. eCollection 2019.

Bringing the Nonlinearity of the Movement System to Gestural Theories of Language Use: Multifractal Structure of Spoken English Supports the Compensation for Coarticulation in Human Speech Perception.

Front Physiol. 2018 Sep 3;9:1152. doi: 10.3389/fphys.2018.01152. eCollection 2018.

What drives the perceptual change resulting from speech motor adaptation? Evaluation of hypotheses in a Bayesian modeling framework.

PLoS Comput Biol. 2018 Jan 22;14(1):e1005942. doi: 10.1371/journal.pcbi.1005942. eCollection 2018 Jan.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

在语音感知的贝叶斯感知运动模型中评估听觉和运动信息的互补作用。

The complementary roles of auditory and motor information evaluated in a Bayesian perceptuo-motor model of speech perception.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献