语音的知觉融合倾向。

Perceptual fusion tendency of speech sounds.

机构信息

Department of Psychology, Peking University, Beijing, China.

出版信息

J Cogn Neurosci. 2011 Apr;23(4):1003-14. doi: 10.1162/jocn.2010.21470. Epub 2010 Mar 29.

DOI:10.1162/jocn.2010.21470

PMID:20350060

Abstract

To discriminate and to recognize sound sources in a noisy, reverberant environment, listeners need to perceptually integrate the direct wave with the reflections of each sound source. It has been confirmed that perceptual fusion between direct and reflected waves of a speech sound helps listeners recognize this speech sound in a simulated reverberant environment with disrupting sound sources. When the delay between a direct sound wave and its reflected wave is sufficiently short, the two waves are perceptually fused into a single sound image as coming from the source location. Interestingly, compared with nonspeech sounds such as clicks and noise bursts, speech sounds have a much larger perceptual fusion tendency. This study investigated why the fusion tendency for speech sounds is so large. Here we show that when the temporal amplitude fluctuation of speech was artificially time reversed, a large perceptual fusion tendency of speech sounds disappeared, regardless of whether the speech acoustic carrier was in normal or reversed temporal order. Moreover, perceptual fusion of normal-order speech, but not that of time-reversed speech, was accompanied by increased coactivation of the attention-control-related, spatial-processing-related, and speech-processing-related cortical areas. Thus, speech-like acoustic carriers modulated by speech amplitude fluctuation selectively activate a cortical network for top-down modulations of speech processing, leading to an enhancement of perceptual fusion of speech sounds. This mechanism represents a perceptual-grouping strategy for unmasking speech under adverse conditions.

摘要

为了在嘈杂、混响的环境中辨别和识别声源，听众需要在感知上整合直达波和每个声源的反射波。已经证实，语音的直达波和反射波之间的感知融合有助于听众在具有干扰声源的模拟混响环境中识别该语音。当直达声波与其反射波之间的延迟足够短时，这两个波在感知上融合成来自声源位置的单个声音图像。有趣的是，与点击声和噪声突发等非语音声音相比，语音声音具有更大的感知融合倾向。这项研究探讨了为什么语音的融合倾向如此之大。在这里，我们表明，当语音的时间幅度波动被人为地时间反转时，无论语音声载波处于正常还是反转的时间顺序，语音的大感知融合倾向都会消失。此外，正常顺序的语音的感知融合，但不是时间反转的语音的感知融合，伴随着注意力控制相关、空间处理相关和语音处理相关皮质区域的协同激活增加。因此，受语音幅度波动调制的类似语音的声载波选择性地激活了用于语音处理的自上而下调制的皮质网络，从而增强了语音声音的感知融合。这种机制代表了在不利条件下解蔽语音的感知分组策略。

相似文献

Perceptual fusion tendency of speech sounds.

J Cogn Neurosci. 2011 Apr;23(4):1003-14. doi: 10.1162/jocn.2010.21470. Epub 2010 Mar 29.

Responsiveness of the human auditory cortex to degraded speech sounds: reduction of amplitude resolution vs. additive noise.

Brain Res. 2011 Jan 7;1367:298-309. doi: 10.1016/j.brainres.2010.10.037. Epub 2010 Oct 20.

Segmental processing in the human auditory dorsal stream.

Brain Res. 2008 Jul 18;1220:179-90. doi: 10.1016/j.brainres.2007.11.013. Epub 2007 Nov 17.

Cortical networks representing object categories and high-level attributes of familiar real-world action sounds.

J Cogn Neurosci. 2011 Aug;23(8):2079-101. doi: 10.1162/jocn.2010.21570. Epub 2010 Sep 2.

Deviant processing of letters and speech sounds as proximate cause of reading failure: a functional magnetic resonance imaging study of dyslexic children.

Brain. 2010 Mar;133(Pt 3):868-79. doi: 10.1093/brain/awp308. Epub 2010 Jan 7.

Is discrimination training necessary to cause changes in the P2 auditory event-related brain potential to speech sounds?

Brain Res Cogn Brain Res. 2005 Oct;25(2):547-53. doi: 10.1016/j.cogbrainres.2005.08.007. Epub 2005 Sep 28.

Distinct fMRI responses to laughter, speech, and sounds along the human peri-sylvian cortex.

Brain Res Cogn Brain Res. 2005 Jul;24(2):291-306. doi: 10.1016/j.cogbrainres.2005.02.008. Epub 2005 Mar 29.

Cortical differentiation of speech and nonspeech sounds at 100 ms: implications for dyslexia.

Cereb Cortex. 2005 Jul;15(7):1054-63. doi: 10.1093/cercor/bhh206. Epub 2004 Nov 24.

Perceived target-masker separation unmasks responses of lateral amygdala to the emotionally conditioned target sounds in awake rats.

Neuroscience. 2012 Dec 6;225:249-57. doi: 10.1016/j.neuroscience.2012.08.022. Epub 2012 Aug 21.

Time course of early audiovisual interactions during speech and nonspeech central auditory processing: a magnetoencephalography study.

J Cogn Neurosci. 2009 Feb;21(2):259-74. doi: 10.1162/jocn.2008.21019.

引用本文的文献

Neural correlates of perceptual separation-induced enhancement of prepulse inhibition of startle in humans.

Sci Rep. 2018 Jan 11;8(1):472. doi: 10.1038/s41598-017-18793-x.

Sensitivity to an Illusion of Sound Location in Human Auditory Cortex.

Front Syst Neurosci. 2017 May 23;11:35. doi: 10.3389/fnsys.2017.00035. eCollection 2017.

Auditory midbrain representation of a break in interaural correlation.

J Neurophysiol. 2015 Oct;114(4):2258-64. doi: 10.1152/jn.00645.2015. Epub 2015 Aug 12.

Primitive auditory memory is correlated with spatial unmasking that is based on direct-reflection integration.

PLoS One. 2013 Apr 29;8(4):e63106. doi: 10.1371/journal.pone.0063106. Print 2013.

Differentially organized top-down modulation of prepulse inhibition of startle.

J Neurosci. 2011 Sep 21;31(38):13644-53. doi: 10.1523/JNEUROSCI.1292-11.2011.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

语音的知觉融合倾向。

Perceptual fusion tendency of speech sounds.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献