Suppr超能文献

语音的知觉融合倾向。

Perceptual fusion tendency of speech sounds.

机构信息

Department of Psychology, Peking University, Beijing, China.

出版信息

J Cogn Neurosci. 2011 Apr;23(4):1003-14. doi: 10.1162/jocn.2010.21470. Epub 2010 Mar 29.

Abstract

To discriminate and to recognize sound sources in a noisy, reverberant environment, listeners need to perceptually integrate the direct wave with the reflections of each sound source. It has been confirmed that perceptual fusion between direct and reflected waves of a speech sound helps listeners recognize this speech sound in a simulated reverberant environment with disrupting sound sources. When the delay between a direct sound wave and its reflected wave is sufficiently short, the two waves are perceptually fused into a single sound image as coming from the source location. Interestingly, compared with nonspeech sounds such as clicks and noise bursts, speech sounds have a much larger perceptual fusion tendency. This study investigated why the fusion tendency for speech sounds is so large. Here we show that when the temporal amplitude fluctuation of speech was artificially time reversed, a large perceptual fusion tendency of speech sounds disappeared, regardless of whether the speech acoustic carrier was in normal or reversed temporal order. Moreover, perceptual fusion of normal-order speech, but not that of time-reversed speech, was accompanied by increased coactivation of the attention-control-related, spatial-processing-related, and speech-processing-related cortical areas. Thus, speech-like acoustic carriers modulated by speech amplitude fluctuation selectively activate a cortical network for top-down modulations of speech processing, leading to an enhancement of perceptual fusion of speech sounds. This mechanism represents a perceptual-grouping strategy for unmasking speech under adverse conditions.

摘要

为了在嘈杂、混响的环境中辨别和识别声源,听众需要在感知上整合直达波和每个声源的反射波。已经证实,语音的直达波和反射波之间的感知融合有助于听众在具有干扰声源的模拟混响环境中识别该语音。当直达声波与其反射波之间的延迟足够短时,这两个波在感知上融合成来自声源位置的单个声音图像。有趣的是,与点击声和噪声突发等非语音声音相比,语音声音具有更大的感知融合倾向。这项研究探讨了为什么语音的融合倾向如此之大。在这里,我们表明,当语音的时间幅度波动被人为地时间反转时,无论语音声载波处于正常还是反转的时间顺序,语音的大感知融合倾向都会消失。此外,正常顺序的语音的感知融合,但不是时间反转的语音的感知融合,伴随着注意力控制相关、空间处理相关和语音处理相关皮质区域的协同激活增加。因此,受语音幅度波动调制的类似语音的声载波选择性地激活了用于语音处理的自上而下调制的皮质网络,从而增强了语音声音的感知融合。这种机制代表了在不利条件下解蔽语音的感知分组策略。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验