Department of Psychology, Cluster of Excellence "Hearing4all", European Medical School, Carl von Ossietzky University of Oldenburg, D-26111 Oldenburg, Germany.
Department of Physics, School of Natural Sciences, Chemnitz University of Technology, D-09126 Chemnitz, Germany.
Neuroscience. 2018 Oct 1;389:19-29. doi: 10.1016/j.neuroscience.2017.07.021. Epub 2017 Jul 20.
In everyday listening environments, a main task for our auditory system is to follow one out of multiple speakers talking simultaneously. The present study was designed to find electrophysiological indicators of two central processes involved - segregating the speech mixture into distinct speech sequences corresponding to the two speakers, and then attending to one of the speech sequences. We generated multistable speech stimuli that were set up to create ambiguity as to whether only one or two speakers are talking. Thereby we were able to investigate three perceptual alternatives (no segregation, segregated - speaker A in the foreground, segregated - speaker B in the foreground) without any confounding stimulus changes. Participants listened to a continuously repeating sequence of syllables, which were uttered alternately by two human speakers, and indicated whether they perceived the sequence as an inseparable mixture or as originating from two separate speakers. In the latter case, they distinguished which speaker was in their attentional foreground. Our data show a long-lasting event-related potential (ERP) modulation starting at 130ms after stimulus onset, which can be explained by the perceptual organization of the two speech sequences into attended foreground and ignored background streams. Our paradigm extends previous work with pure-tone sequences toward speech stimuli and adds the possibility to obtain neural correlates of the difficulty to segregate a speech mixture into distinct streams.
在日常的听力环境中,我们的听觉系统的主要任务之一是从多个同时说话的说话者中听出一个。本研究旨在寻找两个涉及的核心过程的电生理指标——将语音混合物分离成对应于两个说话者的不同语音序列,然后关注其中一个语音序列。我们生成了多稳定的语音刺激,这些刺激的设置造成了一种模糊性,即不确定是只有一个还是两个说话者在说话。这样,我们就能够在没有任何刺激变化干扰的情况下,研究三种感知选择(不分隔、分隔 - 说话者 A 在前景、分隔 - 说话者 B 在前景)。参与者听由两个人类说话者交替发出的音节的连续重复序列,并指出他们是否将序列视为不可分割的混合物,还是源自两个单独的说话者。在后一种情况下,他们会区分哪个说话者处于他们的注意力前景中。我们的数据显示,在刺激开始后 130 毫秒出现了一个持久的事件相关电位(ERP)调制,可以用将两个语音序列组织成注意的前景和忽略的背景流来解释。我们的范式扩展了以前使用纯音序列的工作,增加了获得将语音混合物分离成不同流的难度的神经相关物的可能性。