Department of Communication Disorders, University of Massachusetts, Amherst, Massachusetts, USA.
Ear Hear. 2012 Jan-Feb;33(1):124-33. doi: 10.1097/AUD.0b013e31822b5bee.
The primary goal of this study was to investigate how speech perception is altered by the provision of a preview or "prime" of a sample of speech just before it is presented in masking. A same-different test paradigm was developed which enabled the effect of priming to be measured with energetic maskers in addition to those that most likely produced both energetic and informational masking. Using this paradigm, the benefit of priming in overcoming energetic and informational masking was compared.
Twenty-four normal-hearing subjects listened to nonsense sentences presented in a background of competing speech (two-talker babble) or one of two types of speech-shaped noise. Both target and masker were presented via loudspeaker directly in front of the listeners. In the baseline condition, the listeners were then shown a sentence on a computer screen that either matched the auditory target sentence exactly or contained a replacement for one of the three target key words. Their task was to judge whether the printed sentence matched the auditory target and respond via computer keyboard. In the first experimental condition, the printed sentence preceded rather than followed the auditory presentation (the priming condition). In the second experimental condition, the perception of spatial separation was created between target and masker by presenting the masker from two loudspeakers (front and 60° to the right) and imposing a 4-msec delay in the masker coming from the front loudspeaker. This resulted in the target being heard from the front while, because of the precedence effect, the masker was heard well to the right (the spatial condition). In a third experimental condition, spatial separation and priming were combined. A total of five signal-to-noise ratios were tested for each masker.
The competing speech masker produced more masking than noise, consistent with previous findings. For the competing speech masker, the signal-to-noise ratio for 80% correct performance was approximately 6.7 dB lower when the listeners read the sentences first (the priming condition) than in the baseline condition. This priming effect was similar to the improvement obtained when the target and masker were separated spatially. Significant priming effects were also observed with speech-shaped noise maskers, and when there was perceived spatial separation between target and masker, conditions in which informational masking was believed to have been minimal. There seemed to be an additive effect of spatial separation and priming in the two-talker babble condition.
(1) Priming was effective in improving speech perception in all conditions, including those consisting of primarily energetic masking. (2) It is not clear how much benefit from priming could be attributed to release from informational masking. (3) Performance on the same-different task was linearly related to performance on an open-set speech recognition task using the same target and masker.
本研究的主要目的是研究在掩蔽之前提供语音样本的预览或“启动”时,语音感知如何发生变化。开发了一种相同-不同测试范式,该范式使我们能够使用能量掩蔽器以及可能产生能量和信息掩蔽的掩蔽器来测量启动的效果。使用此范式,比较了启动克服能量和信息掩蔽的效果。
24 名正常听力受试者听取了在竞争语音(双说话人杂音)或两种语音成形噪声之一的背景下呈现的无意义句子。目标和掩蔽器均通过扬声器直接放置在听众面前。在基线条件下,然后在计算机屏幕上向听众显示一个句子,该句子要么与听觉目标句子完全匹配,要么包含三个目标关键字之一的替代词。他们的任务是判断打印的句子是否与听觉目标匹配,并通过计算机键盘进行响应。在第一种实验条件下,打印的句子先于而不是跟随听觉呈现(启动条件)。在第二种实验条件下,通过从两个扬声器(前面和 60°向右)呈现掩蔽器并在前扬声器中的掩蔽器施加 4 毫秒的延迟来创建目标和掩蔽器之间的空间分离感知。这导致目标从前部听到,而由于抢先效应,掩蔽器从右侧很好地听到(空间条件)。在第三种实验条件下,同时进行空间分离和启动。对于每个掩蔽器,测试了五个信噪比。
竞争语音掩蔽器产生的掩蔽比噪声多,这与先前的发现一致。对于竞争语音掩蔽器,当听众先阅读句子(启动条件)时,80%正确性能的信噪比比基线条件低约 6.7dB。这种启动效应与目标和掩蔽器空间分离时获得的改善相似。在语音成形噪声掩蔽器中也观察到了显著的启动效应,并且当目标和掩蔽器之间存在感知的空间分离时,信息掩蔽被认为最小的条件下也观察到了启动效应。在双说话人杂音条件下,空间分离和启动似乎具有附加效应。
(1)在所有条件下,启动都有效地改善了语音感知,包括主要由能量掩蔽组成的条件。(2)尚不清楚可以归因于信息掩蔽释放的启动收益有多少。(3)在相同-不同任务上的表现与使用相同目标和掩蔽器的开放式语音识别任务上的表现呈线性相关。