Auditory Behavioral Research Lab, Department of Communication Sciences and Disorders, University of South Florida, Tampa, Florida 33620, USA.
J Acoust Soc Am. 2023 Oct 1;154(4):2530-2538. doi: 10.1121/10.0021874.
Modern hearing research has shown that listeners segregate simultaneous speech streams by relying on three major voice cues: fundamental frequency, level, and location. Few of these studies have evaluated reliance on these cues when they are presented simultaneously, as occurs in nature, and fewer still have considered listeners' relative reliance on the cues, because the cues are measured in different units. In the present study, trial-by-trial analyses were used to isolate each listener's simultaneous reliance on the three voice cues, with the behavior of an ideal observer [Green and Swets (1966). (Wiley, New York), pp. 151-178] serving as a comparison standard for evaluating relative reliance. On each trial, listeners heard a pair of randomly selected, simultaneous recordings of naturally spoken sentences. One recording was always from the same talker, a distracter; the other was, with equal probability, from one of two target talkers who differed in the three voice cues. The listener's task was to identify the target talker. Among 33 clinically normal-hearing adults, only one relied predominantly on voice level; the remainder were split between voice fundamental frequency and/or location. The results are discussed with regard to their implications for the common practice of using target-distracter level as a dependent measure of speech-on-speech masking.
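The abstract does not spell out how the trial-by-trial analysis was carried out, so the following is only a minimal sketch of one way such an analysis could work, not the authors' procedure. It assumes that each cue is jittered from trial to trial with independent Gaussian noise, that a listener responds according to a weighted sum of the z-scored cues plus internal noise, and that reliance is estimated by relating the cue jitter to the response. Under those assumptions, an ideal observer weights each z-scored cue in proportion to its separation divided by its standard deviation, which puts cues with different units on a common scale. All names and numbers (`DELTA`, `SIGMA`, `simulate_trials`, `estimate_weights`, `ideal_weights`) are hypothetical and chosen for illustration.

```python
import random

# Hypothetical separations between the two target talkers on each cue, and
# hypothetical standard deviations of the trial-by-trial jitter.
# Order: fundamental frequency (Hz), level (dB), location (degrees).
DELTA = (20.0, 3.0, 10.0)
SIGMA = (10.0, 3.0, 10.0)


def normalize(values):
    """Scale values so that their absolute values sum to 1."""
    total = sum(abs(v) for v in values)
    return [v / total for v in values]


def ideal_weights(delta, sigma):
    """Relative weights of a likelihood-ratio observer on z-scored cues.

    Assumes independent Gaussian cues with equal variance for both targets,
    in which case the optimal weight on cue i is proportional to delta/sigma.
    """
    return normalize([d / s for d, s in zip(delta, sigma)])


def simulate_trials(n, weights, delta, sigma, noise_sd, rng):
    """Simulate a hypothetical listener over n trials.

    Targets are coded -1 and +1 and are equally likely. The listener responds
    with the sign of a weighted sum of z-scored cues plus internal noise.
    """
    trials = []
    for _ in range(n):
        target = rng.choice((-1, 1))
        cues = [rng.gauss(target * d / 2.0, s) for d, s in zip(delta, sigma)]
        decision = sum(w * c / s for w, c, s in zip(weights, cues, sigma))
        decision += rng.gauss(0.0, noise_sd)
        response = 1 if decision > 0 else -1
        trials.append((target, cues, response))
    return trials


def estimate_weights(trials, delta, sigma):
    """Estimate relative reliance from the jitter-response relationship.

    For each cue, the z-scored jitter (cue value minus its mean for the
    presented target) is averaged after multiplying by the response. Under
    the assumptions above these averages are proportional to the weights.
    """
    sums = [0.0] * len(delta)
    for target, cues, response in trials:
        for i, (c, d, s) in enumerate(zip(cues, delta, sigma)):
            jitter = (c - target * d / 2.0) / s
            sums[i] += jitter * response
    return normalize([total / len(trials) for total in sums])


if __name__ == "__main__":
    rng = random.Random(0)
    # A simulated listener who relies mostly on fundamental frequency.
    trials = simulate_trials(40000, (0.6, 0.1, 0.3), DELTA, SIGMA, 0.3, rng)
    print("estimated:", estimate_weights(trials, DELTA, SIGMA))
    print("ideal:    ", ideal_weights(DELTA, SIGMA))
```

Comparing the estimated weights with the ideal weights shows whether the simulated listener over- or under-relies on a cue relative to how informative that cue is, independent of the cue's physical units.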