Psychology, School of Life and Health Sciences, Aston University, Birmingham, B4 7ET, UK.
Department of Psychology, University of York, Heslington, York, YO10 5DD, UK.
J Assoc Res Otolaryngol. 2012 Apr;13(2):269-280. doi: 10.1007/s10162-011-0307-y. Epub 2011 Dec 13.
How speech is separated perceptually from other speech remains poorly understood. Recent research suggests that the ability of an extraneous formant to impair intelligibility depends on the modulation of its frequency, but not its amplitude, contour. This study further examined the effect of formant-frequency variation on intelligibility by manipulating the rate of formant-frequency change. Target sentences were synthetic three-formant (F1 + F2 + F3) analogues of natural utterances. Perceptual organization was probed by presenting stimuli dichotically (F1 + F2C + F3C; F2 + F3), where F2C + F3C constitute a competitor for F2 and F3 that listeners must reject to optimize recognition. Competitors were derived using formant-frequency contours extracted from extended passages spoken by the same talker and processed to alter the rate of formant-frequency variation, such that rate scale factors relative to the target sentences were 0, 0.25, 0.5, 1, 2, and 4 (0 = constant frequencies). Competitor amplitude contours were either constant, or time-reversed and rate-adjusted in parallel with the frequency contour. Adding a competitor typically reduced intelligibility; this reduction increased with competitor rate until the rate was at least twice that of the target sentences. Similarity in the results for the two amplitude conditions confirmed that formant amplitude contours do not influence across-formant grouping. The findings indicate that competitor efficacy is not tuned to the rate of the target sentences; most probably, it depends primarily on the overall rate of frequency variation in the competitor formants. This suggests that, when segregating the speech of concurrent talkers, differences in speech rate may not be a significant cue for across-frequency grouping of formants.
言语如何在知觉上与其他言语区分开来仍未被很好地理解。最近的研究表明,多余的共振峰对可懂度的影响取决于其频率的调制,而不是其幅度和轮廓的调制。本研究通过控制共振峰频率变化的速率进一步考察了共振峰频率变化对可懂度的影响。目标句子是自然话语的三共振峰(F1+F2+F3)模拟。通过呈现双声道刺激(F1+F2C+F3C;F2+F3C)来探测感知组织,其中 F2C+F3C 构成了 F2 和 F3 的竞争者,听众必须拒绝 F2C+F3C 以优化识别。竞争者是从同一个说话者说的扩展段落中提取的共振峰频率轮廓,经过处理以改变共振峰频率变化的速率,使得相对于目标句子的速率比例因子为 0、0.25、0.5、1、2 和 4(0=恒定频率)。竞争者的幅度轮廓要么是恒定的,要么是与频率轮廓时间反转并速率调整平行的。添加竞争者通常会降低可懂度;这种降低随着竞争者的速率增加而增加,直到速率至少是目标句子的两倍。两种幅度条件下的结果相似,证实了共振峰幅度轮廓不会影响跨共振峰分组。研究结果表明,竞争者的有效性不是针对目标句子的速率进行调整的;最有可能的是,它主要取决于竞争者共振峰中频率变化的总速率。这表明,在分离同时说话者的言语时,言语速率的差异可能不是跨频率分组的重要线索。