Vestergaard Martin D, Fyson Nicholas R C, Patterson Roy D
Department of Physiology, Centre for the Neural Basis of Hearing, University of Cambridge, Cambridge, United Kingdom.
J Acoust Soc Am. 2009 Feb;125(2):1114-24. doi: 10.1121/1.3050321.
In concurrent-speech recognition, performance is enhanced when either the glottal pulse rate (GPR) or the vocal tract length (VTL) of the target speaker differs from that of the distracter, but relatively little is known about the trading relationship between the two variables, or how they interact with other cues such as signal-to-noise ratio (SNR). This paper presents a study in which listeners were asked to identify a target syllable in the presence of a distracter syllable, with carefully matched temporal envelopes. The syllables varied in GPR and VTL over a large range, and they were presented at different SNRs. The results showed that performance is particularly sensitive to the combination of GPR and VTL when the SNR is 0 dB. Equal-performance contours showed that when there are no other cues, a two-semitone difference in GPR produced the same advantage in performance as a 20% difference in VTL. This corresponds to a trading relationship between GPR and VTL of 1.6. The results illustrate that the auditory system can use any combination of differences in GPR, VTL, and SNR to segregate competing speech signals.
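The arithmetic behind the quoted trading value can be checked with a short sketch. The abstract does not say how the value of 1.6 was computed, so the two definitions below (a ratio of percentage differences and a ratio of log differences) are assumptions made for illustration, as are all function and variable names. The only inputs taken from the abstract are the two-semitone GPR difference and the 20% VTL difference; the conversion of semitones to a frequency ratio uses the standard equal-tempered relation of 12 semitones per octave.

```python
import math


def semitones_to_ratio(semitones: float) -> float:
    """Convert a pitch interval in semitones to a frequency ratio (12 semitones = 1 octave)."""
    return 2.0 ** (semitones / 12.0)


# Values stated in the abstract.
gpr_ratio = semitones_to_ratio(2.0)  # two-semitone GPR difference, about a 12% change
vtl_ratio = 1.20                     # 20% VTL difference

# Assumed definition 1: ratio of the percentage differences.
trade_percent = (vtl_ratio - 1.0) / (gpr_ratio - 1.0)

# Assumed definition 2: ratio of the differences on a logarithmic scale.
trade_log = math.log(vtl_ratio) / math.log(gpr_ratio)

print(f"GPR ratio for two semitones: {gpr_ratio:.4f}")
print(f"VTL/GPR trade (percent differences): {trade_percent:.2f}")
print(f"VTL/GPR trade (log differences): {trade_log:.2f}")
```

Under either assumed definition the result rounds to 1.6, which is consistent with the figure reported in the abstract.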