Shen Yi, Manzano Nicole K, Richards Virginia M
Department of Speech and Hearing Sciences, Indiana University Bloomington, 200 S Jordan Avenue, Bloomington, Indiana 47405-7000, USA.
Department of Cognitive Sciences, University of California, Irvine, 3151 Social Science Plaza, Irvine, California 92697-5100, USA.
J Acoust Soc Am. 2015 Dec;138(6):3613-24. doi: 10.1121/1.4937613.
Listeners' speech reception is better when speech is masked by a modulated masker than by an unmodulated masker with the same long-term root-mean-square level. It has been suggested that listeners take advantage of brief periods of quiescence in a modulated masker to extract speech information. Two experiments examined the contribution of such "dip-listening" models. The first experiment estimated psychometric functions for speech intelligibility using sentences masked by sinusoidally modulated and unmodulated speech-shaped noises, and the second experiment estimated detection thresholds for a tone pip added at the central dip in the masker. Modulation rates ranging from 1 to 64 Hz were tested. In experiment 1, the slopes of the psychometric functions were shallower for lower modulation rates, and the pattern of speech reception thresholds as a function of modulation rate was nonmonotonic, with a minimum near 16 Hz. In contrast, the detection thresholds from experiment 2 increased monotonically with modulation rate. The results suggest that the benefits of listening to speech in temporally fluctuating maskers cannot be solely ascribed to the temporal acuity of the auditory system.
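To make the masker comparison in the abstract concrete, the following is a minimal sketch of a sinusoidally amplitude-modulated noise that is rescaled to the same long-term root-mean-square level as its unmodulated counterpart, with an envelope dip placed at the temporal center. It is an illustration only, not the authors' stimulus code. Everything beyond what the abstract states is an assumption: the function names `rms` and `sam_noise`, the sampling rate, the duration, the full modulation depth, the octave-spaced rates between 1 and 64 Hz, and the use of white Gaussian noise as a stand-in for speech-shaped noise.

```python
import numpy as np


def rms(x):
    """Root-mean-square level of a signal."""
    return np.sqrt(np.mean(np.square(x)))


def sam_noise(rate_hz, dur_s=1.0, fs=16000, depth=1.0, seed=0):
    """Return (unmodulated, modulated) noises with equal long-term RMS.

    All parameter defaults are illustrative assumptions, not values
    taken from the study.
    """
    rng = np.random.default_rng(seed)
    n = int(round(dur_s * fs))
    t = np.arange(n) / fs

    # Assumption: white Gaussian noise stands in for speech-shaped noise.
    carrier = rng.standard_normal(n)

    # Sinusoidal envelope, phase-shifted so a dip falls at t = dur_s / 2.
    envelope = 1.0 + depth * np.cos(2 * np.pi * rate_hz * (t - dur_s / 2) + np.pi)
    modulated = carrier * envelope

    # Rescale so both maskers share the same long-term RMS level.
    modulated *= rms(carrier) / rms(modulated)
    return carrier, modulated


if __name__ == "__main__":
    # Assumption: octave steps spanning the 1 to 64 Hz range in the abstract.
    for rate in [1, 2, 4, 8, 16, 32, 64]:
        unmod, mod = sam_noise(rate)
        print(rate, rms(unmod), rms(mod))
```

With full modulation depth, the envelope reaches zero at the central dip, which is where a brief probe such as the tone pip of experiment 2 would be placed; the RMS rescaling ensures that any difference between the two maskers reflects the temporal fluctuation rather than overall level.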