Carlyon Robert P, Flanagan Sheila, Deeks John M
MRC Cognition & Brain Sciences Unit, University of Cambridge, 15 Chaucer Rd, Cambridge, CB1 3DA, UK.
J Assoc Res Otolaryngol. 2017 Dec;18(6):815-825. doi: 10.1007/s10162-017-0637-5. Epub 2017 Aug 23.
Forward masking of a sinusoidal signal is determined not only by the masker's power spectrum but also by its phase spectrum. Specifically, when the phase spectrum is such that the output of an auditory filter centred on the signal has a highly modulated ("peaked") envelope, there is less masking than when that envelope is flat. This finding has been attributed to non-linearities, such as compression, reducing the average neural response to maskers that produce more peaked auditory filter outputs (Carlyon and Datta, J Acoust Soc Am 101:3636-3647, 1997). Here we evaluate an alternative explanation proposed by Wotcjzak and Oxenham (Wojtczak and Oxenham, J Assoc Res Otolaryngol 10:595-607, 2009). They reported a masker phase effect for 6-kHz signals when the masker components were at least an octave below the signal frequency. Wotcjzak and Oxenham argued that this effect was inconsistent with cochlear compression, and, because it did not occur at lower signal frequencies, was also inconsistent with more central compression. It was instead attributed to activation of the efferent system reducing the response to the subsequent probe. Here, experiment 1 replicated their main findings. Experiment 2 showed that the phase effect on off-frequency forward masking is similar at signal frequencies of 2 and 6 kHz, provided that one equates the number of components likely to interact within an auditory filter centred on the signal, thereby roughly equating the effect of masker phase on the peakiness of that filter output. Experiment 3 showed that for some subjects, masker phase also had a strong influence on off-frequency backward masking of the signal, and that the size of this effect correlated across subjects with that observed in forward masking. We conclude that the masker phase effect is mediated mainly by cochlear non-linearities, with a possible additional effect of more central compression. The data are not consistent with a role for the efferent system.
正弦信号的前掩蔽不仅取决于掩蔽声的功率谱,还取决于其相位谱。具体而言,当相位谱使得以该信号为中心的听觉滤波器的输出具有高度调制(“尖峰状”)的包络时,掩蔽作用比包络平坦时要小。这一发现归因于诸如压缩等非线性因素,它会降低对产生更尖峰状听觉滤波器输出的掩蔽声的平均神经反应(卡利昂和达塔,《美国声学学会杂志》101:3636 - 3647,1997)。在此,我们评估了沃伊茨扎克和奥克森汉姆提出的另一种解释(沃伊茨扎克和奥克森汉姆,《耳鼻咽喉研究学会杂志》10:595 - 607,2009)。他们报告称,当掩蔽声声成分至少比信号频率低一个倍频程时,对于6千赫的信号存在掩蔽声相位效应。沃伊茨扎克和奥克森汉姆认为,这种效应与耳蜗压缩不一致,并且由于在较低信号频率时未出现,所以也与更中枢的压缩不一致。相反,它被归因于传出系统的激活降低了对后续探测声的反应。在此,实验1重复了他们的主要发现。实验2表明,在2千赫和6千赫的信号频率下,对偏离频率的前掩蔽的相位效应是相似的,前提是使以信号为中心的听觉滤波器内可能相互作用的成分数量相等,从而大致使掩蔽声相位对该滤波器输出尖峰度的影响相等。实验3表明,对于一些受试者,掩蔽声相位对信号的偏离频率后掩蔽也有强烈影响,并且这种效应的大小在不同受试者之间与在前掩蔽中观察到的效应相关。我们得出结论,掩蔽声相位效应主要由耳蜗非线性介导,可能还有更中枢压缩的额外效应。这些数据与传出系统的作用不一致。