Michael A. Stone, Brian C. J. Moore
Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, England.
J Acoust Soc Am. 2008 Feb;123(2):1063-76. doi: 10.1121/1.2821969.
These experiments are concerned with the intelligibility of target speech in the presence of a background talker. Using a noise vocoder, Stone and Moore [J. Acoust. Soc. Am. 114, 1023-1034 (2003)] showed that single-channel fast-acting compression degraded intelligibility, but slow compression did not. Stone and Moore [J. Acoust. Soc. Am. 116, 2311-2323 (2004)] showed that intelligibility was lower when fast single-channel compression was applied to the target and background after mixing rather than before, and suggested that this was partly due to compression after mixing introducing "comodulation" between the target and background talkers. Experiment 1 here showed a similar effect for multi-channel compression. In experiment 2, intelligibility was measured as a function of the speed of multi-channel compression applied after mixing. For both eight- and 12-channel vocoders with one compressor per channel, intelligibility decreased as compression speed increased. For the eight-channel vocoder, a compressor that only affected modulation depth for rates below 2 Hz still reduced intelligibility. Experiment 3 used 12- or 18-channel vocoders. There were between 1 and 12 compression channels, and four speeds of compression. Intelligibility decreased as the number and speed of compression channels increased. The results are interpreted using several measures of the effects of compression, especially "across-source modulation correlation."
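The "comodulation" argument can be illustrated with a minimal sketch, under stated assumptions. It is not the vocoder processing or the exact "across-source modulation correlation" measure used in these experiments. Two statistically independent synthetic envelopes stand in for the target and background talkers in a single channel, one compressor acts on their mixture, and the Pearson correlation between the two output envelopes (in dB) is computed. The function names, the 3:1 compression ratio, the one-pole level smoothing, and the time constants (in envelope samples) are all hypothetical choices made for illustration.

```python
import math
import random


def smooth(x, tau):
    """One-pole low-pass filter; tau is the time constant in samples."""
    a = math.exp(-1.0 / tau)
    state = x[0]
    out = []
    for v in x:
        state = a * state + (1.0 - a) * v
        out.append(state)
    return out


def log_envelope(rng, n, tau=10.0, sd_db=8.0):
    """Synthetic envelope in dB: smoothed Gaussian noise scaled to sd_db."""
    s = smooth([rng.gauss(0.0, 1.0) for _ in range(n)], tau)
    mean = sum(s) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in s) / n)
    return [sd_db * (v - mean) / sd for v in s]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def compress_after_mixing(target_db, background_db, tau, ratio=3.0):
    """Apply one compressor to the mixture (assumed simplification).

    The gain is derived from the smoothed mixture level and is therefore
    common to both sources; a small tau models fast-acting compression.
    """
    mix_db = [
        10.0 * math.log10(10.0 ** (t / 10.0) + 10.0 ** (b / 10.0))
        for t, b in zip(target_db, background_db)
    ]
    gain_db = [-(1.0 - 1.0 / ratio) * lv for lv in smooth(mix_db, tau)]
    target_out = [t + g for t, g in zip(target_db, gain_db)]
    background_out = [b + g for b, g in zip(background_db, gain_db)]
    return target_out, background_out


def correlation_demo(seed=0, n=20000):
    """Envelope correlation with no, slow, and fast compression."""
    rng = random.Random(seed)
    target = log_envelope(rng, n)
    background = log_envelope(rng, n)
    slow = compress_after_mixing(target, background, tau=500.0)
    fast = compress_after_mixing(target, background, tau=1.0)
    return {
        "none": pearson(target, background),
        "slow": pearson(*slow),
        "fast": pearson(*fast),
    }


results = correlation_demo()
for name, r in results.items():
    print(f"{name:>4}: r = {r:+.2f}")
```

In this toy model the shared gain falls whenever either source is intense, so fast compression pulls the two output envelopes toward a negative correlation, while slow compression leaves them close to uncorrelated. That is the qualitative pattern the abstract attributes to compression applied after mixing; the numerical values depend entirely on the assumed parameters.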