Medizinische Physik and Cluster of Excellence Hearing4All Carl-von-Ossietzky Universität Oldenburg, Oldenburg, Germany.
Trends Hear. 2020 Jan-Dec;24:2331216520975630. doi: 10.1177/2331216520975630.
The equalization cancellation model is often used to predict the binaural masking level difference. Previously its application to speech in noise has required separate knowledge about the speech and noise signals to maximize the signal-to-noise ratio (SNR). Here, a novel, equalization cancellation model is introduced that can use the mixed signals. This approach does not require any assumptions about particular sound source directions. It uses different strategies for positive and negative SNRs, with the switching between the two steered by a blind decision stage utilizing modulation cues. The output of the model is a single-channel signal with enhanced SNR, which we analyzed using the speech intelligibility index to compare speech intelligibility predictions. In a first experiment, the model was tested on experimental data obtained in a scenario with spatially separated target and masker signals. Predicted speech recognition thresholds were in good agreement with measured speech recognition thresholds with a root mean square error less than 1 dB. A second experiment investigated signals at positive SNRs, which was achieved using time compressed and low-pass filtered speech. The results demonstrated that binaural unmasking of speech occurs at positive SNRs and that the modulation-based switching strategy can predict the experimental results.
均衡抵消模型常用于预测双耳掩蔽级差。在此之前,其在噪声中语音的应用需要关于语音和噪声信号的单独知识以实现最大信噪比(SNR)。在此引入了一种新颖的均衡抵消模型,它可以使用混合信号。这种方法不需要对特定声源方向做出任何假设。它为正 SNR 和负 SNR 使用不同的策略,两者之间的切换由利用调制线索的盲目决策阶段引导。该模型的输出是一个具有增强 SNR 的单声道信号,我们使用语音可懂度指数来分析该信号,以比较语音可懂度预测。在第一个实验中,该模型在目标信号和掩蔽信号空间分离的场景中获取的实验数据上进行了测试。预测的语音识别阈值与测量的语音识别阈值非常吻合,均方根误差小于 1dB。第二个实验研究了正 SNR 下的信号,该 SNR 是通过时间压缩和低通滤波语音实现的。结果表明,正 SNR 下会发生语音的双耳掩蔽,且基于调制的切换策略可以预测实验结果。