Baumgärtel Regina M, Krawczyk-Becker Martin, Marquardt Daniel, Völker Christoph, Hu Hongmei, Herzke Tobias, Coleman Graham, Adiloğlu Kamil, Ernst Stephan M A, Gerkmann Timo, Doclo Simon, Kollmeier Birger, Hohmann Volker, Dietz Mathias
Medical Physics Group, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany Cluster of Excellence "Hearing4all," Oldenburg, Germany
Cluster of Excellence "Hearing4all," Oldenburg, Germany Speech Signal Processing Group, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany.
Trends Hear. 2015 Dec 30;19:2331216515617916. doi: 10.1177/2331216515617916.
In a collaborative research project, several monaural and binaural noise reduction algorithms have been comprehensively evaluated. In this article, eight selected noise reduction algorithms were assessed using instrumental measures, with a focus on the instrumental evaluation of speech intelligibility. Four distinct, reverberant scenarios were created to reflect everyday listening situations: a stationary speech-shaped noise, a multitalker babble noise, a single interfering talker, and a realistic cafeteria noise. Three instrumental measures were employed to assess predicted speech intelligibility and predicted sound quality: the intelligibility-weighted signal-to-noise ratio, the short-time objective intelligibility measure, and the perceptual evaluation of speech quality. The results show substantial improvements in predicted speech intelligibility as well as sound quality for the proposed algorithms. The evaluated coherence-based noise reduction algorithm was able to provide improvements in predicted audio signal quality. For the tested single-channel noise reduction algorithm, improvements in intelligibility-weighted signal-to-noise ratio were observed in all but the nonstationary cafeteria ambient noise scenario. Binaural minimum variance distortionless response beamforming algorithms performed particularly well in all noise scenarios.
在一个合作研究项目中,对几种单声道和双耳降噪算法进行了全面评估。在本文中,使用仪器测量方法对八种选定的降噪算法进行了评估,重点是语音清晰度的仪器评估。创建了四种不同的混响场景以反映日常聆听情况:平稳的语音形状噪声、多说话者的嘈杂噪声、单个干扰说话者以及逼真的自助餐厅噪声。采用了三种仪器测量方法来评估预测的语音清晰度和预测的声音质量:清晰度加权信噪比、短时客观清晰度测量和语音质量感知评估。结果表明,所提出的算法在预测的语音清晰度和声音质量方面有显著提高。所评估的基于相干性的降噪算法能够改善预测的音频信号质量。对于测试的单通道降噪算法,除了非平稳的自助餐厅环境噪声场景外,在所有场景中都观察到了清晰度加权信噪比的提高。双耳最小方差无失真响应波束形成算法在所有噪声场景中表现尤其出色。