Schubotz Wiebke, Brand Thomas, Kollmeier Birger, Ewert Stephan D
Medizinische Physik and Cluster of Excellence Hearing4all, Universität Oldenburg, D-26111 Oldenburg, Germany.
J Acoust Soc Am. 2016 Jul;140(1):524. doi: 10.1121/1.4955079.
Speech intelligibility is strongly affected by the presence of maskers. Depending on the spectro-temporal structure of the masker and its similarity to the target speech, different masking aspects can occur which are typically referred to as energetic, amplitude modulation, and informational masking. In this study speech intelligibility and speech detection was measured in maskers that vary systematically in the time-frequency domain from steady-state noise to a single interfering talker. Male and female target speech was used in combination with maskers based on speech for the same or different gender. Observed data were compared to predictions of the speech intelligibility index, extended speech intelligibility index, multi-resolution speech-based envelope-power-spectrum model, and the short-time objective intelligibility measure. The different models served as analysis tool to help distinguish between the different masking aspects. Comparison shows that overall masking can to a large extent be explained by short-term energetic masking. However, the other masking aspects (amplitude modulation an informational masking) influence speech intelligibility as well. Additionally, it was obvious that all models showed considerable deviations from the data. Therefore, the current study provides a benchmark for further evaluation of speech prediction models.
掩蔽声的存在会对言语可懂度产生强烈影响。根据掩蔽声的频谱 - 时间结构及其与目标语音的相似性,会出现不同的掩蔽情况,通常分别称为能量掩蔽、幅度调制掩蔽和信息掩蔽。在本研究中,在从稳态噪声到单个干扰说话者的时频域中系统变化的掩蔽声条件下测量了言语可懂度和言语检测。使用男性和女性目标语音,并结合基于相同或不同性别的语音的掩蔽声。将观察到的数据与言语可懂度指数、扩展言语可懂度指数、多分辨率基于语音的包络 - 功率谱模型以及短时客观可懂度测量的预测结果进行比较。不同的模型用作分析工具,以帮助区分不同的掩蔽情况。比较表明,总体掩蔽在很大程度上可以由短期能量掩蔽来解释。然而,其他掩蔽情况(幅度调制掩蔽和信息掩蔽)也会影响言语可懂度。此外,很明显所有模型与数据都存在相当大的偏差。因此,本研究为进一步评估语音预测模型提供了一个基准。