1 Medizinische Physik and Cluster of Excellence "Hearing4all," Carl von Ossietzky Universität Oldenburg, Germany.
Trends Hear. 2018 Jan-Dec;22:2331216518768954. doi: 10.1177/2331216518768954.
The simulation framework for auditory discrimination experiments (FADE) was adopted and validated to predict the individual speech-in-noise recognition performance of listeners with normal and impaired hearing with and without a given hearing-aid algorithm. FADE uses a simple automatic speech recognizer (ASR) to estimate the lowest achievable speech reception thresholds (SRTs) from simulated speech recognition experiments in an objective way, independent from any empirical reference data. Empirical data from the literature were used to evaluate the model in terms of predicted SRTs and benefits in SRT with the German matrix sentence recognition test when using eight single- and multichannel binaural noise-reduction algorithms. To allow individual predictions of SRTs in binaural conditions, the model was extended with a simple better ear approach and individualized by taking audiograms into account. In a realistic binaural cafeteria condition, FADE explained about 90% of the variance of the empirical SRTs for a group of normal-hearing listeners and predicted the corresponding benefits with a root-mean-square prediction error of 0.6 dB. This highlights the potential of the approach for the objective assessment of benefits in SRT without prior knowledge about the empirical data. The predictions for the group of listeners with impaired hearing explained 75% of the empirical variance, while the individual predictions explained less than 25%. Possibly, additional individual factors should be considered for more accurate predictions with impaired hearing. A competing talker condition clearly showed one limitation of current ASR technology, as the empirical performance with SRTs lower than -20 dB could not be predicted.
采用了听觉辨别实验模拟框架 (FADE) 对正常和听力受损的个体在有无特定助听器算法情况下的言语感知能力进行预测。FADE 使用简单的自动语音识别器 (ASR) 客观地从模拟语音识别实验中估计最低可实现的言语接受阈限 (SRT),而不依赖任何经验参考数据。使用文献中的经验数据,从德国矩阵句识别测试的 SRT 预测值和 SRT 增益方面评估模型,使用了八种单通道和多通道双耳降噪算法。为了能够在双耳条件下进行个体 SRT 预测,该模型通过采用简单的优势耳方法进行了扩展,并通过考虑听力图进行了个体化。在逼真的双耳自助餐厅环境中,FADE 解释了一组正常听力个体的经验 SRT 的约 90%的方差,并以 0.6dB 的均方根预测误差预测了相应的增益。这突显了该方法在无需事先了解经验数据的情况下对 SRT 增益进行客观评估的潜力。对于听力受损的听众组的预测解释了经验方差的 75%,而个体预测的解释不到 25%。可能需要考虑更多的个体因素,以便对听力受损的情况进行更准确的预测。竞争说话者条件清楚地显示了当前 ASR 技术的一个局限性,因为无法预测 SRT 低于-20dB 的经验表现。