Objective Prediction of Hearing Aid Benefit Across Listener Groups Using Machine Learning: Speech Recognition Performance With Binaural Noise-Reduction Algorithms.

Affiliation

1 Medizinische Physik and Cluster of Excellence "Hearing4all," Carl von Ossietzky Universität Oldenburg, Germany.

Publication information

Trends Hear. 2018 Jan-Dec;22:2331216518768954. doi: 10.1177/2331216518768954.

DOI:10.1177/2331216518768954

PMID:29692200

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5949929/

Abstract

The simulation framework for auditory discrimination experiments (FADE) was adopted and validated to predict the individual speech-in-noise recognition performance of listeners with normal and impaired hearing, with and without a given hearing-aid algorithm. FADE uses a simple automatic speech recognizer (ASR) to estimate the lowest achievable speech reception thresholds (SRTs) from simulated speech recognition experiments in an objective way, independent of any empirical reference data. Empirical data from the literature were used to evaluate the model in terms of predicted SRTs and benefits in SRT with the German matrix sentence recognition test when using eight single- and multichannel binaural noise-reduction algorithms. To allow individual predictions of SRTs in binaural conditions, the model was extended with a simple better-ear approach and individualized by taking audiograms into account. In a realistic binaural cafeteria condition, FADE explained about 90% of the variance of the empirical SRTs for a group of normal-hearing listeners and predicted the corresponding benefits with a root-mean-square prediction error of 0.6 dB. This highlights the potential of the approach for the objective assessment of benefits in SRT without prior knowledge of the empirical data. The predictions for the group of listeners with impaired hearing explained 75% of the empirical variance, while the individual predictions explained less than 25%. Additional individual factors may need to be considered for more accurate predictions for listeners with impaired hearing. A competing-talker condition clearly showed one limitation of current ASR technology, as the empirical performance with SRTs lower than -20 dB could not be predicted.
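The evaluation quantities named in the abstract (better-ear SRT, benefit in SRT, root-mean-square prediction error, and explained variance) can be illustrated with a minimal Python sketch. This is not the FADE implementation: the function names are hypothetical, the better-ear rule is assumed to mean taking the lower of the two monaural SRTs, and explained variance is assumed to be the squared Pearson correlation between predicted and empirical SRTs. The numbers in the usage part are made up for illustration and are not data from the study.

```python
import math


def better_ear_srt(srt_left, srt_right):
    """Assumed better-ear rule: the lower (better) monaural SRT in dB SNR."""
    return min(srt_left, srt_right)


def srt_benefit(srt_unprocessed, srt_processed):
    """Benefit in dB: positive when the algorithm lowers the SRT."""
    return srt_unprocessed - srt_processed


def rms_error(predicted, empirical):
    """Root-mean-square difference between paired predictions and data."""
    squared = [(p - e) ** 2 for p, e in zip(predicted, empirical)]
    return math.sqrt(sum(squared) / len(squared))


def explained_variance(predicted, empirical):
    """Squared Pearson correlation (an assumed reading of 'explained variance')."""
    n = len(predicted)
    mean_p = sum(predicted) / n
    mean_e = sum(empirical) / n
    cov = sum((p - mean_p) * (e - mean_e) for p, e in zip(predicted, empirical))
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    var_e = sum((e - mean_e) ** 2 for e in empirical)
    return cov ** 2 / (var_p * var_e)


if __name__ == "__main__":
    # Illustrative values only, in dB SNR; not taken from the paper.
    predicted = [-6.0, -8.5, -10.0, -7.0]
    empirical = [-5.5, -9.0, -10.5, -6.0]
    print("better-ear SRT:", better_ear_srt(-6.0, -4.5))
    print("benefit:", srt_benefit(-6.0, -8.5))
    print("RMS error:", rms_error(predicted, empirical))
    print("explained variance:", explained_variance(predicted, empirical))
```

Benefits would be computed per algorithm from the unprocessed and processed SRTs before comparing predicted and empirical benefits with `rms_error`.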


Figure 1 (image): https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b41/5949929/0a886534b383/10.1177_2331216518768954-fig1.jpg
