• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用听觉模型预测正常听力者在噪声环境下的言语识别能力。

Microscopic prediction of speech recognition for listeners with normal hearing in noise using an auditory model.

机构信息

Medizinische Physik, Universitat Oldenburg, D-26111 Oldenburg, Germany.

出版信息

J Acoust Soc Am. 2009 Nov;126(5):2635-48. doi: 10.1121/1.3224721.

DOI:10.1121/1.3224721
PMID:19894841
Abstract

This study compares the phoneme recognition performance in speech-shaped noise of a microscopic model for speech recognition with the performance of normal-hearing listeners. "Microscopic" is defined in terms of this model twofold. First, the speech recognition rate is predicted on a phoneme-by-phoneme basis. Second, microscopic modeling means that the signal waveforms to be recognized are processed by mimicking elementary parts of human's auditory processing. The model is based on an approach by Holube and Kollmeier [J. Acoust. Soc. Am. 100, 1703-1716 (1996)] and consists of a psychoacoustically and physiologically motivated preprocessing and a simple dynamic-time-warp speech recognizer. The model is evaluated while presenting nonsense speech in a closed-set paradigm. Averaged phoneme recognition rates, specific phoneme recognition rates, and phoneme confusions are analyzed. The influence of different perceptual distance measures and of the model's a-priori knowledge is investigated. The results show that human performance can be predicted by this model using an optimal detector, i.e., identical speech waveforms for both training of the recognizer and testing. The best model performance is yielded by distance measures which focus mainly on small perceptual distances and neglect outliers.

摘要

本研究比较了语音识别微观模型与正常听力者在语音噪声中的音位识别性能。“微观”在该模型中有两方面的定义。首先,语音识别率是逐音位预测的。其次,微观建模意味着要识别的信号波形通过模仿人类听觉处理的基本部分进行处理。该模型基于 Holube 和 Kollmeier 的方法[J. Acoust. Soc. Am. 100, 1703-1716 (1996)],由一个具有心理声学和生理学动机的预处理和一个简单的动态时间扭曲语音识别器组成。该模型在闭集范式中呈现无意义语音时进行评估。分析了平均音位识别率、特定音位识别率和音位混淆。研究了不同感知距离度量和模型先验知识的影响。结果表明,使用最优检测器可以通过该模型预测人类性能,即训练识别器和测试的语音波形完全相同。最佳的模型性能由主要关注小感知距离且忽略异常值的距离度量产生。

相似文献

1
Microscopic prediction of speech recognition for listeners with normal hearing in noise using an auditory model.利用听觉模型预测正常听力者在噪声环境下的言语识别能力。
J Acoust Soc Am. 2009 Nov;126(5):2635-48. doi: 10.1121/1.3224721.
2
Effect of training on word-recognition performance in noise for young normal-hearing and older hearing-impaired listeners.训练对年轻听力正常者和老年听力受损者在噪声环境下单词识别能力的影响。
Ear Hear. 2006 Jun;27(3):263-78. doi: 10.1097/01.aud.0000215980.21158.a2.
3
The benefit obtained from visually displayed text from an automatic speech recognizer during listening to speech presented in noise.在收听有噪声干扰的语音时,从自动语音识别器的可视文本显示中获得的益处。
Ear Hear. 2008 Dec;29(6):838-52. doi: 10.1097/AUD.0b013e31818005bd.
4
Speech recognition in noise: estimating effects of compressive nonlinearities in the basilar-membrane response.噪声中的语音识别:估计基底膜反应中压缩非线性的影响。
Ear Hear. 2007 Sep;28(5):682-93. doi: 10.1097/AUD.0b013e31812f7156.
5
Prediction of the influence of reverberation on binaural speech intelligibility in noise and in quiet.预测混响对噪声和安静环境下双耳语音可懂度的影响。
J Acoust Soc Am. 2011 Nov;130(5):2999-3012. doi: 10.1121/1.3641368.
6
Spectral and temporal cues for phoneme recognition in noise.噪声中音素识别的频谱和时间线索。
J Acoust Soc Am. 2007 Sep;122(3):1758. doi: 10.1121/1.2767000.
7
Perception of clear fricatives by normal-hearing and simulated hearing-impaired listeners.正常听力和模拟听力受损听众对清擦音的感知。
J Acoust Soc Am. 2008 Feb;123(2):1114-25. doi: 10.1121/1.2821966.
8
A computer model of auditory efferent suppression: implications for the recognition of speech in noise.听觉传出抑制的计算机模型:对噪声中语音识别的影响。
J Acoust Soc Am. 2010 Feb;127(2):943-54. doi: 10.1121/1.3273893.
9
Effect of slow-acting wide dynamic range compression on measures of intelligibility and ratings of speech quality in simulated-loss listeners.慢作用宽动态范围压缩对模拟听力损失听众的言语清晰度测量及言语质量评分的影响。
J Speech Lang Hear Res. 2005 Jun;48(3):702-14. doi: 10.1044/1092-4388(2005/048).
10
Microscopic prediction of speech intelligibility in spatially distributed speech-shaped noise for normal-hearing listeners.正常听力听众在空间分布的言语形状噪声中言语可懂度的微观预测。
J Acoust Soc Am. 2015 Dec;138(6):4004-15. doi: 10.1121/1.4938230.

引用本文的文献

1
Automatic development of speech-in-noise hearing tests using machine learning.利用机器学习自动开展噪声环境下言语听力测试
Sci Rep. 2025 Apr 15;15(1):12878. doi: 10.1038/s41598-025-96312-z.
2
Auditory Spatial Bisection of Blind and Normally Sighted Individuals in Free Field and Virtual Acoustics.在自由声场和虚拟声学环境中,盲人和正常视力个体的听觉空间二分法。
Trends Hear. 2024 Jan-Dec;28:23312165241230947. doi: 10.1177/23312165241230947.
3
Microscopic and Blind Prediction of Speech Intelligibility: Theory and Practice.语音可懂度的微观与盲预测:理论与实践
IEEE/ACM Trans Audio Speech Lang Process. 2022;30:2141-2155. doi: 10.1109/taslp.2022.3184888. Epub 2022 Jun 30.
4
Modelling speech reception thresholds and their improvements due to spatial noise reduction algorithms in bimodal cochlear implant users.模拟双模式人工耳蜗使用者的言语接受阈及其因空间降噪算法而提高的情况。
Hear Res. 2022 Jul;420:108507. doi: 10.1016/j.heares.2022.108507. Epub 2022 Apr 11.
5
Auditory Nerve Fiber Discrimination and Representation of Naturally-Spoken Vowels in Noise.听觉神经纤维对自然语音在噪声中的辨别和表现。
eNeuro. 2022 Feb 14;9(1). doi: 10.1523/ENEURO.0474-21.2021. Print 2022 Jan-Feb.
6
Impact of depression on speech perception in noise.抑郁对噪声下言语感知的影响。
PLoS One. 2019 Aug 15;14(8):e0220928. doi: 10.1371/journal.pone.0220928. eCollection 2019.
7
Spatial Speech-in-Noise Performance in Bimodal and Single-Sided Deaf Cochlear Implant Users.双模式和单侧聋人工耳蜗植入者的空间言语噪声下的表现。
Trends Hear. 2019 Jan-Dec;23:2331216519858311. doi: 10.1177/2331216519858311.
8
Objective Prediction of Hearing Aid Benefit Across Listener Groups Using Machine Learning: Speech Recognition Performance With Binaural Noise-Reduction Algorithms.使用机器学习对不同听众群体的助听器效果进行预测:双耳降噪算法的言语识别性能。
Trends Hear. 2018 Jan-Dec;22:2331216518768954. doi: 10.1177/2331216518768954.
9
The effects of electrical field spatial spread and some cognitive factors on speech-in-noise performance of individual cochlear implant users-A computer model study.电场空间扩展及一些认知因素对个体人工耳蜗使用者噪声下言语感知的影响——一项计算机模型研究。
PLoS One. 2018 Apr 13;13(4):e0193842. doi: 10.1371/journal.pone.0193842. eCollection 2018.
10
Speech recognition error patterns for steady-state noise and interrupted speech.稳态噪声和间断语音的语音识别错误模式。
J Acoust Soc Am. 2017 Sep;142(3):EL306. doi: 10.1121/1.5003916.