• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

噪声环境下语音识别与稀疏性的关系。

Relationship between speech recognition in noise and sparseness.

机构信息

Institute of Sound and Vibration Research, University of Southampton, Southampton, UK.

出版信息

Int J Audiol. 2012 Feb;51(2):75-82. doi: 10.3109/14992027.2011.625984. Epub 2011 Nov 22.

DOI:10.3109/14992027.2011.625984
PMID:22107445
Abstract

OBJECTIVE

Established methods for predicting speech recognition in noise require knowledge of clean speech signals, placing limitations on their application. The study evaluates an alternative approach based on characteristics of noisy speech, specifically its sparseness as represented by the statistic kurtosis.

DESIGN

Experiments 1 and 2 involved acoustic analysis of vowel-consonant-vowel (VCV) syllables in babble noise, comparing kurtosis, glimpsing areas, and extended speech intelligibility index (ESII) of noisy speech signals with one another and with pre-existing speech recognition scores. Experiment 3 manipulated kurtosis of VCV syllables and investigated effects on speech recognition scores in normal-hearing listeners.

STUDY SAMPLE

Pre-existing speech recognition data for Experiments 1 and 2; seven normal-hearing participants for Experiment 3.

RESULTS

Experiments 1 and 2 demonstrated that kurtosis calculated in the time-domain from noisy speech is highly correlated (r > 0.98) with established prediction models: glimpsing and ESII. All three measures predicted speech recognition scores well. The final experiment showed a clear monotonic relationship between speech recognition scores and kurtosis.

CONCLUSIONS

Speech recognition performance in noise is closely related to the sparseness (kurtosis) of the noisy speech signal, at least for the types of speech and noise used here and for listeners with normal hearing.

摘要

目的

现有的噪声环境下语音识别预测方法需要干净语音信号的知识,这限制了它们的应用。本研究评估了一种基于噪声语音特征的替代方法,特别是其稀疏性,以统计量峰度来表示。

设计

实验 1 和 2 涉及在背景噪声中元音-辅音-元音(VCV)音节的声学分析,比较了噪声语音信号的峰度、瞥见区域和扩展语音可懂度指数(ESII)彼此之间以及与现有语音识别分数之间的关系。实验 3 操纵 VCV 音节的峰度,并研究了其对正常听力受试者语音识别分数的影响。

样本

实验 1 和 2 的现有语音识别数据;实验 3 的 7 名正常听力参与者。

结果

实验 1 和 2 表明,从噪声语音中计算出的时域峰度与现有的预测模型(瞥见和 ESII)高度相关(r>0.98)。所有这三个指标都能很好地预测语音识别分数。最后一个实验表明,语音识别分数与峰度之间存在明显的单调关系。

结论

噪声环境下的语音识别性能与噪声语音信号的稀疏性(峰度)密切相关,至少对于这里使用的语音和噪声类型以及听力正常的听众而言是如此。

相似文献

1
Relationship between speech recognition in noise and sparseness.噪声环境下语音识别与稀疏性的关系。
Int J Audiol. 2012 Feb;51(2):75-82. doi: 10.3109/14992027.2011.625984. Epub 2011 Nov 22.
2
Predicting speech intelligibility based on the signal-to-noise envelope power ratio after modulation-frequency selective processing.基于调制频率选择性处理后的信噪比包络功率比预测语音可懂度。
J Acoust Soc Am. 2011 Sep;130(3):1475-87. doi: 10.1121/1.3621502.
3
Improving word recognition in noise among hearing-impaired subjects with a single-channel cochlear noise-reduction algorithm.提高单通道耳蜗降噪算法助听受试者噪声中单词识别能力。
J Acoust Soc Am. 2012 Sep;132(3):1718-31. doi: 10.1121/1.4739441.
4
Phoneme recognition in vocoded maskers by normal-hearing and aided hearing-impaired listeners.正常听力者和助听听力受损者对带通滤波掩蔽声中的音素识别
J Acoust Soc Am. 2014 Aug;136(2):859-66. doi: 10.1121/1.4889863.
5
Speech-in-noise measures: variable versus fixed speech and noise levels.语音噪声测试:语音和噪声的可变与固定水平。
Int J Audiol. 2012 Sep;51(9):708-12. doi: 10.3109/14992027.2012.684407. Epub 2012 May 28.
6
Comparison of fluctuating maskers for speech recognition tests.比较用于语音识别测试的波动掩蔽器。
Int J Audiol. 2011 Jan;50(1):2-13. doi: 10.3109/14992027.2010.505582. Epub 2010 Nov 23.
7
English vowel identification in long-term speech-shaped noise and multi-talker babble for English and Chinese listeners.英语和汉语受试者在长期语音噪声和多说话人噪声中的英语元音识别。
J Acoust Soc Am. 2013 May;133(5):EL391-7. doi: 10.1121/1.4800191.
8
Sentence perception in listening conditions having similar speech intelligibility indices.在言语可懂度指数相似的聆听条件下的句子感知。
Int J Audiol. 2011 Jan;50(1):34-40. doi: 10.3109/14992027.2010.521198. Epub 2010 Nov 4.
9
The effects of binaural spectral resolution mismatch on Mandarin speech perception in simulated electric hearing.双耳频谱分辨率不匹配对人工耳蜗模拟语音感知的影响
J Acoust Soc Am. 2012 Aug;132(2):EL142-8. doi: 10.1121/1.4737595.
10
The relative importance of spectral cues for vowel recognition in severe noise.在强噪声环境中,语音识别中频谱线索的相对重要性。
J Acoust Soc Am. 2012 Oct;132(4):2652-62. doi: 10.1121/1.4751543.

引用本文的文献

1
Sparse Nonnegative Matrix Factorization Strategy for Cochlear Implants.用于人工耳蜗的稀疏非负矩阵分解策略
Trends Hear. 2015 Dec 30;19:2331216515616941. doi: 10.1177/2331216515616941.
2
Development of a real time sparse non-negative matrix factorization module for cochlear implants by using xPC target.利用 xPC 目标开发用于人工耳蜗的实时稀疏非负矩阵分解模块。
Sensors (Basel). 2013 Oct 14;13(10):13861-78. doi: 10.3390/s131013861.
3
A sparse neural code for some speech sounds but not for others.对于某些语音而言,稀疏神经编码,但对于其他语音则不然。
PLoS One. 2012;7(7):e40953. doi: 10.1371/journal.pone.0040953. Epub 2012 Jul 16.