• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

针对英语语音优化的稀疏伽马通信号模型与人类听觉滤波器不匹配。

Sparse gammatone signal model optimized for English speech does not match the human auditory filters.

作者信息

Strahl Stefan, Mertins Alfred

机构信息

International Graduate School for Neurosensory Science and Systems, Carl von Ossietzky University, D-26111 Oldenburg, Germany.

出版信息

Brain Res. 2008 Jul 18;1220:224-33. doi: 10.1016/j.brainres.2007.11.059. Epub 2007 Dec 7.

DOI:10.1016/j.brainres.2007.11.059
PMID:18201689
Abstract

Evidence that neurosensory systems use sparse signal representations as well as improved performance of signal processing algorithms using sparse signal models raised interest in sparse signal coding in the last years. For natural audio signals like speech and environmental sounds, gammatone atoms have been derived as expansion functions that generate a nearly optimal sparse signal model (Smith, E., Lewicki, M., 2006. Efficient auditory coding. Nature 439, 978-982). Furthermore, gammatone functions are established models for the human auditory filters. Thus far, a practical application of a sparse gammatone signal model has been prevented by the fact that deriving the sparsest representation is, in general, computationally intractable. In this paper, we applied an accelerated version of the matching pursuit algorithm for gammatone dictionaries allowing real-time and large data set applications. We show that a sparse signal model in general has advantages in audio coding and that a sparse gammatone signal model encodes speech more efficiently in terms of sparseness than a sparse modified discrete cosine transform (MDCT) signal model. We also show that the optimal gammatone parameters derived for English speech do not match the human auditory filters, suggesting for signal processing applications to derive the parameters individually for each applied signal class instead of using psychometrically derived parameters. For brain research, it means that care should be taken with directly transferring findings of optimality for technical to biological systems.

摘要

近年来,神经感觉系统使用稀疏信号表示的证据以及使用稀疏信号模型的信号处理算法性能的提升引发了人们对稀疏信号编码的兴趣。对于语音和环境声音等自然音频信号,伽马通原子已被推导为生成近乎最优稀疏信号模型的扩展函数(史密斯,E.,莱维基,M.,2006年。高效听觉编码。《自然》439卷,978 - 982页)。此外,伽马通函数是人类听觉滤波器的既定模型。到目前为止,稀疏伽马通信号模型的实际应用受到这样一个事实的阻碍,即一般来说,推导最稀疏表示在计算上是难以处理的。在本文中,我们将匹配追踪算法的加速版本应用于伽马通字典,从而实现实时和大数据集应用。我们表明,一般而言,稀疏信号模型在音频编码方面具有优势,并且稀疏伽马通信号模型在稀疏性方面比稀疏改进离散余弦变换(MDCT)信号模型更有效地编码语音。我们还表明,为英语语音推导的最优伽马通参数与人类听觉滤波器不匹配,这表明在信号处理应用中应针对每个应用的信号类别单独推导参数,而不是使用心理测量推导的参数。对于脑研究而言,这意味着在将技术系统的最优性研究结果直接应用于生物系统时应谨慎行事。

相似文献

1
Sparse gammatone signal model optimized for English speech does not match the human auditory filters.针对英语语音优化的稀疏伽马通信号模型与人类听觉滤波器不匹配。
Brain Res. 2008 Jul 18;1220:224-33. doi: 10.1016/j.brainres.2007.11.059. Epub 2007 Dec 7.
2
Analysis and design of gammatone signal models.伽马 tone 信号模型的分析与设计。
J Acoust Soc Am. 2009 Nov;126(5):2379-89. doi: 10.1121/1.3212919.
3
Efficient auditory coding.高效听觉编码
Nature. 2006 Feb 23;439(7079):978-82. doi: 10.1038/nature04485.
4
Perceptual relevance of the temporal envelope to the speech signal in the 4-7 kHz band.4-7千赫兹频段中时间包络与语音信号的感知相关性。
J Acoust Soc Am. 2007 Sep;122(3):EL88. doi: 10.1121/1.2761927.
5
Estimating sparse spectro-temporal receptive fields with natural stimuli.利用自然刺激估计稀疏的光谱-时间感受野。
Network. 2007 Sep;18(3):191-212. doi: 10.1080/09548980701609235. Epub 2007 Sep 7.
6
Auditory memory: a comparison between humans and starlings.听觉记忆:人类与椋鸟的比较
Brain Res. 2008 Jul 18;1220:33-46. doi: 10.1016/j.brainres.2008.01.049. Epub 2008 Jan 30.
7
Resolving precise temporal processing properties of the auditory system using continuous stimuli.使用连续刺激来解析听觉系统精确的时间处理特性。
J Neurophysiol. 2009 Jul;102(1):349-59. doi: 10.1152/jn.90896.2008. Epub 2009 May 13.
8
Mu wave suppression during the perception of meaningless syllables: EEG evidence of motor recruitment.无意义音节感知过程中的缪波抑制:运动募集的脑电图证据。
Neuropsychologia. 2009 Oct;47(12):2558-63. doi: 10.1016/j.neuropsychologia.2009.05.001. Epub 2009 May 13.
9
Auditory and auditory-visual intelligibility of speech in fluctuating maskers for normal-hearing and hearing-impaired listeners.正常听力和听力受损听众在波动掩蔽声中语音的听觉及视听清晰度
J Acoust Soc Am. 2009 May;125(5):3358-72. doi: 10.1121/1.3110132.
10
Estimation of sparse nonnegative sources from noisy overcomplete mixtures using MAP.使用最大后验概率(MAP)从含噪超完备混合信号中估计稀疏非负源。
Neural Comput. 2009 Dec;21(12):3487-518. doi: 10.1162/neco.2009.08-08-846.