• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用耳蜗模型预测增强宽带语音的质量。

Predicting the quality of enhanced wideband speech with a cochlear model.

作者信息

Wirtzfeld Michael R, Pourmand Nazanin, Parsa Vijay, Bruce Ian C

机构信息

Department of Electrical and Computer Engineering, McMaster University, 1280 Main Street West, Hamilton, Ontario L8S 4K1, Canada.

Knowles Intelligent Audio, Mountain View, California 94043, USA.

出版信息

J Acoust Soc Am. 2017 Sep;142(3):EL319. doi: 10.1121/1.5003785.

DOI:10.1121/1.5003785
PMID:28964067
Abstract

Objective measures are commonly used in the development of speech coding algorithms as an adjunct to human subjective evaluation. Predictors of speech quality based on models of physiological or perceptual processing tend to perform better than measures based on simple acoustical properties. Here, a modeling method based on a detailed physiological model and a neurogram similarity measure is developed and optimized to predict the quality of an enhanced wideband speech dataset. A model capturing temporal modulations in neural activity up to 267 Hz was found to perform as well as or better than several existing objective quality measures.

摘要

客观测量在语音编码算法的开发中通常作为人类主观评估的辅助手段被广泛使用。基于生理或感知处理模型的语音质量预测指标往往比基于简单声学特性的指标表现更好。在此,开发并优化了一种基于详细生理模型和神经图相似性度量的建模方法,以预测增强型宽带语音数据集的质量。结果发现,一个能够捕捉高达267Hz神经活动时间调制的模型,其性能与几种现有的客观质量度量相当或更好。

相似文献

1
Predicting the quality of enhanced wideband speech with a cochlear model.使用耳蜗模型预测增强宽带语音的质量。
J Acoust Soc Am. 2017 Sep;142(3):EL319. doi: 10.1121/1.5003785.
2
Perceptual and Model-Based Evaluation of Ideal Time-Frequency Noise Reduction in Hearing-Impaired Listeners.听觉障碍者理想时频降噪的感知和基于模型的评估。
IEEE Trans Neural Syst Rehabil Eng. 2018 Mar;26(3):687-697. doi: 10.1109/TNSRE.2018.2794557.
3
Speech intelligibility is best predicted by intensity, not cochlea-scaled entropy.语音清晰度最好由强度来预测,而非耳蜗标度熵。
J Acoust Soc Am. 2017 Sep;142(3):EL264. doi: 10.1121/1.5002149.
4
Perceptual relevance of the temporal envelope to the speech signal in the 4-7 kHz band.4-7千赫兹频段中时间包络与语音信号的感知相关性。
J Acoust Soc Am. 2007 Sep;122(3):EL88. doi: 10.1121/1.2761927.
5
Speech Categorization Reveals the Role of Early-Stage Temporal-Coherence Processing in Auditory Scene Analysis.言语分类揭示了早期时间相干性处理在听觉场景分析中的作用。
J Neurosci. 2022 Jan 12;42(2):240-254. doi: 10.1523/JNEUROSCI.1610-21.2021. Epub 2021 Nov 11.
6
Psychophysiological analyses demonstrate the importance of neural envelope coding for speech perception in noise.心理生理分析表明,神经包络编码对噪声中言语感知很重要。
J Neurosci. 2012 Feb 1;32(5):1747-56. doi: 10.1523/JNEUROSCI.4493-11.2012.
7
Comparing Binaural Pre-processing Strategies I: Instrumental Evaluation.比较双耳预处理策略I:仪器评估。
Trends Hear. 2015 Dec 30;19:2331216515617916. doi: 10.1177/2331216515617916.
8
A composite model of the auditory periphery for simulating responses to complex sounds.一种用于模拟对复杂声音响应的听觉外周复合模型。
J Acoust Soc Am. 1999 Oct;106(4 Pt 1):1852-64. doi: 10.1121/1.427935.
9
Spectro-temporal modulation energy based mask for robust speaker identification.基于谱时调制能量的掩蔽稳健说话人识别。
J Acoust Soc Am. 2012 May;131(5):EL368-74. doi: 10.1121/1.3697534.
10
Speech quality evaluation of a sparse coding shrinkage noise reduction algorithm with normal hearing and hearing impaired listeners.针对正常听力和听力受损听众的稀疏编码收缩降噪算法的语音质量评估
Hear Res. 2015 Sep;327:175-85. doi: 10.1016/j.heares.2015.07.019. Epub 2015 Jul 29.

引用本文的文献

1
An overview of the HASPI and HASQI metrics for predicting speech intelligibility and speech quality for normal hearing, hearing loss, and hearing aids.HASPI 和 HASQI 指标概述,用于预测正常听力、听力损失和助听器的语音可懂度和语音质量。
Hear Res. 2022 Dec;426:108608. doi: 10.1016/j.heares.2022.108608. Epub 2022 Sep 13.
2
Simple transformations capture auditory input to cortex.简单的转换可以捕捉到听觉输入到大脑皮层。
Proc Natl Acad Sci U S A. 2020 Nov 10;117(45):28442-28451. doi: 10.1073/pnas.1922033117. Epub 2020 Oct 23.