• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

语音增强算法的主观比较与评估

Subjective comparison and evaluation of speech enhancement algorithms.

作者信息

Hu Yi, Loizou Philipos C

机构信息

Department of Electrical Engineering The University of Texas at Dallas Richardson, Texas 75083-0688, USA.

出版信息

Speech Commun. 2007 Jul;49(7):588-601. doi: 10.1016/j.specom.2006.12.006.

DOI:10.1016/j.specom.2006.12.006
PMID:18046463
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2098693/
Abstract

Making meaningful comparisons between the performance of the various speech enhancement algorithms proposed over the years, has been elusive due to lack of a common speech database, differences in the types of noise used and differences in the testing methodology. To facilitate such comparisons, we report on the development of a noisy speech corpus suitable for evaluation of speech enhancement algorithms. This corpus is subsequently used for the subjective evaluation of 13 speech enhancement methods encompassing four classes of algorithms: spectral subtractive, subspace, statistical-model based and Wiener-type algorithms. The subjective evaluation was performed by Dynastat, Inc. using the ITU-T P.835 methodology designed to evaluate the speech quality along three dimensions: signal distortion, noise distortion and overall quality. This paper reports the results of the subjective tests.

摘要

多年来,由于缺乏通用的语音数据库、所使用噪声类型的差异以及测试方法的不同,要对提出的各种语音增强算法的性能进行有意义的比较一直难以实现。为便于进行此类比较,我们报告了一个适用于评估语音增强算法的带噪语音语料库的开发情况。随后,该语料库被用于对涵盖四类算法(谱减法、子空间法、基于统计模型的方法和维纳型算法)的13种语音增强方法进行主观评估。主观评估由Dynastat公司采用国际电联电信标准化部门(ITU-T)P.835方法进行,该方法旨在从信号失真、噪声失真和整体质量三个维度评估语音质量。本文报告了主观测试的结果。

相似文献

1
Subjective comparison and evaluation of speech enhancement algorithms.语音增强算法的主观比较与评估
Speech Commun. 2007 Jul;49(7):588-601. doi: 10.1016/j.specom.2006.12.006.
2
Wearable Hearing Device Spectral Enhancement Driven by Non-Negative Sparse Coding-Based Residual Noise Reduction.基于非负稀疏编码的残余噪声降低驱动的可穿戴听力设备频谱增强
Sensors (Basel). 2020 Oct 10;20(20):5751. doi: 10.3390/s20205751.
3
A comparative intelligibility study of single-microphone noise reduction algorithms.单麦克风降噪算法的可懂度对比研究。
J Acoust Soc Am. 2007 Sep;122(3):1777. doi: 10.1121/1.2766778.
4
Two microphones spectral-coherence based speech enhancement for hearing aids using smartphone as an assistive device.基于两个麦克风频谱相干性的助听器语音增强,利用智能手机作为辅助设备。
Annu Int Conf IEEE Eng Med Biol Soc. 2016 Aug;2016:3670-3673. doi: 10.1109/EMBC.2016.7591524.
5
Estimation of a priori signal-to-noise ratio using neurograms for speech enhancement.使用神经图估计先验信噪比以进行语音增强。
J Acoust Soc Am. 2020 Jun;147(6):3830. doi: 10.1121/10.0001324.
6
A Hybrid Speech Enhancement Algorithm for Voice Assistance Application.一种用于语音助手应用的混合语音增强算法。
Sensors (Basel). 2021 Oct 23;21(21):7025. doi: 10.3390/s21217025.
7
A speech enhancement method for cochlear implant listeners.一种针对人工耳蜗使用者的语音增强方法。
Annu Int Conf IEEE Eng Med Biol Soc. 2013;2013:2036-9. doi: 10.1109/EMBC.2013.6609931.
8
Speech quality evaluation of a sparse coding shrinkage noise reduction algorithm with normal hearing and hearing impaired listeners.针对正常听力和听力受损听众的稀疏编码收缩降噪算法的语音质量评估
Hear Res. 2015 Sep;327:175-85. doi: 10.1016/j.heares.2015.07.019. Epub 2015 Jul 29.
9
Auditory models of suprathreshold distortion and speech intelligibility in persons with impaired hearing.听力受损者的超阈值失真与言语可懂度的听觉模型。
J Am Acad Audiol. 2013 Apr;24(4):307-28. doi: 10.3766/jaaa.24.4.6.
10
Evaluation of the sparse coding shrinkage noise reduction algorithm in normal hearing and hearing impaired listeners.正常听力和听力受损听众中稀疏编码收缩降噪算法的评估
Hear Res. 2014 Apr;310:36-47. doi: 10.1016/j.heares.2014.01.006. Epub 2014 Feb 2.

引用本文的文献

1
Audio signal analysis using a modified forward-forward algorithm with enhanced segmentation for soil pest detection.使用改进的前向算法和增强分割进行音频信号分析以检测土壤害虫。
Sci Rep. 2025 Aug 27;15(1):31542. doi: 10.1038/s41598-025-15770-7.
2
Mixed T-domain and TF-domain Magnitude and Phase representations for GAN-based speech enhancement.基于生成对抗网络的语音增强的混合T域和TF域幅度与相位表示
Sci Rep. 2024 Jul 31;14(1):17698. doi: 10.1038/s41598-024-68708-w.
3
Improved tactile speech robustness to background noise with a dual-path recurrent neural network noise-reduction method.基于双路径递归神经网络降噪方法提高触觉语音对背景噪声的鲁棒性。
Sci Rep. 2024 Mar 28;14(1):7357. doi: 10.1038/s41598-024-57312-7.
4
Real-Time Multirate Multiband Amplification for Hearing Aids.用于助听器的实时多速率多频段放大
IEEE Access. 2022;10:54301-54312. doi: 10.1109/access.2022.3176368. Epub 2022 May 20.
5
Characterization of Deep Learning-Based Speech-Enhancement Techniques in Online Audio Processing Applications.基于深度学习的语音增强技术在在线音频处理应用中的特性分析。
Sensors (Basel). 2023 Apr 29;23(9):4394. doi: 10.3390/s23094394.
6
Multirate Audiometric Filter Bank for Hearing Aid Devices.用于助听器设备的多速率听力计滤波器组
Conf Rec Asilomar Conf Signals Syst Comput. 2021 Oct-Nov;2021:1436-1442. doi: 10.1109/IEEECONF53345.2021.9723257.
7
Message Passing-Based Inference for Time-Varying Autoregressive Models.基于消息传递的时变自回归模型推理
Entropy (Basel). 2021 May 28;23(6):683. doi: 10.3390/e23060683.
8
ALTIS: A new algorithm for adaptive long-term SNR estimation in multi-talker babble.ALTIS:一种用于多说话者嘈杂环境中自适应长期信噪比估计的新算法。
Comput Speech Lang. 2019 Nov;58:231-246. doi: 10.1016/j.csl.2019.05.001. Epub 2019 May 10.
9
A priori SNR estimation and noise estimation for speech enhancement.用于语音增强的先验信噪比估计和噪声估计。
EURASIP J Adv Signal Process. 2016;2016(1):101. doi: 10.1186/s13634-016-0398-z. Epub 2016 Sep 22.
10
Comparing Binaural Pre-processing Strategies II: Speech Intelligibility of Bilateral Cochlear Implant Users.双耳预处理策略比较II:双侧人工耳蜗使用者的言语可懂度
Trends Hear. 2015 Dec 30;19:2331216515617917. doi: 10.1177/2331216515617917.