• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

两种算法:深度递归神经网络和谱减法在噪声环境下对主观可懂度和语音质量影响的比较。

Comparison of effects on subjective intelligibility and quality of speech in babble for two algorithms: A deep recurrent neural network and spectral subtraction.

机构信息

Department of Psychology, University of Cambridge, Cambridge, United Kingdom.

MRC Cognition and Brain Sciences Unit, University of Cambridge, Cambridge, United Kingdom.

出版信息

J Acoust Soc Am. 2019 Mar;145(3):1493. doi: 10.1121/1.5094765.

DOI:10.1121/1.5094765
PMID:31067946
Abstract

The effects on speech intelligibility and sound quality of two noise-reduction algorithms were compared: a deep recurrent neural network (RNN) and spectral subtraction (SS). The RNN was trained using sentences spoken by a large number of talkers with a variety of accents, presented in babble. Different talkers were used for testing. Participants with mild-to-moderate hearing loss were tested. Stimuli were given frequency-dependent linear amplification to compensate for the individual hearing losses. A paired-comparison procedure was used to compare all possible combinations of three conditions. The conditions were: speech in babble with no processing (NP) or processed using the RNN or SS. In each trial, the same sentence was played twice using two different conditions. The participants indicated which one was better and by how much in terms of speech intelligibility and (in separate blocks) sound quality. Processing using the RNN was significantly preferred over NP and over SS processing for both subjective intelligibility and sound quality, although the magnitude of the preferences was small. SS processing was not significantly preferred over NP for either subjective intelligibility or sound quality. Objective computational measures of speech intelligibility predicted better intelligibility for RNN than for SS or NP.

摘要

比较了两种降噪算法(深度递归神经网络(RNN)和谱减法(SS))对语音可懂度和音质的影响:RNN 使用大量具有各种口音的说话者在嘈杂环境中所说的句子进行训练,然后对不同的说话者进行测试。参与者的听力损失为轻度至中度。使用频率相关的线性放大来补偿个体听力损失。使用配对比较程序比较了三种条件的所有可能组合。条件是:嘈杂环境中未经处理(NP)或使用 RNN 或 SS 处理的语音。在每次试验中,使用两种不同的条件播放同一个句子两次。参与者根据语音可懂度(在单独的块中)和音质来表示哪一个更好,以及好多少。与 NP 和 SS 处理相比,使用 RNN 处理在主观可懂度和音质方面都明显更受欢迎,尽管偏好程度较小。SS 处理在主观可懂度或音质方面均不明显优于 NP。语音可懂度的客观计算测量表明,RNN 的可懂度优于 SS 或 NP。

相似文献

1
Comparison of effects on subjective intelligibility and quality of speech in babble for two algorithms: A deep recurrent neural network and spectral subtraction.两种算法:深度递归神经网络和谱减法在噪声环境下对主观可懂度和语音质量影响的比较。
J Acoust Soc Am. 2019 Mar;145(3):1493. doi: 10.1121/1.5094765.
2
Transient Noise Reduction Using a Deep Recurrent Neural Network: Effects on Subjective Speech Intelligibility and Listening Comfort.使用深度循环神经网络降低瞬态噪声:对主观语音清晰度和聆听舒适度的影响。
Trends Hear. 2021 Jan-Dec;25:23312165211041475. doi: 10.1177/23312165211041475.
3
Use of a Deep Recurrent Neural Network to Reduce Wind Noise: Effects on Judged Speech Intelligibility and Sound Quality.使用深度递归神经网络降低风噪声:对感知语音清晰度和声音质量的影响。
Trends Hear. 2018 Jan-Dec;22:2331216518770964. doi: 10.1177/2331216518770964.
4
Relationship Among Signal Fidelity, Hearing Loss, and Working Memory for Digital Noise Suppression.数字噪声抑制中信号保真度、听力损失与工作记忆之间的关系。
Ear Hear. 2015 Sep-Oct;36(5):505-16. doi: 10.1097/AUD.0000000000000173.
5
Effects of noise, nonlinear processing, and linear filtering on perceived speech quality.噪声、非线性处理和线性滤波对感知语音质量的影响。
Ear Hear. 2010 Jun;31(3):420-36. doi: 10.1097/AUD.0b013e3181d3d4f3.
6
Large-scale training to increase speech intelligibility for hearing-impaired listeners in novel noises.大规模训练以提高听力受损者在新型噪声环境下的言语可懂度。
J Acoust Soc Am. 2016 May;139(5):2604. doi: 10.1121/1.4948445.
7
Working memory, age, and hearing loss: susceptibility to hearing aid distortion.工作记忆、年龄与听力损失:助听器失真的易感性。
Ear Hear. 2013 May-Jun;34(3):251-60. doi: 10.1097/AUD.0b013e318271aa5e.
8
Effects of transient noise reduction algorithms on speech intelligibility and ratings of hearing aid users.瞬态降噪算法对助听器使用者言语可懂度及评分的影响。
Am J Audiol. 2011 Dec;20(2):140-50. doi: 10.1044/1059-0889(2011/10-0007). Epub 2011 Sep 22.
9
A comparison of speech intelligibility and subjective quality with hearing-aid processing in older adults with hearing loss.老年听力损失患者使用助听器处理后的言语可懂度和主观质量比较。
Int J Audiol. 2022 Jan;61(1):46-58. doi: 10.1080/14992027.2021.1900609. Epub 2021 Apr 29.
10
Binaural speech intelligibility for combinations of noise, reverberation, and hearing-aid signal processing.噪声、混响和助听器信号处理组合下的双耳语音可懂度。
PLoS One. 2025 Jan 15;20(1):e0317266. doi: 10.1371/journal.pone.0317266. eCollection 2025.

引用本文的文献

1
Improved tactile speech robustness to background noise with a dual-path recurrent neural network noise-reduction method.基于双路径递归神经网络降噪方法提高触觉语音对背景噪声的鲁棒性。
Sci Rep. 2024 Mar 28;14(1):7357. doi: 10.1038/s41598-024-57312-7.
2
Progress made in the efficacy and viability of deep-learning-based noise reduction.基于深度学习的降噪功效和可行性的进展。
J Acoust Soc Am. 2023 May 1;153(5):2751. doi: 10.1121/10.0019341.
3
Listening to Music Through Hearing Aids: Potential Lessons for Cochlear Implants.通过助听器听音乐:对人工耳蜗植入的潜在启示。
Trends Hear. 2022 Jan-Dec;26:23312165211072969. doi: 10.1177/23312165211072969.
4
A causal and talker-independent speaker separation/dereverberation deep learning algorithm: Cost associated with conversion to real-time capable operation.一种因果且与说话人无关的说话人分离/去混响深度学习算法:转换为实时运行能力的相关成本。
J Acoust Soc Am. 2021 Nov;150(5):3976. doi: 10.1121/10.0007134.
5
Deep learning based speaker separation and dereverberation can generalize across different languages to improve intelligibility.基于深度学习的说话人分离和去混响可以跨不同语言进行泛化,以提高可懂度。
J Acoust Soc Am. 2021 Oct;150(4):2526. doi: 10.1121/10.0006565.
6
Transient Noise Reduction Using a Deep Recurrent Neural Network: Effects on Subjective Speech Intelligibility and Listening Comfort.使用深度循环神经网络降低瞬态噪声:对主观语音清晰度和聆听舒适度的影响。
Trends Hear. 2021 Jan-Dec;25:23312165211041475. doi: 10.1177/23312165211041475.
7
An effectively causal deep learning algorithm to increase intelligibility in untrained noises for hearing-impaired listeners.一种有效的因果深度学习算法,用于提高听力受损听众在未经训练的噪声中的可理解度。
J Acoust Soc Am. 2021 Jun;149(6):3943. doi: 10.1121/10.0005089.
8
Electro-Haptic Stimulation: A New Approach for Improving Cochlear-Implant Listening.电触觉刺激:一种改善人工耳蜗听力的新方法。
Front Neurosci. 2021 Jun 9;15:581414. doi: 10.3389/fnins.2021.581414. eCollection 2021.
9
A talker-independent deep learning algorithm to increase intelligibility for hearing-impaired listeners in reverberant competing talker conditions.一种独立于说话者的深度学习算法,用于在混响的竞争性说话者环境中提高听力受损听众的可懂度。
J Acoust Soc Am. 2020 Jun;147(6):4106. doi: 10.1121/10.0001441.
10
Using recurrent neural networks to improve the perception of speech in non-stationary noise by people with cochlear implants.利用递归神经网络提高人工耳蜗使用者在非平稳噪声环境下的语音感知能力。
J Acoust Soc Am. 2019 Jul;146(1):705. doi: 10.1121/1.5119226.