• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用递归神经网络提高人工耳蜗使用者在非平稳噪声环境下的语音感知能力。

Using recurrent neural networks to improve the perception of speech in non-stationary noise by people with cochlear implants.

机构信息

Medical Research Council Cognition and Brain Sciences Unit, University of Cambridge, 15 Chaucer Road, Cambridge CB2 7EF, United Kingdom.

Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom.

出版信息

J Acoust Soc Am. 2019 Jul;146(1):705. doi: 10.1121/1.5119226.

DOI:10.1121/1.5119226
PMID:31370586
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6773603/
Abstract

Speech-in-noise perception is a major problem for users of cochlear implants (CIs), especially with non-stationary background noise. Noise-reduction algorithms have produced benefits but relied on a priori information about the target speaker and/or background noise. A recurrent neural network (RNN) algorithm was developed for enhancing speech in non-stationary noise and its benefits were evaluated for speech perception, using both objective measures and experiments with CI simulations and CI users. The RNN was trained using speech from many talkers mixed with multi-talker or traffic noise recordings. Its performance was evaluated using speech from an unseen talker mixed with different noise recordings of the same class, either babble or traffic noise. Objective measures indicated benefits of using a recurrent over a feed-forward architecture, and predicted better speech intelligibility with than without the processing. The experimental results showed significantly improved intelligibility of speech in babble noise but not in traffic noise. CI subjects rated the processed stimuli as significantly better in terms of speech distortions, noise intrusiveness, and overall quality than unprocessed stimuli for both babble and traffic noise. These results extend previous findings for CI users to mostly unseen acoustic conditions with non-stationary noise.

摘要

噪声环境下言语感知是人工耳蜗使用者面临的一个主要问题,尤其是在非平稳背景噪声下。降噪算法已取得了一定的成效,但仍依赖于关于目标说话人以及/或背景噪声的先验信息。本文提出了一种用于增强非平稳噪声下言语感知的递归神经网络算法,并通过 CI 仿真和 CI 用户实验,从客观评估和主观评估两个方面对其性能进行了评估。该 RNN 算法通过将来自多位说话人的语音与多人语音或交通噪声录音相混合进行训练。它的性能是通过使用来自未知说话人的语音与相同类型的不同噪声录音(例如环境噪声或交通噪声)相混合进行评估的。客观评估结果表明,相较于前馈架构,递归架构具有一定优势,并且经过处理后的语音的可懂度明显高于未经处理的语音。实验结果表明,在环境噪声下,该算法可显著提高语音的可懂度,但在交通噪声下效果不明显。CI 受试者认为,相较于未经处理的刺激,经过处理的刺激在语音失真、噪声干扰和整体质量方面的评分明显更高,无论是环境噪声还是交通噪声下均是如此。这些结果将之前针对 CI 用户的研究结果扩展到了具有非平稳噪声的大部分未听过的声学条件下。

相似文献

1
Using recurrent neural networks to improve the perception of speech in non-stationary noise by people with cochlear implants.利用递归神经网络提高人工耳蜗使用者在非平稳噪声环境下的语音感知能力。
J Acoust Soc Am. 2019 Jul;146(1):705. doi: 10.1121/1.5119226.
2
Speech enhancement based on neural networks improves speech intelligibility in noise for cochlear implant users.基于神经网络的语音增强技术可提高人工耳蜗使用者在噪声环境中的语音清晰度。
Hear Res. 2017 Feb;344:183-194. doi: 10.1016/j.heares.2016.11.012. Epub 2016 Nov 30.
3
Deep learning restores speech intelligibility in multi-talker interference for cochlear implant users.深度学习恢复多说话人干扰下人工耳蜗使用者的言语可懂度。
Sci Rep. 2024 Jun 9;14(1):13241. doi: 10.1038/s41598-024-63675-8.
4
Comparison of effects on subjective intelligibility and quality of speech in babble for two algorithms: A deep recurrent neural network and spectral subtraction.两种算法:深度递归神经网络和谱减法在噪声环境下对主观可懂度和语音质量影响的比较。
J Acoust Soc Am. 2019 Mar;145(3):1493. doi: 10.1121/1.5094765.
5
Sound Localization and Speech Perception in Noise of Pediatric Cochlear Implant Recipients: Bimodal Fitting Versus Bilateral Cochlear Implants.儿童人工耳蜗植入者的声音定位和噪声下言语感知:双模式适配与双侧人工耳蜗。
Ear Hear. 2017 Jul/Aug;38(4):426-440. doi: 10.1097/AUD.0000000000000401.
6
Effect of Spectral Contrast Enhancement on Speech-on-Speech Intelligibility and Voice Cue Sensitivity in Cochlear Implant Users.频谱对比度增强对人工耳蜗使用者语音对语音可懂度及语音线索敏感性的影响。
Ear Hear. 2021 Mar/Apr;42(2):271-289. doi: 10.1097/AUD.0000000000000936.
7
Perceptual adaptation and intelligibility of multiple talkers for two types of degraded speech.多说话人感知适应和两种类型语音失真的可懂度。
J Acoust Soc Am. 2009 Nov;126(5):2660-9. doi: 10.1121/1.3212930.
8
Perception of speech produced by native and nonnative talkers by listeners with normal hearing and listeners with cochlear implants.正常听力者和人工耳蜗植入者对母语者和非母语者说话者所产生的语音感知。
J Speech Lang Hear Res. 2014 Apr 1;57(2):532-54. doi: 10.1044/2014_JSLHR-H-12-0404.
9
Use of a Deep Recurrent Neural Network to Reduce Wind Noise: Effects on Judged Speech Intelligibility and Sound Quality.使用深度递归神经网络降低风噪声:对感知语音清晰度和声音质量的影响。
Trends Hear. 2018 Jan-Dec;22:2331216518770964. doi: 10.1177/2331216518770964.
10
Differences in neural encoding of speech in noise between cochlear implant users with and without preserved acoustic hearing.噪声环境下人工耳蜗植入使用者与未保留残余听力者言语神经编码差异。
Hear Res. 2023 Jan;427:108649. doi: 10.1016/j.heares.2022.108649. Epub 2022 Nov 13.

引用本文的文献

1
An Artificial Intelligence-Based Non-Invasive Approach for Cardiovascular Disease Risk Stratification in Obstructive Sleep Apnea Patients: A Narrative Review.一种基于人工智能的阻塞性睡眠呼吸暂停患者心血管疾病风险分层的非侵入性方法:叙述性综述。
Rev Cardiovasc Med. 2024 Dec 28;25(12):463. doi: 10.31083/j.rcm2512463. eCollection 2024 Dec.
2
Impact of Mask Type as Training Target for Speech Intelligibility and Quality in Cochlear-Implant Noise Reduction.口罩类型对人工耳蜗降噪言语可懂度和质量训练目标的影响。
Sensors (Basel). 2024 Oct 14;24(20):6614. doi: 10.3390/s24206614.
3
Neural Decoding of the Speech Envelope: Effects of Intelligibility and Spectral Degradation.语音包络的神经解码:可懂度和频谱劣化的影响。
Trends Hear. 2024 Jan-Dec;28:23312165241266316. doi: 10.1177/23312165241266316.
4
UltraAIGenomics: Artificial Intelligence-Based Cardiovascular Disease Risk Assessment by Fusion of Ultrasound-Based Radiomics and Genomics Features for Preventive, Personalized and Precision Medicine: A Narrative Review.超人工智能基因组学:基于超声影像组学和基因组学特征融合的人工智能心血管疾病风险评估用于预防、个性化和精准医学:一篇综述
Rev Cardiovasc Med. 2024 May 22;25(5):184. doi: 10.31083/j.rcm2505184. eCollection 2024 May.
5
Deep learning restores speech intelligibility in multi-talker interference for cochlear implant users.深度学习恢复多说话人干扰下人工耳蜗使用者的言语可懂度。
Sci Rep. 2024 Jun 9;14(1):13241. doi: 10.1038/s41598-024-63675-8.
6
Improved tactile speech robustness to background noise with a dual-path recurrent neural network noise-reduction method.基于双路径递归神经网络降噪方法提高触觉语音对背景噪声的鲁棒性。
Sci Rep. 2024 Mar 28;14(1):7357. doi: 10.1038/s41598-024-57312-7.
7
Objective intelligibility measurement of reverberant vocoded speech for normal-hearing listeners: Towards facilitating the development of speech enhancement algorithms for cochlear implants.为正常听力听众测量混响语音编码语音的客观可懂度:促进人工耳蜗语音增强算法的发展。
J Acoust Soc Am. 2024 Mar 1;155(3):2151-2168. doi: 10.1121/10.0025285.
8
Improved tactile speech perception using audio-to-tactile sensory substitution with formant frequency focusing.使用具有共振峰频率聚焦的声触觉感觉替代来提高触觉语音感知。
Sci Rep. 2024 Feb 28;14(1):4889. doi: 10.1038/s41598-024-55429-3.
9
Suppressing reverberation in cochlear implant stimulus patterns using time-frequency masks based on phoneme groups.基于音素组的时频掩码抑制人工耳蜗刺激模式中的混响。
Proc Meet Acoust. 2022 Dec;50(1). doi: 10.1121/2.0001698. Epub 2023 Feb 13.
10
Experimental Investigation of Acoustic Features to Optimize Intelligibility in Cochlear Implants.实验研究优化人工耳蜗植入中可懂度的声学特征。
Sensors (Basel). 2023 Aug 31;23(17):7553. doi: 10.3390/s23177553.

本文引用的文献

1
Comparison of effects on subjective intelligibility and quality of speech in babble for two algorithms: A deep recurrent neural network and spectral subtraction.两种算法:深度递归神经网络和谱减法在噪声环境下对主观可懂度和语音质量影响的比较。
J Acoust Soc Am. 2019 Mar;145(3):1493. doi: 10.1121/1.5094765.
2
A deep learning algorithm to increase intelligibility for hearing-impaired listeners in the presence of a competing talker and reverberation.一种深度学习算法,用于在存在竞争说话者和混响的情况下提高听力障碍者的可理解度。
J Acoust Soc Am. 2019 Mar;145(3):1378. doi: 10.1121/1.5093547.
3
Vibro-Tactile Enhancement of Speech Intelligibility in Multi-talker Noise for Simulated Cochlear Implant Listening.多说话人噪声环境下模拟人工耳蜗助听中语音可懂度的振动触觉增强。
Trends Hear. 2018 Jan-Dec;22:2331216518797838. doi: 10.1177/2331216518797838.
4
Improving competing voices segregation for hearing impaired listeners using a low-latency deep neural network algorithm.利用低延迟深度神经网络算法提高听力障碍者的竞争声音分离。
J Acoust Soc Am. 2018 Jul;144(1):172. doi: 10.1121/1.5045322.
5
Speech Understanding With Various Maskers in Cochlear-Implant and Simulated Cochlear-Implant Hearing: Effects of Spectral Resolution and Implications for Masking Release.不同掩蔽噪声下人工耳蜗及模拟人工耳蜗助听后的言语理解:频谱分辨率的影响及掩蔽释放的意义
Trends Hear. 2018 Jan-Dec;22:2331216518787276. doi: 10.1177/2331216518787276.
6
The benefit of combining a deep neural network architecture with ideal ratio mask estimation in computational speech segregation to improve speech intelligibility.将深度神经网络架构与理想比率掩蔽估计相结合,以提高语音可懂度,从而在计算语音分离中获益。
PLoS One. 2018 May 15;13(5):e0196924. doi: 10.1371/journal.pone.0196924. eCollection 2018.
7
Use of a Deep Recurrent Neural Network to Reduce Wind Noise: Effects on Judged Speech Intelligibility and Sound Quality.使用深度递归神经网络降低风噪声:对感知语音清晰度和声音质量的影响。
Trends Hear. 2018 Jan-Dec;22:2331216518770964. doi: 10.1177/2331216518770964.
8
Deep Learning-Based Noise Reduction Approach to Improve Speech Intelligibility for Cochlear Implant Recipients.基于深度学习的降噪方法以提高人工耳蜗植入者的言语可懂度
Ear Hear. 2018 Jul/Aug;39(4):795-809. doi: 10.1097/AUD.0000000000000537.
9
Cochlear implant simulator with independent representation of the full spiral ganglion.具有完整螺旋神经节独立表示的人工耳蜗模拟器。
J Acoust Soc Am. 2017 Nov;142(5):EL484. doi: 10.1121/1.5009602.
10
Tolerable delay for speech production and perception: effects of hearing ability and experience with hearing aids.言语产生与感知的可耐受延迟:听力能力及使用助听器经验的影响
Int J Audiol. 2018 Jan;57(1):61-68. doi: 10.1080/14992027.2017.1367848. Epub 2017 Aug 24.