基于深度学习的降噪方法以提高人工耳蜗植入者的言语可懂度

Deep Learning-Based Noise Reduction Approach to Improve Speech Intelligibility for Cochlear Implant Recipients.

作者信息

Lai Ying-Hui, Tsao Yu, Lu Xugang, Chen Fei, Su Yu-Ting, Chen Kuang-Chao, Chen Yu-Hsuan, Chen Li-Ching, Po-Hung Li Lieber, Lee Chin-Hui

机构信息

Department of Biomedical Engineering, National Yang-Ming University, Taipei, Taiwan.

Research Center for Information Technology Innovation, Academia Sinica, Taipei, Taiwan.

出版信息

Ear Hear. 2018 Jul/Aug;39(4):795-809. doi: 10.1097/AUD.0000000000000537.

DOI:10.1097/AUD.0000000000000537

PMID:29360687

Abstract

OBJECTIVE

We investigate the clinical effectiveness of a novel deep learning-based noise reduction (NR) approach under noisy conditions with challenging noise types at low signal to noise ratio (SNR) levels for Mandarin-speaking cochlear implant (CI) recipients.

DESIGN

The deep learning-based NR approach used in this study consists of two modules: noise classifier (NC) and deep denoising autoencoder (DDAE), thus termed (NC + DDAE). In a series of comprehensive experiments, we conduct qualitative and quantitative analyses on the NC module and the overall NC + DDAE approach. Moreover, we evaluate the speech recognition performance of the NC + DDAE NR and classical single-microphone NR approaches for Mandarin-speaking CI recipients under different noisy conditions. The testing set contains Mandarin sentences corrupted by two types of maskers, two-talker babble noise, and a construction jackhammer noise, at 0 and 5 dB SNR levels. Two conventional NR techniques and the proposed deep learning-based approach are used to process the noisy utterances. We qualitatively compare the NR approaches by the amplitude envelope and spectrogram plots of the processed utterances. Quantitative objective measures include (1) normalized covariance measure to test the intelligibility of the utterances processed by each of the NR approaches; and (2) speech recognition tests conducted by nine Mandarin-speaking CI recipients. These nine CI recipients use their own clinical speech processors during testing.

RESULTS

The experimental results of objective evaluation and listening test indicate that under challenging listening conditions, the proposed NC + DDAE NR approach yields higher intelligibility scores than the two compared classical NR techniques, under both matched and mismatched training-testing conditions.

CONCLUSIONS

When compared to the two well-known conventional NR techniques under challenging listening condition, the proposed NC + DDAE NR approach has superior noise suppression capabilities and gives less distortion for the key speech envelope information, thus, improving speech recognition more effectively for Mandarin CI recipients. The results suggest that the proposed deep learning-based NR approach can potentially be integrated into existing CI signal processors to overcome the degradation of speech perception caused by noise.

摘要

目的

我们研究了一种基于深度学习的新型降噪（NR）方法在噪声环境下对说普通话的人工耳蜗（CI）植入者的临床效果，该噪声环境具有挑战性的噪声类型且信噪比（SNR）较低。

设计

本研究中使用的基于深度学习的NR方法由两个模块组成：噪声分类器（NC）和深度去噪自动编码器（DDAE），因此称为（NC + DDAE）。在一系列综合实验中，我们对NC模块和整体NC + DDAE方法进行了定性和定量分析。此外，我们评估了NC + DDAE NR和经典单麦克风NR方法在不同噪声条件下对说普通话的CI植入者的语音识别性能。测试集包含在0和5 dB SNR水平下被两种类型的掩蔽噪声、双说话者嘈杂噪声和建筑风镐噪声破坏的普通话句子。使用两种传统的NR技术和所提出的基于深度学习的方法来处理有噪声的话语。我们通过处理后的话语的幅度包络和频谱图定性比较NR方法。定量客观指标包括：（1）归一化协方差度量，以测试每种NR方法处理的话语的可懂度；（2）由九名说普通话的CI植入者进行的语音识别测试。这九名CI植入者在测试期间使用他们自己的临床语音处理器。

结果

客观评估和听力测试的实验结果表明，在具有挑战性的听力条件下，所提出的NC + DDAE NR方法在匹配和不匹配的训练 - 测试条件下都比两种比较的经典NR技术产生更高的可懂度分数。

结论

与在具有挑战性的听力条件下的两种知名传统NR技术相比，所提出的NC + DDAE NR方法具有卓越的噪声抑制能力，并且对关键语音包络信息的失真更小，因此，能更有效地提高说普通话的CI植入者的语音识别能力。结果表明，所提出的基于深度学习的NR方法有可能集成到现有的CI信号处理器中，以克服噪声引起的语音感知退化。

相似文献

Deep Learning-Based Noise Reduction Approach to Improve Speech Intelligibility for Cochlear Implant Recipients.

Ear Hear. 2018 Jul/Aug;39(4):795-809. doi: 10.1097/AUD.0000000000000537.

A Deep Denoising Autoencoder Approach to Improving the Intelligibility of Vocoded Speech in Cochlear Implant Simulation.

IEEE Trans Biomed Eng. 2017 Jul;64(7):1568-1578. doi: 10.1109/TBME.2016.2613960. Epub 2016 Sep 27.

Evaluation of noise reduction methods for sentence recognition by Mandarin-speaking cochlear implant listeners.

Ear Hear. 2015 Jan;36(1):61-71. doi: 10.1097/AUD.0000000000000074.

Improved Environment-Aware-Based Noise Reduction System for Cochlear Implant Users Based on a Knowledge Transfer Approach: Development and Usability Study.

J Med Internet Res. 2021 Oct 28;23(10):e25460. doi: 10.2196/25460.

Evaluation of speech reception threshold in noise in young Cochlear™ Nucleus system 6 implant recipients using two different digital remote microphone technologies and a speech enhancement sound processing algorithm.

Int J Pediatr Otorhinolaryngol. 2017 Dec;103:71-75. doi: 10.1016/j.ijporl.2017.10.002. Epub 2017 Oct 5.

Combining directional microphone and single-channel noise reduction algorithms: a clinical evaluation in difficult listening conditions with cochlear implant users.

Ear Hear. 2012 Jul-Aug;33(4):e13-23. doi: 10.1097/AUD.0b013e31824b9e21.

Conversion of adult Nucleus® 5 cochlear implant users to the Nucleus® 6 system.

Cochlear Implants Int. 2015 Jul;16(4):222-32. doi: 10.1179/1754762814Y.0000000097. Epub 2014 Oct 6.

Early prelingual auditory development and speech perception at 1-year follow-up in Mandarin-speaking children after cochlear implantation.

Int J Pediatr Otorhinolaryngol. 2011 Nov;75(11):1418-26. doi: 10.1016/j.ijporl.2011.08.005. Epub 2011 Sep 3.

Results using the OPAL strategy in Mandarin speaking cochlear implant recipients.

Int J Audiol. 2017;56(sup2):S74-S85. doi: 10.1080/14992027.2016.1190872. Epub 2016 Jun 22.

Effects of Threshold Adjustment on Speech Perception in Nucleus Cochlear Implant Recipients.

Ear Hear. 2016 May-Jun;37(3):303-11. doi: 10.1097/AUD.0000000000000248.

引用本文的文献

Perfecting Sensory Restoration and the Unmet Need for Personalized Medicine in Cochlear Implant Users: A Narrative Review.

Brain Sci. 2025 May 1;15(5):479. doi: 10.3390/brainsci15050479.

Prediction of Auditory Performance in Cochlear Implants Using Machine Learning Methods: A Systematic Review.

Audiol Res. 2025 May 8;15(3):56. doi: 10.3390/audiolres15030056.

Artificial intelligence in otorhinolaryngology: current trends and application areas.

Eur Arch Otorhinolaryngol. 2025 May;282(5):2697-2707. doi: 10.1007/s00405-025-09272-5. Epub 2025 Feb 17.

Deep learning restores speech intelligibility in multi-talker interference for cochlear implant users.

Sci Rep. 2024 Jun 9;14(1):13241. doi: 10.1038/s41598-024-63675-8.

Progress made in the efficacy and viability of deep-learning-based noise reduction.

J Acoust Soc Am. 2023 May 1;153(5):2751. doi: 10.1121/10.0019341.

Translational Applications of Machine Learning in Auditory Electrophysiology.

Semin Hear. 2022 Oct 26;43(3):240-250. doi: 10.1055/s-0042-1756166. eCollection 2022 Aug.

Deep Learning-Based Speech Enhancement With a Loss Trading Off the Speech Distortion and the Noise Residue for Cochlear Implants.

Front Med (Lausanne). 2021 Nov 8;8:740123. doi: 10.3389/fmed.2021.740123. eCollection 2021.

Improved Environment-Aware-Based Noise Reduction System for Cochlear Implant Users Based on a Knowledge Transfer Approach: Development and Usability Study.

J Med Internet Res. 2021 Oct 28;23(10):e25460. doi: 10.2196/25460.

Cochlear Implant Research and Development in the Twenty-first Century: A Critical Update.

J Assoc Res Otolaryngol. 2021 Oct;22(5):481-508. doi: 10.1007/s10162-021-00811-5. Epub 2021 Aug 25.

Recent Advances in the Application of Artificial Intelligence in Otorhinolaryngology-Head and Neck Surgery.

Clin Exp Otorhinolaryngol. 2020 Nov;13(4):326-339. doi: 10.21053/ceo.2020.00654. Epub 2020 Jun 18.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于深度学习的降噪方法以提高人工耳蜗植入者的言语可懂度

Deep Learning-Based Noise Reduction Approach to Improve Speech Intelligibility for Cochlear Implant Recipients.

作者信息

机构信息

出版信息

OBJECTIVE

DESIGN

RESULTS

CONCLUSIONS

目的

设计

结果

结论

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献