• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于典范相关分析和深度学习的新型语音可懂度增强模型。

A Novel Speech Intelligibility Enhancement Model based on Canonical Correlation and Deep Learning.

出版信息

Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:2581-2584. doi: 10.1109/EMBC48229.2022.9871113.

DOI:10.1109/EMBC48229.2022.9871113
PMID:36085897
Abstract

Current deep learning (DL) based approaches to speech intelligibility enhancement in noisy environments are often trained to minimise the feature distance between noise-free speech and enhanced speech signals. Despite improving the speech quality, such approaches do not deliver required levels of speech intelligibility in everyday noisy environments. Intelligibility-oriented (I-O) loss functions have recently been developed to train DL approaches for robust speech enhancement. Here, we formulate, for the first time, a novel canonical correlation based I-O loss function to more effectively train DL algorithms. Specifically, we present a canonical-correlation based short-time objective intelligibility (CC-STOI) cost function to train a fully convolutional neural network (FCN) model. We carry out comparative simulation experiments to show that our CC-STOI based speech enhancement framework outperforms state-of-the-art DL models trained with conventional distance-based and STOI-based loss functions, using objective and subjective evaluation measures for case of both unseen speakers and noises. Ongoing future work is evaluating the proposed approach for design of robust hearing-assistive technology.

摘要

目前,基于深度学习(DL)的在噪声环境下增强语音可懂度的方法通常经过训练,可以将无噪声语音和增强后的语音信号之间的特征距离最小化。尽管这些方法提高了语音质量,但在日常嘈杂环境中,它们并不能提供所需的语音可懂度水平。最近,人们开发了面向可懂度的(I-O)损失函数,以训练用于鲁棒语音增强的 DL 方法。在这里,我们首次提出了一种新的基于典型相关的 I-O 损失函数,以更有效地训练 DL 算法。具体来说,我们提出了一种基于典型相关的短时客观可懂度(CC-STOI)代价函数,用于训练全卷积神经网络(FCN)模型。我们进行了比较模拟实验,结果表明,我们的基于 CC-STOI 的语音增强框架在使用客观和主观评估措施的情况下,在看不见的说话者和噪声的情况下,都优于使用传统基于距离和 STOI 的损失函数训练的最先进的 DL 模型。正在进行的未来工作是评估该方法在设计鲁棒性助听技术中的应用。

相似文献

1
A Novel Speech Intelligibility Enhancement Model based on Canonical Correlation and Deep Learning.基于典范相关分析和深度学习的新型语音可懂度增强模型。
Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:2581-2584. doi: 10.1109/EMBC48229.2022.9871113.
2
Using deep learning to improve the intelligibility of a target speaker in noisy multi-talker environments for people with normal hearing and hearing loss.利用深度学习提高正常听力和听力损失人群在嘈杂多说话人环境中目标说话人的可懂度。
J Acoust Soc Am. 2024 Jul 1;156(1):706-724. doi: 10.1121/10.0028007.
3
Improving the Intelligibility of Speech for Simulated Electric and Acoustic Stimulation Using Fully Convolutional Neural Networks.利用全卷积神经网络提高电刺激和声刺激模拟语音的可懂度。
IEEE Trans Neural Syst Rehabil Eng. 2021;29:184-195. doi: 10.1109/TNSRE.2020.3042655. Epub 2021 Feb 26.
4
Experimental Investigation of Acoustic Features to Optimize Intelligibility in Cochlear Implants.实验研究优化人工耳蜗植入中可懂度的声学特征。
Sensors (Basel). 2023 Aug 31;23(17):7553. doi: 10.3390/s23177553.
5
An effectively causal deep learning algorithm to increase intelligibility in untrained noises for hearing-impaired listeners.一种有效的因果深度学习算法,用于提高听力受损听众在未经训练的噪声中的可理解度。
J Acoust Soc Am. 2021 Jun;149(6):3943. doi: 10.1121/10.0005089.
6
Joint Dictionary Learning-Based Non-Negative Matrix Factorization for Voice Conversion to Improve Speech Intelligibility After Oral Surgery.基于联合字典学习的非负矩阵分解用于口腔手术后语音转换以提高语音清晰度
IEEE Trans Biomed Eng. 2017 Nov;64(11):2584-2594. doi: 10.1109/TBME.2016.2644258.
7
Large-scale training to increase speech intelligibility for hearing-impaired listeners in novel noises.大规模训练以提高听力受损者在新型噪声环境下的言语可懂度。
J Acoust Soc Am. 2016 May;139(5):2604. doi: 10.1121/1.4948445.
8
Restoring speech intelligibility for hearing aid users with deep learning.基于深度学习的助听用户语音可懂度恢复。
Sci Rep. 2023 Feb 15;13(1):2719. doi: 10.1038/s41598-023-29871-8.
9
Deep causal speech enhancement and recognition using efficient long-short term memory Recurrent Neural Network.利用高效长短时记忆递归神经网络进行深度因果语音增强和识别。
PLoS One. 2024 Jan 3;19(1):e0291240. doi: 10.1371/journal.pone.0291240. eCollection 2024.
10
Improving the performance of hearing aids in noisy environments based on deep learning technology.基于深度学习技术提高助听器在嘈杂环境中的性能。
Annu Int Conf IEEE Eng Med Biol Soc. 2018 Jul;2018:404-408. doi: 10.1109/EMBC.2018.8512277.