• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种理想的压缩式面罩,用于在不牺牲环境声音识别能力的情况下提高语音清晰度a)。

An ideal compressed mask for increasing speech intelligibility without sacrificing environmental sound recognitiona).

作者信息

Johnson Eric M, Healy Eric W

机构信息

Department of Speech and Hearing Science, and Center for Cognitive and Brain Sciences, The Ohio State University, Columbus, Ohio 43210, USA.

出版信息

J Acoust Soc Am. 2024 Dec 1;156(6):3958-3969. doi: 10.1121/10.0034599.

DOI:10.1121/10.0034599
PMID:39666959
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11646135/
Abstract

Hearing impairment is often characterized by poor speech-in-noise recognition. State-of-the-art laboratory-based noise-reduction technology can eliminate background sounds from a corrupted speech signal and improve intelligibility, but it can also hinder environmental sound recognition (ESR), which is essential for personal independence and safety. This paper presents a time-frequency mask, the ideal compressed mask (ICM), that aims to provide listeners with improved speech intelligibility without substantially reducing ESR. This is accomplished by limiting the maximum attenuation that the mask performs. Speech intelligibility and ESR for hearing-impaired and normal-hearing listeners were measured using stimuli that had been processed by ICMs with various levels of maximum attenuation. This processing resulted in significantly improved intelligibility while retaining high ESR performance for both types of listeners. It was also found that the same level of maximum attenuation provided the optimal balance of intelligibility and ESR for both listener types. It is argued that future deep-learning-based noise reduction algorithms may provide better outcomes by balancing the levels of the target speech and the background environmental sounds, rather than eliminating all signals except for the target speech. The ICM provides one such simple solution for frequency-domain models.

摘要

听力障碍通常表现为语音噪声识别能力差。基于实验室的先进降噪技术可以从受损语音信号中消除背景声音并提高可懂度,但它也可能阻碍环境声音识别(ESR),而环境声音识别对于个人独立和安全至关重要。本文提出了一种时频掩膜,即理想压缩掩膜(ICM),其目的是在不大幅降低ESR的情况下提高听众的语音可懂度。这是通过限制掩膜执行的最大衰减来实现的。使用经过具有不同最大衰减水平的ICM处理的刺激来测量听力受损和听力正常听众的语音可懂度和ESR。这种处理显著提高了可懂度,同时两种类型的听众都保持了较高的ESR性能。还发现相同水平的最大衰减为两种听众类型提供了可懂度和ESR的最佳平衡。有人认为,未来基于深度学习的降噪算法可能通过平衡目标语音和背景环境声音的水平,而不是消除除目标语音之外的所有信号,来提供更好的结果。ICM为频域模型提供了这样一种简单的解决方案。

相似文献

1
An ideal compressed mask for increasing speech intelligibility without sacrificing environmental sound recognitiona).一种理想的压缩式面罩,用于在不牺牲环境声音识别能力的情况下提高语音清晰度a)。
J Acoust Soc Am. 2024 Dec 1;156(6):3958-3969. doi: 10.1121/10.0034599.
2
Testing the role of temporal coherence on speech intelligibility with noise and single-talker maskers.测试时间相干性在噪声和单说话人掩蔽下语音可懂度中的作用。
J Acoust Soc Am. 2024 Nov 1;156(5):3285-3297. doi: 10.1121/10.0034420.
3
Effects of Masker Intelligibility and Talker Sex on Speech-in-Speech Recognition by Mandarin Speakers Across the Lifespan.掩蔽音清晰度和说话者性别对各年龄段普通话使用者的语音中语音识别的影响。
Ear Hear. 2025;46(4):1085-1094. doi: 10.1097/AUD.0000000000001655. Epub 2025 Mar 18.
4
Hearing Instruments for Unilateral Severe-to-Profound Sensorineural Hearing Loss in Adults: A Systematic Review and Meta-Analysis.成人单侧重度至极重度感音神经性听力损失的听力仪器:系统评价与荟萃分析
Ear Hear. 2016 Sep-Oct;37(5):495-507. doi: 10.1097/AUD.0000000000000313.
5
High-arousal emotional speech enhances speech intelligibility and emotion recognition in noise.高唤醒度情感语音可提高噪声环境下的语音清晰度和情感识别能力。
J Acoust Soc Am. 2025 Jun 1;157(6):4085-4096. doi: 10.1121/10.0036812.
6
Bilateral versus unilateral hearing aids for bilateral hearing impairment in adults.成人双侧听力障碍使用双侧助听器与单侧助听器的比较。
Cochrane Database Syst Rev. 2017 Dec 19;12(12):CD012665. doi: 10.1002/14651858.CD012665.pub2.
7
The Roles of Selective Attention and Asymmetric Experience in Bilateral Speech Interference for Single-Sided Deafness Cochlear Implant and Vocoder Listeners.选择性注意和不对称经验在单侧耳聋人工耳蜗和语音编码听众双侧言语干扰中的作用。
Ear Hear. 2025 Jun 19. doi: 10.1097/AUD.0000000000001687.
8
Interventions to prevent occupational noise-induced hearing loss.预防职业性噪声性听力损失的干预措施。
Cochrane Database Syst Rev. 2017 Jul 7;7(7):CD006396. doi: 10.1002/14651858.CD006396.pub4.
9
Antidepressants for pain management in adults with chronic pain: a network meta-analysis.抗抑郁药治疗成人慢性疼痛的疼痛管理:一项网络荟萃分析。
Health Technol Assess. 2024 Oct;28(62):1-155. doi: 10.3310/MKRT2948.
10
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

本文引用的文献

1
The Optimal Speech-to-Background Ratio for Balancing Speech Recognition With Environmental Sound Recognition.在平衡语音识别和环境声音识别时的最佳语音与背景噪声比。
Ear Hear. 2024;45(6):1444-1460. doi: 10.1097/AUD.0000000000001532. Epub 2024 May 31.
2
Progress made in the efficacy and viability of deep-learning-based noise reduction.基于深度学习的降噪功效和可行性的进展。
J Acoust Soc Am. 2023 May 1;153(5):2751. doi: 10.1121/10.0019341.
3
The Application of Time-Frequency Masking To Improve Intelligibility of Dysarthric Speech in Background Noise.时频掩蔽在背景噪声下改善构音障碍语音可懂度的应用。
J Speech Lang Hear Res. 2023 May 9;66(5):1853-1866. doi: 10.1044/2023_JSLHR-22-00558. Epub 2023 Mar 21.
4
Restoring speech intelligibility for hearing aid users with deep learning.基于深度学习的助听用户语音可懂度恢复。
Sci Rep. 2023 Feb 15;13(1):2719. doi: 10.1038/s41598-023-29871-8.
5
A talker-independent deep learning algorithm to increase intelligibility for hearing-impaired listeners in reverberant competing talker conditions.一种独立于说话者的深度学习算法,用于在混响的竞争性说话者环境中提高听力受损听众的可懂度。
J Acoust Soc Am. 2020 Jun;147(6):4106. doi: 10.1121/10.0001441.
6
The optimal threshold for removing noise from speech is similar across normal and impaired hearing-a time-frequency masking study.从语音中去除噪声的最佳阈值在正常和听力障碍人群中相似——时频掩蔽研究。
J Acoust Soc Am. 2019 Jun;145(6):EL581. doi: 10.1121/1.5112828.
7
A deep learning algorithm to increase intelligibility for hearing-impaired listeners in the presence of a competing talker and reverberation.一种深度学习算法,用于在存在竞争说话者和混响的情况下提高听力障碍者的可理解度。
J Acoust Soc Am. 2019 Mar;145(3):1378. doi: 10.1121/1.5093547.
8
An ideal quantized mask to increase intelligibility and quality of speech in noise.一种理想的量化掩蔽,可提高噪声中的语音可懂度和质量。
J Acoust Soc Am. 2018 Sep;144(3):1392. doi: 10.1121/1.5053115.
9
A deep learning based segregation algorithm to increase speech intelligibility for hearing-impaired listeners in reverberant-noisy conditions.基于深度学习的分割算法,可提高在混响噪声环境下听力障碍者的语音可懂度。
J Acoust Soc Am. 2018 Sep;144(3):1627. doi: 10.1121/1.5055562.
10
Hearing Difficulty Is Associated With Injuries Requiring Medical Care.听力困难与需要医疗护理的损伤有关。
Ear Hear. 2018 Jul/Aug;39(4):631-644. doi: 10.1097/AUD.0000000000000535.