• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

采用波束形成与掩蔽信息语音增强相结合的助听器处理方案进行语音识别。

Speech recognition with a hearing-aid processing scheme combining beamforming with mask-informed speech enhancement.

机构信息

Department of Speech, Hearing and Phonetic Sciences, 4919UCL, London, UK.

Department of Electrical and Electronic Engineering, 4615Imperial College, London, UK.

出版信息

Trends Hear. 2022 Jan-Dec;26:23312165211068629. doi: 10.1177/23312165211068629.

DOI:10.1177/23312165211068629
PMID:34985356
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8744079/
Abstract

A signal processing approach combining beamforming with mask-informed speech enhancement was assessed by measuring sentence recognition in listeners with mild-to-moderate hearing impairment in adverse listening conditions that simulated the output of behind-the-ear hearing aids in a noisy classroom. Two types of beamforming were compared: binaural, with the two microphones of each aid treated as a single array, and bilateral, where independent left and right beamformers were derived. Binaural beamforming produces a narrower beam, maximising improvement in signal-to-noise ratio (SNR), but eliminates the spatial diversity that is preserved in bilateral beamforming. Each beamformer type was optimised for the true target position and implemented with and without additional speech enhancement in which spectral features extracted from the beamformer output were passed to a deep neural network trained to identify time-frequency regions dominated by target speech. Additional conditions comprising binaural beamforming combined with speech enhancement implemented using Wiener filtering or modulation-domain Kalman filtering were tested in normally-hearing (NH) listeners. Both beamformer types gave substantial improvements relative to no processing, with significantly greater benefit for binaural beamforming. Performance with additional mask-informed enhancement was poorer than with beamforming alone, for both beamformer types and both listener groups. In NH listeners the addition of mask-informed enhancement produced significantly poorer performance than both other forms of enhancement, neither of which differed from the beamformer alone. In summary, the additional improvement in SNR provided by binaural beamforming appeared to outweigh loss of spatial information, while speech understanding was not further improved by the mask-informed enhancement method implemented here.

摘要

一种结合波束形成和掩蔽语音增强的信号处理方法,通过测量在模拟嘈杂教室中使用耳后助听器输出的不利聆听条件下,轻度至中度听力障碍的聆听者的句子识别能力来评估。比较了两种类型的波束形成:双耳,每个助听器的两个麦克风作为单个阵列处理;双侧,其中独立的左、右波束形成器是衍生的。双耳波束形成产生更窄的波束,最大限度地提高信噪比 (SNR) 的改善,但消除了在双侧波束形成中保留的空间多样性。每个波束形成器类型都针对真实目标位置进行了优化,并在有和没有额外语音增强的情况下进行了实施,其中从波束形成器输出中提取的频谱特征被传递到经过训练以识别由目标语音主导的时频区域的深度神经网络。在正常听力 (NH) 聆听者中测试了包含双边波束形成和使用维纳滤波或调制域卡尔曼滤波实现的语音增强的其他条件。与无处理相比,两种波束形成器类型都有很大的改进,双侧波束形成器的受益更大。与单独使用波束形成器相比,使用额外掩蔽语音增强的性能更差,两种听力组的情况均如此。在 NH 聆听者中,添加掩蔽语音增强会导致比其他两种增强形式都更差的性能,而这两种增强形式都与单独使用波束形成器没有区别。总之,双侧波束形成提供的 SNR 额外提高似乎超过了空间信息的损失,而这里实施的掩蔽语音增强方法并没有进一步提高语音理解能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4500/8744079/8a9b99e890f4/10.1177_23312165211068629-fig10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4500/8744079/7257fc5cf68f/10.1177_23312165211068629-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4500/8744079/e951610e4309/10.1177_23312165211068629-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4500/8744079/5cc098201631/10.1177_23312165211068629-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4500/8744079/d19f004d8777/10.1177_23312165211068629-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4500/8744079/f6a975d969d1/10.1177_23312165211068629-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4500/8744079/63a9c0365fa0/10.1177_23312165211068629-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4500/8744079/996acee6f4fc/10.1177_23312165211068629-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4500/8744079/a58f28ef7986/10.1177_23312165211068629-fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4500/8744079/5fe22a6cd883/10.1177_23312165211068629-fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4500/8744079/8a9b99e890f4/10.1177_23312165211068629-fig10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4500/8744079/7257fc5cf68f/10.1177_23312165211068629-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4500/8744079/e951610e4309/10.1177_23312165211068629-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4500/8744079/5cc098201631/10.1177_23312165211068629-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4500/8744079/d19f004d8777/10.1177_23312165211068629-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4500/8744079/f6a975d969d1/10.1177_23312165211068629-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4500/8744079/63a9c0365fa0/10.1177_23312165211068629-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4500/8744079/996acee6f4fc/10.1177_23312165211068629-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4500/8744079/a58f28ef7986/10.1177_23312165211068629-fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4500/8744079/5fe22a6cd883/10.1177_23312165211068629-fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4500/8744079/8a9b99e890f4/10.1177_23312165211068629-fig10.jpg

相似文献

1
Speech recognition with a hearing-aid processing scheme combining beamforming with mask-informed speech enhancement.采用波束形成与掩蔽信息语音增强相结合的助听器处理方案进行语音识别。
Trends Hear. 2022 Jan-Dec;26:23312165211068629. doi: 10.1177/23312165211068629.
2
An Evaluation of Hearing Aid Beamforming Microphone Arrays in a Noisy Laboratory Setting.在嘈杂实验室环境中对助听器波束形成麦克风阵列的评估。
J Am Acad Audiol. 2019 Feb;30(2):131-144. doi: 10.3766/jaaa.17090. Epub 2018 Jan 2.
3
Speech reception with different bilateral directional processing schemes: Influence of binaural hearing, audiometric asymmetry, and acoustic scenario.采用不同双边定向处理方案时的言语接收:双耳听力、听力不对称及声学场景的影响
Hear Res. 2017 Sep;353:36-48. doi: 10.1016/j.heares.2017.07.014. Epub 2017 Jul 29.
4
A Diagonal-Steering-Based Binaural Beamforming Algorithm Incorporating a Diagonal Speech Localizer for Persons With Bilateral Hearing Impairment.一种基于对角线转向的双耳波束形成算法,该算法结合了用于双侧听力障碍者的对角线语音定位器。
Artif Organs. 2015 Dec;39(12):1061-8. doi: 10.1111/aor.12488. Epub 2015 May 8.
5
Binaural pre-processing for contralateral sound field attenuation can improve speech-in-noise intelligibility for bilateral hearing-aid users.用于对侧声场衰减的双耳预处理可以提高双侧助听器使用者在噪声环境下的言语清晰度。
Hear Res. 2023 May;432:108743. doi: 10.1016/j.heares.2023.108743. Epub 2023 Mar 25.
6
Benefits of Beamforming With Local Spatial-Cue Preservation for Speech Localization and Segregation.具有局部空间线索保留的波束形成在语音定位和分离中的优势。
Trends Hear. 2020 Jan-Dec;24:2331216519896908. doi: 10.1177/2331216519896908.
7
Head shadow enhancement with low-frequency beamforming improves sound localization and speech perception for simulated bimodal listeners.低频波束形成增强头部阴影可改善模拟双峰听力者的声音定位和言语感知。
Hear Res. 2018 Jun;363:78-84. doi: 10.1016/j.heares.2018.03.007. Epub 2018 Mar 12.
8
The effect of hearing aid technologies on listening in an automobile.助听器技术对在汽车中聆听的影响。
J Am Acad Audiol. 2013 Jun;24(6):474-85. doi: 10.3766/jaaa.24.6.4.
9
Speech signal enhancement in cocktail party scenarios by deep learning based virtual sensing of head-mounted microphones.基于深度学习的头戴式麦克风虚拟传感在鸡尾酒会场景下的语音信号增强。
Hear Res. 2021 Sep 1;408:108294. doi: 10.1016/j.heares.2021.108294. Epub 2021 Jun 17.
10
Examination of a hybrid beamformer that preserves auditory spatial cues.对一种保留听觉空间线索的混合波束形成器的检查。
J Acoust Soc Am. 2017 Oct;142(4):EL369. doi: 10.1121/1.5007279.

引用本文的文献

1
Improving auditory attention decoding by classifying intracranial responses to glimpsed and masked acoustic events.通过对瞥见和掩蔽声学事件的颅内反应进行分类来改善听觉注意力解码。
Imaging Neurosci (Camb). 2024;2. doi: 10.1162/imag_a_00148. Epub 2024 Apr 26.
2
Acoustic Source Tracking Based on Probabilistic Data Association and Distributed Cubature Kalman Filtering in Acoustic Sensor Networks.基于概率数据关联和分布式容积卡尔曼滤波的声学传感器网络声源跟踪
Sensors (Basel). 2022 Sep 21;22(19):7160. doi: 10.3390/s22197160.

本文引用的文献

1
Creating Clarity in Noisy Environments by Using Deep Learning in Hearing Aids.通过在助听器中使用深度学习在嘈杂环境中实现清晰聆听。
Semin Hear. 2021 Aug;42(3):260-281. doi: 10.1055/s-0041-1735134. Epub 2021 Sep 24.
2
Perceptual Evaluation of Binaural MVDR-Based Algorithms to Preserve the Interaural Coherence of Diffuse Noise Fields.双耳 MVDR 基算法对扩散噪声场的立体相干性保持的感知评估。
Trends Hear. 2020 Jan-Dec;24:2331216520919573. doi: 10.1177/2331216520919573.
3
Benefits of Beamforming With Local Spatial-Cue Preservation for Speech Localization and Segregation.
具有局部空间线索保留的波束形成在语音定位和分离中的优势。
Trends Hear. 2020 Jan-Dec;24:2331216519896908. doi: 10.1177/2331216519896908.
4
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation.卷积时域音频分离网络(Conv-TasNet):超越理想时频幅度掩蔽的语音分离方法
IEEE/ACM Trans Audio Speech Lang Process. 2019 Aug;27(8):1256-1266. doi: 10.1109/TASLP.2019.2915167. Epub 2019 May 6.
5
Personalized signal-independent beamforming for binaural hearing aids.个性化信号无关波束形成技术在双耳助听设备中的应用。
J Acoust Soc Am. 2019 May;145(5):2971. doi: 10.1121/1.5102173.
6
Efficacy and Effectiveness of Advanced Hearing Aid Directional and Noise Reduction Technologies for Older Adults With Mild to Moderate Hearing Loss.老年人轻度至中度听力损失的高级助听器指向性和降噪技术的疗效和有效性。
Ear Hear. 2019 Jul/Aug;40(4):805-822. doi: 10.1097/AUD.0000000000000672.
7
Examination of a hybrid beamformer that preserves auditory spatial cues.对一种保留听觉空间线索的混合波束形成器的检查。
J Acoust Soc Am. 2017 Oct;142(4):EL369. doi: 10.1121/1.5007279.
8
Speech reception with different bilateral directional processing schemes: Influence of binaural hearing, audiometric asymmetry, and acoustic scenario.采用不同双边定向处理方案时的言语接收:双耳听力、听力不对称及声学场景的影响
Hear Res. 2017 Sep;353:36-48. doi: 10.1016/j.heares.2017.07.014. Epub 2017 Jul 29.
9
Comparing Binaural Pre-processing Strategies III: Speech Intelligibility of Normal-Hearing and Hearing-Impaired Listeners.双耳预处理策略比较III:正常听力和听力受损听众的言语可懂度
Trends Hear. 2015 Dec 30;19:2331216515618609. doi: 10.1177/2331216515618609.
10
Benefits of Acoustic Beamforming for Solving the Cocktail Party Problem.声束形成在解决鸡尾酒会问题中的优势。
Trends Hear. 2015 Jun 30;19:2331216515593385. doi: 10.1177/2331216515593385.