IEEE Trans Biomed Eng. 2024 Jul;71(7):2232-2242. doi: 10.1109/TBME.2024.3367530. Epub 2024 Jun 19.
Cochlear implants (CIs) provide a solution for individuals with severe sensorineural hearing loss to regain their hearing abilities. When someone experiences this form of hearing impairment in both ears, they may be equipped with two separate CI devices, which will typically further improve the CI benefits. This spatial hearing is particularly crucial when tackling the challenge of understanding speech in noisy environments, a common issue CI users face. Currently, extensive research is dedicated to developing algorithms that can autonomously filter out undesired background noises from desired speech signals. At present, some research focuses on achieving end-to-end denoising, either as an integral component of the initial CI signal processing or by fully integrating the denoising process into the CI sound coding strategy. This work is presented in the context of bilateral CI (BiCI) systems, where we propose a deep-learning-based bilateral speech enhancement model that shares information between both hearing sides. Specifically, we connect two monaural end-to-end deep denoising sound coding techniques through intermediary latent fusion layers. These layers amalgamate the latent representations generated by these techniques by multiplying them together, resulting in an enhanced ability to reduce noise and improve learning generalization. The objective instrumental results demonstrate that the proposed fused BiCI sound coding strategy achieves higher interaural coherence, superior noise reduction, and enhanced predicted speech intelligibility scores compared to the baseline methods. Furthermore, our speech-in-noise intelligibility results in BiCI users reveal that the deep denoising sound coding strategy can attain scores similar to those achieved in quiet conditions.
人工耳蜗(Cochlear implants,CIs)为重度感音神经性听力损失患者提供了恢复听力的解决方案。当一个人双耳都患有这种听力障碍时,他们可能会配备两个独立的 CI 设备,这通常会进一步提高 CI 的效果。这种空间听觉在处理嘈杂环境中的言语理解挑战时尤为重要,这是 CI 用户面临的常见问题。目前,大量研究致力于开发能够自动从所需语音信号中滤除不需要的背景噪声的算法。目前,一些研究侧重于实现端到端降噪,要么作为初始 CI 信号处理的一个组成部分,要么通过将降噪过程完全集成到 CI 声音编码策略中。这项工作是在双侧 CI(Bilateral CI,BiCI)系统的背景下进行的,我们提出了一种基于深度学习的双侧语音增强模型,该模型可以在两个听力侧之间共享信息。具体来说,我们通过中间的潜在融合层将两个单声道端到端深度去噪声音编码技术连接起来。这些层通过将它们相乘来合并这两种技术生成的潜在表示,从而提高了降低噪声和提高学习泛化的能力。客观仪器结果表明,与基线方法相比,所提出的融合 BiCI 声音编码策略实现了更高的双侧相干性、更好的降噪效果和增强的预测言语可懂度得分。此外,我们在 BiCI 用户中的语音噪声可懂度结果表明,深度去噪声音编码策略可以达到与安静条件下相似的得分。