CompNet: Complementary network for single-channel speech enhancement.

Authors

Fan Cunhang, Zhang Hongmei, Li Andong, Xiang Wang, Zheng Chengshi, Lv Zhao, Wu Xiaopei

Affiliations

Anhui Province Key Laboratory of Multimodal Cognitive Computation, School of Computer Science and Technology, Anhui University, Hefei 230601, China.

Key Laboratory of Noise and Vibration Research, Institute of Acoustics, Chinese Academy of Sciences, 100190, Beijing, China.

Publication information

Neural Netw. 2023 Nov;168:508-517. doi: 10.1016/j.neunet.2023.09.041. Epub 2023 Sep 25.

DOI:10.1016/j.neunet.2023.09.041
PMID:37832318
Abstract

Recent multi-domain processing methods have demonstrated promising performance on monaural speech enhancement tasks. However, few of them explain why they outperform single-domain approaches. To help fill this gap, this paper presents a complementary single-channel speech enhancement network (CompNet) that demonstrates promising denoising capabilities and offers a distinct perspective on the improvements introduced by multi-domain processing. Specifically, the noisy speech is first enhanced by a time-domain network. However, although the waveform can be feasibly recovered, the distribution of the time-frequency bins may still differ partly from the target spectrum when the problem is reconsidered in the frequency domain. To solve this problem, we design a dedicated dual-path network as a post-processing module that independently filters the magnitude and refines the phase. This further drives the estimated spectrum toward the target spectrum in the time-frequency domain. We conduct extensive experiments on the WSJ0-SI84 and VoiceBank + Demand datasets. Objective test results show that the proposed system is highly competitive with existing systems.
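
The two-stage flow described in the abstract can be sketched roughly as follows. This is only an illustration of the data flow (time-domain enhancement, then separate magnitude and phase paths in the time-frequency domain), not the authors' implementation. The functions `time_domain_stage`, `magnitude_path`, and `phase_path` are hypothetical stand-ins for the learned networks, built from simple signal-processing rules, and the STFT settings (`n_fft=256`, `hop=64`) are assumptions that do not come from the paper.

```python
import numpy as np


def stft(x, n_fft=256, hop=64):
    """Short-time Fourier transform with a periodic Hann window."""
    window = np.hanning(n_fft + 1)[:-1]
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack(
        [x[i * hop:i * hop + n_fft] * window for i in range(n_frames)]
    )
    return np.fft.rfft(frames, axis=1)  # shape: (frames, bins)


def istft(spec, n_fft=256, hop=64):
    """Inverse STFT by weighted overlap-add."""
    window = np.hanning(n_fft + 1)[:-1]
    frames = np.fft.irfft(spec, n=n_fft, axis=1) * window
    length = n_fft + hop * (len(frames) - 1)
    out = np.zeros(length)
    norm = np.zeros(length)
    for i, frame in enumerate(frames):
        out[i * hop:i * hop + n_fft] += frame
        norm[i * hop:i * hop + n_fft] += window ** 2
    # Edge samples have little window coverage, so guard the division.
    return out / np.maximum(norm, 1e-8)


def time_domain_stage(noisy):
    """Hypothetical stand-in for the time-domain network: 5-tap moving average."""
    kernel = np.ones(5) / 5.0
    return np.convolve(noisy, kernel, mode="same")


def magnitude_path(mag):
    """Hypothetical stand-in for the magnitude path: a real-valued gain in [0, 1]."""
    floor = np.median(mag)  # crude global noise-floor estimate (assumption)
    gain = np.clip(1.0 - floor / np.maximum(mag, 1e-8), 0.0, 1.0)
    return gain * mag


def phase_path(phase):
    """Hypothetical stand-in for the phase-refinement path: identity here."""
    return phase


def complementary_enhance(noisy, n_fft=256, hop=64):
    """Stage 1 in the time domain, then magnitude/phase post-processing."""
    coarse = time_domain_stage(noisy)
    spec = stft(coarse, n_fft, hop)
    mag, phase = np.abs(spec), np.angle(spec)
    refined = magnitude_path(mag) * np.exp(1j * phase_path(phase))
    return istft(refined, n_fft, hop)
```

Calling `complementary_enhance(noisy)` on a one-dimensional NumPy array returns a waveform built from the refined spectrum. In a trained system, each stand-in would be replaced by a neural network, and the phase path would presumably do real work instead of passing the phase through unchanged.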

Similar articles

1
CompNet: Complementary network for single-channel speech enhancement.
Neural Netw. 2023 Nov;168:508-517. doi: 10.1016/j.neunet.2023.09.041. Epub 2023 Sep 25.
2
CNN-based noise reduction for multi-channel speech enhancement system with discrete wavelet transform (DWT) preprocessing.
PeerJ Comput Sci. 2024 Feb 28;10:e1901. doi: 10.7717/peerj-cs.1901. eCollection 2024.
3
Low-latency monaural speech enhancement with deep filter-bank equalizer.
J Acoust Soc Am. 2022 May;151(5):3291. doi: 10.1121/10.0011396.
4
Wearable Hearing Device Spectral Enhancement Driven by Non-Negative Sparse Coding-Based Residual Noise Reduction.
Sensors (Basel). 2020 Oct 10;20(20):5751. doi: 10.3390/s20205751.
5
Single-channel noise reduction using optimal rectangular filtering matrices.
J Acoust Soc Am. 2013 Feb;133(2):1090-101. doi: 10.1121/1.4773269.
6
Neural Cascade Architecture with Triple-domain Loss for Speech Enhancement.
IEEE/ACM Trans Audio Speech Lang Process. 2022;30:734-743. doi: 10.1109/taslp.2021.3138716. Epub 2021 Dec 28.
7
Adaptive Weiner filtering with AR-GWO based optimized fuzzy wavelet neural network for enhanced speech enhancement.
Multimed Tools Appl. 2022 Dec 9:1-25. doi: 10.1007/s11042-022-14180-5.
8
Wavelet speech enhancement algorithm using exponential semi-soft mask filtering.
Bioengineered. 2016 Sep 2;7(5):352-356. doi: 10.1080/21655979.2016.1197617. Epub 2016 Jul 19.
9
Improved Transformer-Based Dual-Path Network with Amplitude and Complex Domain Feature Fusion for Speech Enhancement.
Entropy (Basel). 2023 Jan 26;25(2):228. doi: 10.3390/e25020228.
10
Low-dimensional recurrent neural network-based Kalman filter for speech enhancement.
Neural Netw. 2015 Jul;67:131-9. doi: 10.1016/j.neunet.2015.03.008. Epub 2015 Apr 7.