• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于语音增强的先验信噪比估计和噪声估计。

A priori SNR estimation and noise estimation for speech enhancement.

作者信息

Yao Rui, Zeng ZeQing, Zhu Ping

机构信息

College of Automation Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, China.

出版信息

EURASIP J Adv Signal Process. 2016;2016(1):101. doi: 10.1186/s13634-016-0398-z. Epub 2016 Sep 22.

DOI:10.1186/s13634-016-0398-z
PMID:27729928
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5031741/
Abstract

A priori signal-to-noise ratio (SNR) estimation and noise estimation are important for speech enhancement. In this paper, a novel modified decision-directed (DD) a priori SNR estimation approach based on single-frequency entropy, named DDBSE, is proposed. DDBSE replaces the fixed weighting factor in the DD approach with an adaptive one calculated according to change of single-frequency entropy. Simultaneously, a new noise power estimation approach based on unbiased minimum mean square error (MMSE) and voice activity detection (VAD), named UMVAD, is proposed. UMVAD adopts different strategies to estimate noise in order to reduce over-estimation and under-estimation of noise. UMVAD improves the classical statistical model-based VAD by utilizing an adaptive threshold to replace the original fixed one and modifies the unbiased MMSE-based noise estimation approach using an adaptive a priori speech presence probability calculated by entropy instead of the original fixed one. Experimental results show that DDBSE can provide greater noise suppression than DD and UMVAD can improve the accuracy of noise estimation. Compared to existing approaches, speech enhancement based on UMVAD and DDBSE can obtain a better segment SNR score and composite measure score, especially in adverse environments such as non-stationary noise and low-SNR.

摘要

先验信噪比(SNR)估计和噪声估计对于语音增强很重要。本文提出了一种基于单频熵的新型改进决策导向(DD)先验SNR估计方法,称为DDBSE。DDBSE用根据单频熵变化计算的自适应加权因子取代了DD方法中的固定加权因子。同时,提出了一种基于无偏最小均方误差(MMSE)和语音活动检测(VAD)的新噪声功率估计方法,称为UMVAD。UMVAD采用不同策略估计噪声,以减少噪声的高估和低估。UMVAD通过使用自适应阈值取代原始固定阈值来改进基于经典统计模型的VAD,并使用由熵计算的自适应先验语音存在概率取代原始固定概率来修改基于无偏MMSE的噪声估计方法。实验结果表明,DDBSE比DD能提供更大的噪声抑制,UMVAD能提高噪声估计的准确性。与现有方法相比,基于UMVAD和DDBSE的语音增强可以获得更好的分段SNR分数和综合测量分数,特别是在非平稳噪声和低SNR等不利环境中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f66/5031741/0801eccf6a47/13634_2016_398_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f66/5031741/0499a79566b1/13634_2016_398_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f66/5031741/9c3413107328/13634_2016_398_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f66/5031741/8caa14d2ab93/13634_2016_398_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f66/5031741/b468e339fef6/13634_2016_398_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f66/5031741/63956dcf99ca/13634_2016_398_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f66/5031741/b998c3405410/13634_2016_398_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f66/5031741/c3ea4d2a52bc/13634_2016_398_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f66/5031741/84abe6d32e7e/13634_2016_398_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f66/5031741/0801eccf6a47/13634_2016_398_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f66/5031741/0499a79566b1/13634_2016_398_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f66/5031741/9c3413107328/13634_2016_398_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f66/5031741/8caa14d2ab93/13634_2016_398_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f66/5031741/b468e339fef6/13634_2016_398_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f66/5031741/63956dcf99ca/13634_2016_398_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f66/5031741/b998c3405410/13634_2016_398_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f66/5031741/c3ea4d2a52bc/13634_2016_398_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f66/5031741/84abe6d32e7e/13634_2016_398_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f66/5031741/0801eccf6a47/13634_2016_398_Fig9_HTML.jpg

相似文献

1
A priori SNR estimation and noise estimation for speech enhancement.用于语音增强的先验信噪比估计和噪声估计。
EURASIP J Adv Signal Process. 2016;2016(1):101. doi: 10.1186/s13634-016-0398-z. Epub 2016 Sep 22.
2
Spectral distortion level resulting in a just-noticeable difference between an a priori signal-to-noise ratio estimate and its instantaneous case.导致先验信噪比估计与其瞬时情况之间存在可察觉差异的频谱失真程度。
J Acoust Soc Am. 2020 Oct;148(4):1879. doi: 10.1121/10.0002113.
3
Estimation of a priori signal-to-noise ratio using neurograms for speech enhancement.使用神经图估计先验信噪比以进行语音增强。
J Acoust Soc Am. 2020 Jun;147(6):3830. doi: 10.1121/10.0001324.
4
CNN-based noise reduction for multi-channel speech enhancement system with discrete wavelet transform (DWT) preprocessing.基于卷积神经网络(CNN)的多通道语音增强系统的降噪方法,采用离散小波变换(DWT)预处理。
PeerJ Comput Sci. 2024 Feb 28;10:e1901. doi: 10.7717/peerj-cs.1901. eCollection 2024.
5
A novel voice sensor for the detection of speech signals.一种用于检测语音信号的新型声音传感器。
Sensors (Basel). 2013 Dec 2;13(12):16533-50. doi: 10.3390/s131216533.
6
Speech enhancement via two-stage dual tree complex wavelet packet transform with a speech presence probability estimator.基于语音存在概率估计器的两阶段双树复数小波包变换语音增强
J Acoust Soc Am. 2017 Feb;141(2):808. doi: 10.1121/1.4976049.
7
Auditory filter-bank compression improves estimation of signal-to-noise ratio for speech in noise.听觉滤波器组压缩可改善噪声环境中语音的信噪比估计。
J Acoust Soc Am. 2020 May;147(5):3197. doi: 10.1121/10.0001168.
8
Incorporation of a modified temporal cepstrum smoothing in both signal-to-noise ratio and speech presence probability estimation for speech enhancement.在语音增强的信噪比和语音存在概率估计中融入改进的时域倒谱平滑。
J Acoust Soc Am. 2024 Jun 1;155(6):3678-3689. doi: 10.1121/10.0026223.
9
On training targets for deep learning approaches to clean speech magnitude spectrum estimation.深度学习方法在语音谱估计中对训练目标的研究。
J Acoust Soc Am. 2021 May;149(5):3273. doi: 10.1121/10.0004823.
10
Speech enhancement for cochlear implant recipients.人工耳蜗植入者的语音增强。
J Acoust Soc Am. 2018 Apr;143(4):2244. doi: 10.1121/1.5031112.

本文引用的文献

1
Reasons why current speech-enhancement algorithms do not improve speech intelligibility and suggested solutions.当前语音增强算法未能提高语音清晰度的原因及建议的解决方案。
IEEE Trans Audio Speech Lang Process. 2011;19(1):47-56. doi: 10.1109/TASL.2010.2045180.
2
Subjective comparison and evaluation of speech enhancement algorithms.语音增强算法的主观比较与评估
Speech Commun. 2007 Jul;49(7):588-601. doi: 10.1016/j.specom.2006.12.006.