• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用短期倒谱分析对语音信号进行噪声估计。

Noise estimation in voice signals using short-term cepstral analysis.

作者信息

Murphy Peter J, Akande Olatunji O

机构信息

Department of Electronic and Computer Engineering, University of Limerick, Limerick, Ireland.

出版信息

J Acoust Soc Am. 2007 Mar;121(3):1679-90. doi: 10.1121/1.2427123.

DOI:10.1121/1.2427123
PMID:17407904
Abstract

Cepstral-based estimation is used to provide a baseline estimate of the noise level in the logarithmic spectrum for voiced speech. A theoretical description of cepstral processing of voiced speech containing aspiration noise, together with supporting empirical data, is provided in order to illustrate the nature of the noise baseline estimation process. Taking the Fourier transform of the liftered (filtered in the cepstral domain) cepstrum produces a noise baseline estimate. It is shown that Fourier transforming the low-pass liftered cepstrum is comparable to applying a moving average (MA) filter to the logarithmic spectrum and hence the baseline receives contributions from the glottal source excited vocal tract and the noise excited vocal tract. Because the estimation process resembles the action of a MA filter, the resulting noise baseline is determined by the harmonic resolution (as determined by the temporal analysis window length) and the glottal source spectral tilt. On selecting an appropriate temporal analysis window length the estimated baseline is shown to lie halfway between the glottal excited vocal tract and the noise excited vocal tract. This information is employed in a new harmonics-to-noise (HNR) estimation technique, which is shown to provide accurate HNR estimates when tested on synthetically generated voice signals.

摘要

基于倒谱的估计用于为浊音语音的对数频谱中的噪声水平提供基线估计。本文给出了包含吸气噪声的浊音语音倒谱处理的理论描述,并辅以实证数据,以说明噪声基线估计过程的本质。对提升后的(在倒谱域中滤波)倒谱进行傅里叶变换可得到噪声基线估计。结果表明,对低通提升后的倒谱进行傅里叶变换相当于对对数频谱应用移动平均(MA)滤波器,因此基线接收来自声门源激励声道和噪声激励声道的贡献。由于估计过程类似于MA滤波器的作用,因此得到的噪声基线由谐波分辨率(由时间分析窗口长度决定)和声门源频谱倾斜度决定。在选择合适的时间分析窗口长度时,估计的基线显示位于声门激励声道和噪声激励声道之间的中间位置。该信息被应用于一种新的谐波噪声比(HNR)估计技术,在对合成生成的语音信号进行测试时,该技术能提供准确的HNR估计。

相似文献

1
Noise estimation in voice signals using short-term cepstral analysis.使用短期倒谱分析对语音信号进行噪声估计。
J Acoust Soc Am. 2007 Mar;121(3):1679-90. doi: 10.1121/1.2427123.
2
On first rahmonic amplitude in the analysis of synthesized aperiodic voice signals.
J Acoust Soc Am. 2006 Nov;120(5 Pt 1):2896-907. doi: 10.1121/1.2355483.
3
Investigation of a glottal related harmonics-to-noise ratio and spectral tilt as indicators of glottal noise in synthesized and human voice signals.研究与声门相关的谐波噪声比和频谱倾斜度作为合成语音信号和人类语音信号中声门噪声指标的情况。
J Acoust Soc Am. 2008 Mar;123(3):1642-52. doi: 10.1121/1.2832651.
4
Spectral noise estimation in the evaluation of pathological voice.病理性嗓音评估中的频谱噪声估计
Logoped Phoniatr Vocol. 2006;31(4):182-9. doi: 10.1080/14015430600576238.
5
Perturbation-free measurement of the harmonics-to-noise ratio in voice signals using pitch synchronous harmonic analysis.使用基音同步谐波分析对语音信号中的谐波噪声比进行无扰动测量。
J Acoust Soc Am. 1999 May;105(5):2866-81. doi: 10.1121/1.426901.
6
A cepstrum-based technique for determining a harmonics-to-noise ratio in speech signals.一种基于倒谱的语音信号谐波噪声比测定技术。
J Speech Hear Res. 1993 Apr;36(2):254-66. doi: 10.1044/jshr.3602.254.
7
Source-filter comparison of measurements of fundamental frequency perturbation and amplitude perturbation for synthesized voice signals.合成语音信号基频微扰和幅度微扰测量的源-滤波器比较
J Voice. 2008 Mar;22(2):125-37. doi: 10.1016/j.jvoice.2006.09.007. Epub 2006 Dec 4.
8
Influence of the ventricular folds on a voice source with specified vocal fold motion.声襞对指定声带运动声源的影响。
J Acoust Soc Am. 2010 Mar;127(3):1519-27. doi: 10.1121/1.3299200.
9
Temporal and spectral estimations of harmonics-to-noise ratio in human voice signals.人类语音信号中谐波与噪声比的时间和频谱估计。
J Acoust Soc Am. 1997 Jul;102(1):537-43. doi: 10.1121/1.419726.
10
The Impact of Nasalance on Cepstral Peak Prominence and Harmonics-to-Noise Ratio.鼻音对谐波峰值突出度和声噪比的影响。
Laryngoscope. 2019 Aug;129(8):E299-E304. doi: 10.1002/lary.27685. Epub 2018 Dec 25.

引用本文的文献

1
Integration of Motor Learning Principles Into Real-Time Ambulatory Voice Biofeedback and Example Implementation Via a Clinical Case Study With Vocal Fold Nodules.将运动学习原理整合到实时动态语音生物反馈中,并通过声带小结临床案例研究进行示例实施。
Am J Speech Lang Pathol. 2017 Feb 1;26(1):1-10. doi: 10.1044/2016_AJSLP-15-0187.
2
Relationships between vocal function measures derived from an acoustic microphone and a subglottal neck-surface accelerometer.由声学麦克风和声门下颈部表面加速度计得出的嗓音功能测量指标之间的关系。
IEEE/ACM Trans Audio Speech Lang Process. 2016 Apr;24(4):659-668. doi: 10.1109/TASLP.2016.2516647. Epub 2016 Jan 11.
3
Automatic evaluation of hypernasality based on a cleft palate speech database.
基于腭裂语音数据库的超鼻音自动评估。
J Med Syst. 2015 May;39(5):61. doi: 10.1007/s10916-015-0242-2. Epub 2015 Mar 28.
4
A study of sounds produced by Algerian esophageal speakers.一项关于阿尔及利亚食管发声者所发出声音的研究。
Afr Health Sci. 2012 Dec;12(4):452-8. doi: 10.4314/ahs.v12i4.9.
5
Voice assessment: updates on perceptual, acoustic, aerodynamic, and endoscopic imaging methods.嗓音评估:感知、声学、空气动力学及内镜成像方法的进展
Curr Opin Otolaryngol Head Neck Surg. 2008 Jun;16(3):211-5. doi: 10.1097/MOO.0b013e3282fe96ce.