Closed phase covariance analysis based on constrained linear prediction for glottal inverse filtering.

Authors

Alku Paavo, Magi Carlo, Yrttiaho Santeri, Bäckström Tom, Story Brad

Affiliation

Department of Signal Processing and Acoustics, Helsinki University of Technology, P.O. Box 3000, Fi-02015 TKK, Finland.

Publication

J Acoust Soc Am. 2009 May;125(5):3289-305. doi: 10.1121/1.3095801.

DOI:10.1121/1.3095801
PMID:19425671
Abstract

Closed phase (CP) covariance analysis is a widely used glottal inverse filtering method based on the estimation of the vocal tract during the glottal CP. Since the length of the CP is typically short, the vocal tract computation with linear prediction (LP) is vulnerable to the covariance frame position. The present study proposes modification of the CP algorithm based on two issues. First, and most importantly, the computation of the vocal tract model is changed from the one used in the conventional LP into a form where a constraint is imposed on the dc gain of the inverse filter in the filter optimization. With this constraint, LP analysis is more prone to give vocal tract models that are justified by the source-filter theory; that is, they show complex conjugate roots in the formant regions rather than unrealistic resonances at low frequencies. Second, the new CP method utilizes a minimum phase inverse filter. The method was evaluated using synthetic vowels produced by physical modeling and natural speech. The results show that the algorithm improves the performance of the CP-type inverse filtering and its robustness with respect to the covariance frame position.
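The two modifications named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the dc-gain constraint is written as two linear equality constraints on the inverse filter coefficients (a_0 = 1 and sum of a_k = dc_gain, since A(z) at z = 1 equals the coefficient sum) and solves the resulting constrained least-squares problem in closed form. The function names, the frame parameters, and the root-reflection step used to obtain a minimum phase filter are assumptions made for illustration.

```python
import numpy as np


def covariance_matrix(x, start, length, order):
    """Covariance-method matrix Phi[i, j] = sum_n x[n-i] * x[n-j] over the frame.

    The frame covers samples start .. start+length-1 and needs start >= order.
    """
    n = np.arange(start, start + length)
    # Column i holds the signal delayed by i samples.
    X = np.stack([x[n - i] for i in range(order + 1)], axis=1)
    return X.T @ X


def dc_constrained_lp(x, start, length, order, dc_gain):
    """Minimize a^T Phi a subject to a[0] = 1 and sum(a) = dc_gain.

    Hypothetical helper: the constraints are written as Gamma^T a = b and the
    minimizer is a = Phi^-1 Gamma (Gamma^T Phi^-1 Gamma)^-1 b.
    """
    phi = covariance_matrix(x, start, length, order)
    gamma = np.zeros((order + 1, 2))
    gamma[0, 0] = 1.0   # first constraint picks a[0]
    gamma[:, 1] = 1.0   # second constraint sums all coefficients: A(z) at z = 1
    b = np.array([1.0, dc_gain])
    phi_inv_gamma = np.linalg.solve(phi, gamma)
    multipliers = np.linalg.solve(gamma.T @ phi_inv_gamma, b)
    return phi_inv_gamma @ multipliers


def to_minimum_phase(a):
    """Reflect roots of A(z) lying outside the unit circle to their
    conjugate-reciprocal positions (assumed approach; this rescales the
    overall gain by a constant)."""
    roots = np.roots(a)
    outside = np.abs(roots) > 1.0
    roots[outside] = 1.0 / np.conj(roots[outside])
    return np.real(np.poly(roots)) * a[0]
```

In use, `x` would be a speech segment, the frame would be placed inside the estimated glottal closed phase, and the glottal flow estimate would be obtained by filtering the speech with the returned coefficients. How `dc_gain` and the frame position are chosen is not specified in the abstract and is left to the caller here.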


Similar articles

1. Closed phase covariance analysis based on constrained linear prediction for glottal inverse filtering.
   J Acoust Soc Am. 2009 May;125(5):3289-305. doi: 10.1121/1.3095801.
2. Glottal inverse filtering with the closed-phase covariance analysis utilizing mathematical constraints in modelling of the vocal tract.
   Logoped Phoniatr Vocol. 2009 Dec;34(4):200-9. doi: 10.3109/14015430902913519.
3. Measuring and modeling vocal source-tract interaction.
   IEEE Trans Biomed Eng. 1994 Jul;41(7):663-71. doi: 10.1109/10.301733.
4. Formant frequency estimation of high-pitched vowels using weighted linear prediction.
   J Acoust Soc Am. 2013 Aug;134(2):1295-313. doi: 10.1121/1.4812756.
5. TKK Aparat: an environment for voice inverse filtering and parameterization.
   Logoped Phoniatr Vocol. 2008;33(1):49-64. doi: 10.1080/14015430701855333.
6. Analysis of Glottal Inverse Filtering in the Presence of Source-Filter Interaction.
   Speech Commun. 2020 Oct;123:98-108. doi: 10.1016/j.specom.2020.07.003. Epub 2020 Jul 24.
7. How the peak glottal area affects linear predictive coding-based formant estimates of vowels.
   J Acoust Soc Am. 2019 Jul;146(1):223. doi: 10.1121/1.5116137.
8. What do male singers mean by modal and falsetto register? An investigation of the glottal voice source.
   Logoped Phoniatr Vocol. 2009;34(2):73-83. doi: 10.1080/14015430902879918.
9. Perceived loudness of speech based on the characteristics of glottal excitation source.
   J Acoust Soc Am. 2009 Oct;126(4):2061-71. doi: 10.1121/1.3203668.
10. Direct speech feature estimation using an iterative EM algorithm for vocal fold pathology detection.
    IEEE Trans Biomed Eng. 1996 Apr;43(4):373-83. doi: 10.1109/10.486257.

Cited by

1. Maximum Correntropy Linear Prediction for Voice Inverse Filtering: Theoretical Framework and Practical Implementation.
   IEEE Trans Audio Speech Lang Process (2025). 2025;33:152-162. doi: 10.1109/taslp.2024.3512187. Epub 2024 Dec 5.
2. Continuous-Time Model Identification of the Subglottal System.
   Biomed Signal Process Control. 2024 Sep;95(Pt A). doi: 10.1016/j.bspc.2024.106394. Epub 2024 May 3.
3. Glottal Airflow Estimation using Neck Surface Acceleration and Low-Order Kalman Smoothing.
   IEEE/ACM Trans Audio Speech Lang Process. 2023;31:2055-2066. doi: 10.1109/taslp.2023.3277269. Epub 2023 May 17.
4. Kalman Filter Implementation of Subglottal Impedance-Based Inverse Filtering to Estimate Glottal Airflow during Phonation.
   Appl Sci (Basel). 2022 Jan;12(1). doi: 10.3390/app12010401. Epub 2021 Dec 31.
5. Evaluation of Glottal Inverse Filtering Algorithms Using a Physiologically Based Articulatory Speech Synthesizer.
   IEEE/ACM Trans Audio Speech Lang Process. 2017 Aug;25(8):1718-1730. doi: 10.1109/taslp.2017.2714839. Epub 2017 Jun 12.
6. Non-stationary Bayesian estimation of parameters from a body cover model of the vocal folds.
   J Acoust Soc Am. 2016 May;139(5):2683. doi: 10.1121/1.4948755.
7. Statistical properties of linear prediction analysis underlying the challenge of formant bandwidth estimation.
   J Acoust Soc Am. 2015 Feb;137(2):944-50. doi: 10.1121/1.4906840.
8. Modeling the effects of a posterior glottal opening on vocal fold dynamics with implications for vocal hyperfunction.
   J Acoust Soc Am. 2014 Dec;136(6):3262. doi: 10.1121/1.4901714.
9. Subglottal Impedance-Based Inverse Filtering of Voiced Sounds Using Neck Surface Acceleration.
   IEEE Trans Audio Speech Lang Process. 2013 Sep;21(9):1929-1939. doi: 10.1109/TASL.2013.2263138.
10. Development of a glottal area index that integrates glottal gap size and open quotient.
    J Acoust Soc Am. 2013 Mar;133(3):1656-66. doi: 10.1121/1.4789931.