Measuring and modeling vocal source-tract interaction.

Authors

Childers D G, Wong C F

Affiliation

Department of Electrical Engineering, University of Florida, Gainesville 32611-2024.

Publication information

IEEE Trans Biomed Eng. 1994 Jul;41(7):663-71. doi: 10.1109/10.301733.

DOI: 10.1109/10.301733
PMID: 7927387

Abstract

The quality of synthetic speech is affected by two factors: intelligibility and naturalness. At present, synthesized speech may be highly intelligible, but often sounds unnatural. Speech intelligibility depends on the synthesizer's ability to reproduce the formants, the formant bandwidths, and formant transitions, whereas speech naturalness is thought to depend on the excitation waveform characteristics for voiced and unvoiced sounds. Voiced sounds may be generated by a quasiperiodic train of glottal pulses of specified shape exciting the vocal tract filter. It is generally assumed that the glottal source and the vocal tract filter are linearly separable and do not interact. However, this assumption is often not valid, since it has been observed that appreciable source-tract interaction can occur in natural speech. Previous experiments in speech synthesis have demonstrated that the naturalness of synthetic speech does improve when source-tract interaction is simulated in the synthesis process. The purpose of this paper is two-fold: 1) to present an algorithm for automatically measuring source-tract interaction for voiced speech, and 2) to present a simple speech production model that incorporates source-tract interaction into the glottal source model. This glottal source model controls: 1) the skewness of the glottal pulse, and 2) the amount of the first formant ripple superimposed on the glottal pulse. A major application of the results of this paper is the modeling of vocal disorders.
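
The two controls named in the abstract, pulse skewness and first-formant ripple, can be illustrated with a minimal sketch. This is not the authors' model: the piecewise-cosine pulse shape, the damped-sinusoid ripple, and every function and parameter name below (`glottal_pulse`, `add_f1_ripple`, `skew`, `amount`, `bandwidth`) are assumptions chosen only to show what the two parameters might look like in code.

```python
import math


def glottal_pulse(n_samples, open_quotient=0.6, skew=0.7):
    """Return one period of an illustrative glottal flow pulse.

    open_quotient: fraction of the period during which the glottis is open.
    skew: fraction of the open phase spent rising (hypothetical parameter);
          values above 0.5 give a slow rise and a fast fall.
    """
    n_open = int(round(open_quotient * n_samples))
    n_rise = max(1, int(round(skew * n_open)))
    n_fall = max(1, n_open - n_rise)
    pulse = []
    for n in range(n_samples):
        if n < n_rise:
            # Raised-cosine rise from 0 towards 1.
            pulse.append(0.5 * (1.0 - math.cos(math.pi * n / n_rise)))
        elif n < n_rise + n_fall:
            # Quarter-cosine fall from 1 towards 0.
            pulse.append(math.cos(0.5 * math.pi * (n - n_rise) / n_fall))
        else:
            # Closed phase: no flow.
            pulse.append(0.0)
    return pulse


def add_f1_ripple(pulse, fs, f1=500.0, amount=0.1, bandwidth=80.0):
    """Superimpose a damped sinusoid at the first formant on the pulse.

    The ripple is scaled by the instantaneous flow so that it vanishes in
    the closed phase; this gating is an illustrative choice, not a claim
    about how the paper couples source and tract.
    """
    out = []
    for n, g in enumerate(pulse):
        t = n / fs
        ripple = math.exp(-math.pi * bandwidth * t) * math.sin(2.0 * math.pi * f1 * t)
        out.append(g * (1.0 + amount * ripple))
    return out


# Example: 100 Hz pitch at a 10 kHz sampling rate gives 100 samples per period.
fs = 10000.0
plain = glottal_pulse(100, open_quotient=0.6, skew=0.7)
rippled = add_f1_ripple(plain, fs, f1=500.0, amount=0.2)
```

Setting `amount=0` recovers the non-interactive pulse, so the ripple term can be read as a single knob for how much source-tract interaction is simulated. Repeating the period quasiperiodically and passing it through a vocal tract filter would complete a conventional source-filter synthesizer.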


Similar articles

1
Measuring and modeling vocal source-tract interaction.
IEEE Trans Biomed Eng. 1994 Jul;41(7):663-71. doi: 10.1109/10.301733.
2
Closed phase covariance analysis based on constrained linear prediction for glottal inverse filtering.
J Acoust Soc Am. 2009 May;125(5):3289-305. doi: 10.1121/1.3095801.
3
TKK Aparat: an environment for voice inverse filtering and parameterization.
Logoped Phoniatr Vocol. 2008;33(1):49-64. doi: 10.1080/14015430701855333.
4
Source-filter comparison of measurements of fundamental frequency perturbation and amplitude perturbation for synthesized voice signals.
J Voice. 2008 Mar;22(2):125-37. doi: 10.1016/j.jvoice.2006.09.007. Epub 2006 Dec 4.
5
A theoretical study of F0-F1 interaction with application to resonant speaking and singing voice.
J Voice. 2004 Sep;18(3):292-8. doi: 10.1016/j.jvoice.2003.12.010.
6
Voice production model integrating boundary-layer analysis of glottal flow and source-filter coupling.
J Acoust Soc Am. 2011 Mar;129(3):1554-67. doi: 10.1121/1.3533732.
7
Investigation of a glottal related harmonics-to-noise ratio and spectral tilt as indicators of glottal noise in synthesized and human voice signals.
J Acoust Soc Am. 2008 Mar;123(3):1642-52. doi: 10.1121/1.2832651.
8
A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment.
IEEE Trans Biomed Eng. 1998 Mar;45(3):300-13. doi: 10.1109/10.661155.
9
What can vortices tell us about vocal fold vibration and voice production.
Curr Opin Otolaryngol Head Neck Surg. 2008 Jun;16(3):183-7. doi: 10.1097/MOO.0b013e3282ff5fc5.
10
Speech synthesis by glottal excited linear prediction.
J Acoust Soc Am. 1994 Oct;96(4):2026-36. doi: 10.1121/1.411319.

Cited by

1
Sensitivity of Source-Filter Interaction to Specific Vocal Tract Shapes.
IEEE/ACM Trans Audio Speech Lang Process. 2016 Dec;24(12):2507-2515. doi: 10.1109/taslp.2016.2616543. Epub 2016 Oct 11.