An acoustically-driven vocal tract model for stop consonant production.

Author information

Story Brad H, Bunton Kate

Affiliations

Speech Acoustics Laboratory, Department of Speech, Language, and Hearing Sciences, University of Arizona, P.O. Box 210071, Tucson, AZ 85721.

Publication information

Speech Commun. 2017 Mar;87:1-17. doi: 10.1016/j.specom.2016.12.001. Epub 2016 Dec 9.

Abstract

The purpose of this study was to further develop a multi-tier model of the vocal tract area function in which the modulations of shape that produce speech are generated by the product of a vowel substrate and a consonant superposition function. The new approach specifies the input parameters for a target consonant as a set of directional changes in the resonance frequencies of the vowel substrate. Using calculations of acoustic sensitivity functions, these "resonance deflection patterns" are transformed into time-varying deformations of the vocal tract shape without any direct specification of the location or extent of the consonant constriction along the vocal tract. The configurations of constrictions and expansions generated by this process were shown to be physiologically realistic and to produce speech sounds that are easily identifiable as the target consonants. This model is a useful enhancement for area-function-based synthesis and can serve as a tool for understanding how the vocal tract is shaped by a talker during speech production.
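The idea of turning a resonance deflection pattern into a shape deformation can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes an ideal uniform tube, closed at the glottis and open at the lips, for which the sensitivity of resonance n to a local area change has the textbook closed form -cos((2n-1)πx), with x the normalized distance from the glottis. The function names, the 44-section discretization, the 3 cm² area, and the linear mapping from summed sensitivity to a multiplicative scaling are all hypothetical choices made for illustration.

```python
import math


def uniform_tube_sensitivity(n, num_sections):
    """Sensitivity of resonance n (1-based) to an area change in each section.

    Assumes an ideal uniform tube, closed at the glottis and open at the lips.
    Positive values mean that enlarging the section raises the resonance.
    """
    values = []
    for i in range(num_sections):
        x = (i + 0.5) / num_sections  # section midpoint, 0 = glottis, 1 = lips
        values.append(-math.cos((2 * n - 1) * math.pi * x))
    return values


def deflection_to_deformation(deflection, num_sections):
    """Sum the sensitivity functions weighted by the desired direction
    (+1 raise, -1 lower, 0 leave alone) of each resonance."""
    total = [0.0] * num_sections
    for n, direction in enumerate(deflection, start=1):
        sens = uniform_tube_sensitivity(n, num_sections)
        for i in range(num_sections):
            total[i] += direction * sens[i]
    return total


def superposition_function(deformation, magnitude):
    """Hypothetical mapping to a multiplicative scaling of the area function:
    1.0 leaves a section unchanged, values below 1.0 constrict it,
    and 0.0 closes it. Assumes the deformation is not all zeros."""
    peak = max(abs(v) for v in deformation)
    return [max(0.0, 1.0 + magnitude * v / peak) for v in deformation]


# Illustrative use: ask for all three resonances to move downward.
num_sections = 44
vowel_substrate = [3.0] * num_sections  # assumed uniform area in cm^2
deformation = deflection_to_deformation([-1, -1, -1], num_sections)
consonant = superposition_function(deformation, magnitude=1.0)
area = [v * c for v, c in zip(vowel_substrate, consonant)]
tightest_section = area.index(min(area))  # index of the narrowest section
```

In this idealized case, asking for all three resonances to fall places the narrowest section at the lip end of the tube, without the constriction location ever being specified directly. The sketch covers a single time step; the model described in the abstract applies the deformation as a time-varying function over a vowel substrate that is generally not uniform.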
