An acoustically-driven vocal tract model for stop consonant production.

Author information

Story Brad H, Bunton Kate

Affiliation

Speech Acoustics Laboratory, Department of Speech, Language, and Hearing Sciences, University of Arizona, P.O. Box 210071, Tucson, AZ 85721.

Publication information

Speech Commun. 2017 Mar;87:1-17. doi: 10.1016/j.specom.2016.12.001. Epub 2016 Dec 9.

DOI:10.1016/j.specom.2016.12.001

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5234468/

Abstract

The purpose of this study was to further develop a multi-tier model of the vocal tract area function in which the modulations of shape to produce speech are generated by the product of a vowel substrate and a consonant superposition function. The new approach consists of specifying input parameters for a target consonant as a set of directional changes in the resonance frequencies of the vowel substrate. Using calculations of acoustic sensitivity functions, these "resonance deflection patterns" are transformed into time-varying deformations of the vocal tract shape without any direct specification of the location or extent of the consonant constriction along the vocal tract. The configurations of constrictions and expansions generated by this process were shown to be physiologically realistic and to produce speech sounds that are easily identifiable as the target consonants. This model is a useful enhancement for area function-based synthesis and can serve as a tool for understanding how the vocal tract is shaped by a talker during speech production.
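The core idea of the abstract, a deformation derived from sensitivity functions and a resonance deflection pattern, then multiplied onto a vowel substrate, can be illustrated with a toy sketch. This is not the authors' implementation. It assumes an idealized uniform tube (closed at the glottis, open at the lips), for which the sensitivity of the n-th resonance to a local area change has the closed form S_n(u) = -cos((2n - 1)πu), with u the normalized distance from the glottis. It also applies the deformation in a single step, and the function names, the 44-section discretization, the 3 cm² substrate area, and the raised-cosine time course are all hypothetical choices made for illustration.

```python
import numpy as np

def uniform_tube_sensitivity(n_sections: int, n_resonances: int) -> np.ndarray:
    """Sensitivity functions of an idealized uniform tube (assumption).

    Row n-1 holds S_n(u) = -cos((2n - 1) * pi * u), where u = 0 is the
    glottis (closed end) and u = 1 is the lips (open end). A positive value
    means that enlarging the area there raises that resonance.
    """
    u = np.linspace(0.0, 1.0, n_sections)
    n = np.arange(1, n_resonances + 1)[:, None]
    return -np.cos((2 * n - 1) * np.pi * u)

def consonant_superposition(deflection, sensitivity, magnitude) -> np.ndarray:
    """Toy consonant superposition function C(x, t) = 1 + m(t) * delta(x).

    `deflection` holds one direction (+1 or -1) per resonance. delta(x) is
    the deflection-weighted sum of sensitivity functions, scaled to [-1, 1].
    """
    delta = np.asarray(deflection, dtype=float) @ sensitivity
    delta /= np.max(np.abs(delta))
    c = 1.0 + np.outer(magnitude, delta)
    return np.clip(c, 0.0, None)  # areas cannot be negative

n_sections = 44                               # hypothetical discretization
vowel_substrate = np.full(n_sections, 3.0)    # cm^2, neutral tube (assumption)

# Hypothetical time course: deformation grows from 0 to 1 and relaxes to 0.
t = np.linspace(0.0, 1.0, 101)
magnitude = 0.5 * (1.0 - np.cos(2.0 * np.pi * t))

# Deflection pattern: push the first three resonances downward.
sens = uniform_tube_sensitivity(n_sections, 3)
c = consonant_superposition([-1, -1, -1], sens, magnitude)

# Area function over time = vowel substrate times consonant superposition.
area = vowel_substrate * c                    # shape (time, sections)

tightest = int(np.argmin(area[50]))           # frame of peak deformation
print("narrowest section at peak:", tightest, "of", n_sections - 1)
```

In this toy setting, asking for all three resonances to fall places the narrowest point at the lip end of the tube without the constriction location ever being specified, which is the behavior the abstract describes. The published model works with real vowel area functions and computed sensitivity functions rather than the closed-form expression assumed here.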
