元音和元音序列发音过程中声道模式的时间依赖性。

Time dependence of vocal tract modes during production of vowels and vowel sequences.

作者信息

Story Brad H

机构信息

Speech Acoustics Laboratory, Department of Speech, Language, and Hearing Sciences, University of Arizona, Tucson, Arizona 85721, USA.

出版信息

J Acoust Soc Am. 2007 Jun;121(6):3770-89. doi: 10.1121/1.2730621.

DOI:10.1121/1.2730621

PMID:17552726

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2310171/

Abstract

Vocal tract shaping patterns based on articulatory fleshpoint data from four speakers in the University of Wisconsin x-ray microbeam (XRMB) database [J. Westbury, UW-Madison, (1994)] were determined with a principal component analysis (PCA). Midsagittal cross-distance functions representative of approximately the front 6 cm of the oral cavity for each of 11 vowels and vowel-vowel (VV) sequences were obtained from the pellet positions and the hard palate profile for the four speakers. A PCA was independently performed on each speaker's set of cross-distance functions representing static vowels only, and again with time-dependent cross-distance functions representing vowels and VV sequences. In all cases, results indicated that the first two orthogonal components (referred to as modes) accounted for more than 97% of the variance in each speaker's set of cross-distance functions. In addition, the shape of each mode was shown to be similar across the speakers suggesting that the modes represent common patterns of vocal tract deformation. Plots of the resulting time-dependent coefficient records showed that the four speakers activated each mode similarly during production of the vowel sequences. Finally, a procedure was described for using the time-dependent mode coefficients obtained from the XRMB data as input for an area function model of the vocal tract.

摘要

基于威斯康星大学X射线微束（XRMB）数据库[J.韦斯特伯里，威斯康星大学麦迪逊分校，（1994年）]中四位说话者的发音肉点数据，通过主成分分析（PCA）确定声道塑造模式。从四位说话者的颗粒位置和硬腭轮廓中获得了代表11个元音和元音-元音（VV）序列中每个元音口腔前部约6厘米的矢状中交叉距离函数。对每位说话者仅代表静态元音的交叉距离函数集独立进行主成分分析，然后再次对代表元音和VV序列的随时间变化的交叉距离函数进行主成分分析。在所有情况下，结果表明前两个正交分量（称为模式）占每位说话者交叉距离函数集方差的97%以上。此外，各模式的形状在不同说话者之间显示出相似性，这表明这些模式代表了声道变形的常见模式。所得随时间变化的系数记录图表明，四位说话者在元音序列发音过程中以相似方式激活每个模式。最后，描述了一种程序，用于将从XRMB数据中获得的随时间变化的模式系数用作声道面积函数模型的输入。