CReSS LLC, 1 Seaborn Place, Lexington, Massachusetts 02420, USA.
J Acoust Soc Am. 2012 Jan;131(1):424-34. doi: 10.1121/1.3665988.
Traditional models of the mapping from midsagittal cross-distances to cross-sectional areas use only local cross-distance information. Such models are not optimal, because phonemic identity can affect the relation between local cross-distance and cross-sectional area. Phonemic identity, however, is not an appropriate independent variable for controlling an articulatory synthesizer. Two alternative approaches to constructing cross-distance-to-area mappings that can be used for articulatory synthesis are presented: a vowel height-sensitive model and a non-parametric model called loess. Both depend on global cross-distance information and generally perform better than the traditional models.
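As a rough illustration of the loess idea in this setting, the sketch below fits a locally weighted linear regression that predicts one area value from a vector of cross-distances. It is a generic textbook form of loess, not the authors' implementation: the function name `loess_predict`, the tricube kernel, the Euclidean neighborhood, the `span` value, and the choice of predictors are all assumptions made for the example, since the abstract does not specify them.

```python
import numpy as np

def loess_predict(X, y, x0, span=0.5):
    """Locally weighted linear regression estimate of y at query point x0.

    X    : (n, p) predictors (assumed here: cross-distances at p locations,
           standing in for "global" cross-distance information)
    y    : (n,) responses (assumed here: cross-sectional area at one location)
    x0   : (p,) query point
    span : fraction of the n samples used in each local fit (assumed value)
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    x0 = np.asarray(x0, dtype=float)
    n, p = X.shape

    # Neighborhood size: at least p + 1 points, at most all n points.
    k = min(n, max(int(np.ceil(span * n)), p + 1))

    # Select the k nearest samples by Euclidean distance.
    dist = np.linalg.norm(X - x0, axis=1)
    idx = np.argsort(dist)[:k]
    h = dist[idx].max()
    if h == 0.0:
        # All neighbors coincide with x0; fall back to their mean response.
        return float(y[idx].mean())

    # Tricube weights: 1 at the query point, falling to 0 at the farthest neighbor.
    u = dist[idx] / h
    w = (1.0 - u**3) ** 3

    # Weighted least squares on a design centered at x0, so the intercept
    # is the fitted value at x0.
    A = np.hstack([np.ones((k, 1)), X[idx] - x0])
    sw = np.sqrt(w)
    coef, *_ = np.linalg.lstsq(A * sw[:, None], y[idx] * sw, rcond=None)
    return float(coef[0])
```

Because each fit is linear within its neighborhood, the estimate reproduces exactly linear data without error; any advantage over a purely local model would come from supplying cross-distances at several locations as predictors, which is an assumption of this sketch.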