Generation of articulatory movements by using a kinematic triphone model.

Author information

Okadome T, Honda M

Affiliation

NTT Laboratories/CREST, Kanagawa-pref, Japan.

Publication information

J Acoust Soc Am. 2001 Jul;110(1):453-63. doi: 10.1121/1.1377633.

DOI:10.1121/1.1377633

PMID:11508970

Abstract

The method described here predicts the trajectories of articulatory movements for continuous speech by using a kinematic triphone model and the minimum-acceleration model. The kinematic triphone model, which is constructed from articulatory data obtained in experiments with an electromagnetic articulograph system, is characterized by three kinematic features of a triphone and by the intervals between two successive phonemes in the triphone. After a kinematic feature of a phoneme in a given sentence is extracted, the minimum-acceleration trajectory, which coincides with the extremum of the time integral of the squared magnitude of the articulator acceleration, is formulated. Calculating the minimum-acceleration trajectory requires only linear computation. The method predicts both the qualitative features and the quantitative details of experimentally observed articulation.
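To illustrate why a minimum-acceleration trajectory can be obtained with linear computation alone, here is a minimal sketch under simplifying assumptions that are not taken from the paper: one articulator coordinate, and position targets only at known phoneme times. Under those assumptions, the curve that minimizes the time integral of squared acceleration while passing through the targets is the natural cubic spline, and its unknowns solve a tridiagonal linear system. The function names `min_accel_second_derivs` and `min_accel_position` are hypothetical, and the kinematic features and constraints used by the authors' triphone model may differ.

```python
from bisect import bisect_right


def min_accel_second_derivs(times, positions):
    """Accelerations at the target times for the natural cubic spline.

    Assumption (not from the paper): only positions are constrained,
    so acceleration is zero at both ends of the trajectory.
    """
    n = len(times) - 1
    if n < 1 or len(positions) != n + 1:
        raise ValueError("need at least two matching time/position samples")
    h = [times[i + 1] - times[i] for i in range(n)]
    if any(step <= 0 for step in h):
        raise ValueError("times must be strictly increasing")

    m = [0.0] * (n + 1)  # natural boundary: m[0] = m[n] = 0
    if n == 1:
        return m

    # Tridiagonal system for the interior accelerations m[1..n-1].
    diag = [2.0 * (h[i - 1] + h[i]) for i in range(1, n)]
    rhs = [
        6.0 * ((positions[i + 1] - positions[i]) / h[i]
               - (positions[i] - positions[i - 1]) / h[i - 1])
        for i in range(1, n)
    ]

    # Forward elimination (Thomas algorithm); off-diagonals are h[k].
    for k in range(1, n - 1):
        w = h[k] / diag[k - 1]
        diag[k] -= w * h[k]
        rhs[k] -= w * rhs[k - 1]

    # Back substitution.
    x = [0.0] * (n - 1)
    x[-1] = rhs[-1] / diag[-1]
    for k in range(n - 3, -1, -1):
        x[k] = (rhs[k] - h[k + 1] * x[k + 1]) / diag[k]

    m[1:n] = x
    return m


def min_accel_position(times, positions, m, t):
    """Evaluate the spline position at time t (clamped to the end segments)."""
    i = min(max(bisect_right(times, t) - 1, 0), len(times) - 2)
    h = times[i + 1] - times[i]
    a = times[i + 1] - t
    b = t - times[i]
    return ((m[i] * a ** 3 + m[i + 1] * b ** 3) / (6.0 * h)
            + (positions[i] / h - m[i] * h / 6.0) * a
            + (positions[i + 1] / h - m[i + 1] * h / 6.0) * b)
```

Calling `min_accel_second_derivs` once per articulator coordinate and then sampling `min_accel_position` on a time grid yields a smooth trajectory through the targets. The cost is linear in the number of targets because the system is tridiagonal.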
