Perkell Joseph S, Zandipour Majid, Matthies Melanie L, Lane Harlan
Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge 02139, USA.
J Acoust Soc Am. 2002 Oct;112(4):1627-41. doi: 10.1121/1.1506369.
This study explores the hypothesis that clear speech is produced with greater "articulatory effort" than normal speech. Kinematic and acoustic data were gathered from seven subjects as they pronounced multiple repetitions of utterances in different speaking conditions, including normal, fast, clear, and slow. Data were analyzed within a framework based on a dynamical model of single-axis frictionless movements, in which peak movement speed is used as a relative measure of articulatory effort (Nelson, 1983). There were differences in peak movement speed, distance and duration among the conditions and among the speakers. Three speakers produced the "clear" condition utterances with movements that had larger distances and durations than those for "normal" utterances. Analyses of the data within a peak speed, distance, duration "performance space" indicated increased effort (reflected in greater peak speed) in the clear condition for the three speakers, in support of the hypothesis. The remaining four speakers used other combinations of parameters to produce the clear condition. The validity of the simple dynamical model for analyzing these complex movements was considered by examining several additional parameters. Some movement characteristics differed from those required for the model-based analysis, presumably because the articulators are complicated structurally and interact with one another mechanically. More refined tests of control strategies for different speaking styles will depend on future analyses of more complicated movements with more realistic models.
本研究探讨了这样一种假设,即清晰言语的产生比正常言语需要更大的“发音努力”。从七名受试者在不同说话条件下(包括正常、快速、清晰和缓慢)重复发音多个话语时收集了运动学和声学数据。数据在基于单轴无摩擦运动动力学模型的框架内进行分析,其中峰值运动速度被用作发音努力的相对度量(纳尔逊,1983年)。在不同条件和不同说话者之间,峰值运动速度、距离和持续时间存在差异。三名说话者在“清晰”条件下发音时的运动距离和持续时间比“正常”发音时更大。在峰值速度、距离、持续时间“表现空间”内对数据进行的分析表明,对于这三名说话者,在清晰条件下努力程度增加(表现为更高的峰值速度),支持了该假设。其余四名说话者使用其他参数组合来产生清晰条件。通过检查几个额外参数来考虑用于分析这些复杂运动的简单动力学模型的有效性。一些运动特征与基于模型的分析所需的特征不同,大概是因为发音器官在结构上很复杂并且在机械上相互作用。对不同说话风格控制策略的更精细测试将取决于未来使用更现实模型对更复杂运动的分析。