Rodd Joe, Bosker Hans Rutger, Ernestus Mirjam, Alday Phillip M, Meyer Antje S, Ten Bosch Louis
Psychology of Language Department.
Centre for Language Studies.
Psychol Rev. 2020 Mar;127(2):281-304. doi: 10.1037/rev0000172. Epub 2019 Dec 30.
That speakers can vary their speaking rate is evident, but how they accomplish this has hardly been studied. Consider this analogy: When walking, speed can be continuously increased, within limits, but to speed up further, humans must run. Are there multiple qualitatively distinct speech "gaits" that resemble walking and running? Or is control achieved by continuous modulation of a single gait? This study investigates these possibilities through simulations of a new connectionist computational model of the cognitive process of speech production, EPONA, that borrows from Dell, Burger, and Svec's (1997) model. The model has parameters that can be adjusted to fit the temporal characteristics of speech at different speaking rates. We trained the model on a corpus of disyllabic Dutch words produced at different speaking rates. During training, different clusters of parameter values (regimes) were identified for different speaking rates. In a 1-gait system, the regimes used to achieve fast and slow speech are qualitatively similar, but quantitatively different. In a multiple gait system, there is no linear relationship between the parameter settings associated with each gait, resulting in an abrupt shift in parameter values to move from speaking slowly to speaking fast. After training, the model achieved good fits in all three speaking rates. The parameter settings associated with each speaking rate were not linearly related, suggesting the presence of cognitive gaits. Thus, we provide the first computationally explicit account of the ability to modulate the speech production system to achieve different speaking styles. (PsycINFO Database Record (c) 2020 APA, all rights reserved).
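The distinction the abstract draws between a 1-gait and a multiple-gait system can be pictured geometrically: if the parameter regimes for slow, medium, and fast speech lie on one straight line in parameter space, rate control looks like continuous modulation of a single gait, whereas a medium regime far from that line points to qualitatively distinct gaits. The sketch below is only an illustration of that idea. The function name `deviation_from_line` and the two-parameter regimes are hypothetical, the numbers are invented, and this is not the analysis or the parameter set used for EPONA.

```python
import math


def deviation_from_line(slow, medium, fast):
    """Distance of `medium` from the straight line through `slow` and `fast`,
    expressed as a fraction of the slow-to-fast distance.

    A value near 0 means the three regimes are (almost) linearly related.
    Hypothetical helper for illustration; not taken from the paper.
    """
    axis = [f - s for s, f in zip(slow, fast)]      # direction slow -> fast
    offset = [m - s for s, m in zip(slow, medium)]  # slow -> medium
    axis_sq = sum(a * a for a in axis)
    if axis_sq == 0:
        raise ValueError("slow and fast regimes must differ")
    # Project the medium regime onto the slow-fast line.
    t = sum(a * o for a, o in zip(axis, offset)) / axis_sq
    # What is left after removing the projection is the off-line component.
    residual = [o - t * a for a, o in zip(axis, offset)]
    return math.sqrt(sum(r * r for r in residual)) / math.sqrt(axis_sq)


# Invented two-parameter regimes, for illustration only.
one_gait = deviation_from_line([0.0, 0.0], [1.0, 1.0], [2.0, 2.0])
multi_gait = deviation_from_line([0.0, 0.0], [2.0, 0.0], [2.0, 2.0])
print(one_gait, multi_gait)
```

In this toy case the first triple is perfectly collinear, so its deviation is zero, while the second places the medium regime well off the slow-to-fast line. Real regimes are clusters of fitted values rather than single points, so any such measure would have to be judged against the spread within each cluster.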