Scharenborg Odette
Centre for Language and Speech Technology, Radboud University Nijmegen, Erasmusplein 1, 6525 HT Nijmegen, The Netherlands.
J Acoust Soc Am. 2010 Jun;127(6):3758-70. doi: 10.1121/1.3377050.
Evidence that listeners, at least in a laboratory environment, use durational cues to help resolve temporarily ambiguous speech input has accumulated over the past decades. This paper introduces Fine-Tracker, a computational model of word recognition specifically designed for "tracking" fine-phonetic information in the acoustic speech signal and using it during word recognition. Two simulations were carried out using real speech as input to the model. The simulations showed that the Fine-Tracker, as has been found for humans, benefits from durational information during word recognition, and uses it to disambiguate the incoming speech signal. The availability of durational information allows the computational model to distinguish embedded words from their matrix words (first simulation), and to distinguish word final realizations of [s] from word initial realizations (second simulation). Fine-Tracker thus provides the first computational model of human word recognition that is able to extract durational information from the speech signal and to use it to differentiate words.
在过去几十年里,有证据表明,至少在实验室环境中,听众会利用时长线索来帮助解析暂时模糊的语音输入。本文介绍了精细追踪器(Fine-Tracker),这是一种专门为在声学语音信号中“追踪”精细语音信息并在单词识别过程中使用该信息而设计的单词识别计算模型。使用真实语音作为模型输入进行了两次模拟。模拟结果表明,精细追踪器与人类一样,在单词识别过程中受益于时长信息,并利用它来消除传入语音信号的歧义。时长信息的可用性使计算模型能够将嵌入词与其矩阵词区分开来(第一次模拟),并将[s]的词尾实现与词首实现区分开来(第二次模拟)。因此,精细追踪器提供了第一个能够从语音信号中提取时长信息并利用它来区分单词的人类单词识别计算模型。