Norris Dennis, McQueen James M
Medical Research Council, Cognition and Brain Sciences Unit, Cambridge, UK.
Psychol Rev. 2008 Apr;115(2):357-95. doi: 10.1037/0033-295X.115.2.357.
A Bayesian model of continuous speech recognition is presented. It is based on Shortlist (D. Norris, 1994; D. Norris, J. M. McQueen, A. Cutler, & S. Butterfield, 1997) and shares many of its key assumptions: parallel competitive evaluation of multiple lexical hypotheses, phonologically abstract prelexical and lexical representations, a feedforward architecture with no online feedback, and a lexical segmentation algorithm based on the viability of chunks of the input as possible words. Shortlist B is radically different from its predecessor in two respects. First, whereas Shortlist was a connectionist model based on interactive-activation principles, Shortlist B is based on Bayesian principles. Second, the input to Shortlist B is no longer a sequence of discrete phonemes; it is a sequence of multiple phoneme probabilities over 3 time slices per segment, derived from the performance of listeners in a large-scale gating study. Simulations are presented showing that the model can account for key findings: data on the segmentation of continuous speech, word frequency effects, the effects of mispronunciations on word recognition, and evidence on lexical involvement in phonemic decision making. The success of Shortlist B suggests that listeners make optimal Bayesian decisions during spoken-word recognition.
本文提出了一种连续语音识别的贝叶斯模型。它基于Shortlist(D.诺里斯,1994;D.诺里斯、J.M.麦奎因、A.卡特勒和S.巴特菲尔德,1997),并共享其许多关键假设:对多个词汇假设进行并行竞争评估、语音学抽象的词前和词汇表征、无在线反馈的前馈架构,以及基于输入片段作为可能单词的可行性的词汇分割算法。Shortlist B在两个方面与其前身有根本不同。首先,Shortlist是一个基于交互激活原则的联结主义模型,而Shortlist B基于贝叶斯原则。其次,Shortlist B的输入不再是离散音素序列;它是每个片段在3个时间片上的多个音素概率序列,该序列源自大规模门控研究中听众的表现。文中给出的模拟结果表明,该模型能够解释关键发现:连续语音分割数据、词频效应、发音错误对单词识别的影响,以及词汇参与音素决策的证据。Shortlist B的成功表明,听众在口语单词识别过程中做出了最优的贝叶斯决策。