Department of Linguistics, University of Pennsylvania.
Department of Cognitive Science, Johns Hopkins University.
Psychol Sci. 2021 Mar;32(3):410-423. doi: 10.1177/0956797620968787. Epub 2021 Feb 22.
What happens to an acoustic signal after it enters the mind of a listener? Previous work has demonstrated that listeners maintain intermediate representations over time. However, the internal structure of such representations-be they the acoustic-phonetic signal or more general information about the probability of possible categories-remains underspecified. We present two experiments using a novel speaker-adaptation paradigm aimed at uncovering the format of speech representations. We exposed adult listeners ( = 297) to a speaker whose utterances contained acoustically ambiguous information concerning phones (and thus words), and we manipulated the temporal availability of disambiguating cues via visually presented text (presented before or after each utterance). Results from a traditional phoneme-categorization task showed that listeners adapted to a modified acoustic distribution when disambiguating text was provided before but not after the audio. These results support the position that speech representations consist of activation over categories and are inconsistent with direct maintenance of the acoustic-phonetic signal.
当一个声学信号进入听众的大脑后会发生什么?先前的研究已经证明,听众会随着时间的推移保持中间表示。然而,这种表示的内部结构——无论是声学语音信号还是关于可能类别概率的更一般信息——仍然没有具体说明。我们提出了两个使用新的说话人适应范式的实验,旨在揭示语音表示的格式。我们让成年听众(n=297)接触一个说话人,其话语包含有关电话(也就是单词)的声学歧义信息,并且我们通过视觉呈现的文本(在每个话语之前或之后呈现)来操纵可消除歧义线索的时间可用性。来自传统的音素分类任务的结果表明,当提供消除歧义的文本时,听众会适应修改后的声学分布,但在音频之后则不会。这些结果支持了这样一种观点,即语音表示由类别上的激活组成,并且与对声学语音信号的直接维持不一致。