Lee Yoonjeong, Kaiser Elsi, Goldstein Louis
Department of Head and Neck Surgery, University of California, USA.
Department of Linguistics, University of Southern California, USA.
Lang Speech. 2020 Sep;63(3):526-549. doi: 10.1177/0023830919866870. Epub 2019 Aug 6.
This study uses a response mouse-tracking paradigm to examine the role of sub-phonemic information in online lexical ambiguity resolution of continuous speech. We examine listeners' sensitivity to the sub-phonemic information that is specific to the ambiguous internal open juncture /s/-stop sequences in American English (e.g., "" vs. ""), that is, voice onset time (VOT) indicating different degrees of aspiration (e.g., long VOT for "" vs. short VOT for "") in connected speech contexts. A cross-splicing method was used to create two-word sequences (e.g., "" or "") with matching VOTs (long for ""; short for "") or mismatching VOTs ( for ""; for ""). Participants (n = 20) heard the two-word sequences while looking at computer displays with the second word in the left/right corner ("" and ""). Listeners' click responses and mouse movement trajectories were recorded. Click responses show significant effects of VOT manipulation, while mouse trajectories do not. Our results show that stop-release information, whether temporal or spectral, can (mis)guide listeners' interpretation of the possible location of a word boundary between /s/ and a following stop, even when other aspects of the acoustic signal (e.g., duration of /s/) point to the alternative segmentation. Taken together, our results suggest that segmentation and lexical access are highly attuned to bottom-up phonetic information; they have implications for a model of spoken language recognition with position-specific representations available at the prelexical level, and they also allude to the possibility that detailed phonetic information may be stored in listeners' lexicons.