Horev Nitza, Most Tova, Pratt Hillel
Evoked Potentials Laboratory, Technion-Israel Institute of Technology, Haifa, Israel.
Ear Hear. 2007 Feb;28(1):111-28. doi: 10.1097/01.aud.0000250021.69163.96.
To determine whether voicing perception is shaped primarily by linguistic experience or by innate temporal sensitivity to voicing boundaries, by examining the behavioral and electrophysiological correlates of categorical perception of speech voice-onset time (VOT) and nonspeech formant-onset time (FOT).
Behavioral measures and auditory event-related potentials (ERPs) were obtained from 14 normal-hearing speakers of Hebrew, a language whose voicing distinction differs from that of English, during identification and discrimination of two sets of stimuli: a VOT continuum, created by editing natural productions of /ba/ and /pa/, and an analogous nonspeech continuum, composed of two synthesized formants varying in their onset time (FOT).
VOT and FOT continua yielded similar behavioral identification curves. Differences between the two stimulus types were found in discrimination of within-category differences and in reaction time effects. During identification and discrimination tasks, ERPs were affected differently by the VOT and FOT values of the stimulus: VOT value had a significant effect on N1 latency and on N1 and P2 amplitudes, whereas FOT value had a significant effect only on P2 amplitude. Additionally, during identification tasks, all speech signals evoked a P3 regardless of overt categorization, whereas among the nonspeech stimuli only the perceptually "rare" one (+15 msec FOT) evoked a P3.
Voicing boundaries corresponded to the VOT values of Hebrew production, suggesting that voicing perception in Hebrew is mediated mainly by linguistic experience rather than by innate temporal sensitivity. ERP responses to VOT and FOT stimuli differed as early as N1, indicating that brain processing of the temporal aspects of speech and nonspeech signals differs from the early stages onward. Further studies are warranted to establish the neural response patterns to voicing in speakers of languages whose voicing categories differ from those of English.