Myers Emily B, Theodore Rachel M
University of Connecticut, Department of Speech, Language, and Hearing Sciences, 850 Bolton Road, Unit 1085, Storrs, CT 06269-1085, United States; University of Connecticut, Department of Psychological Sciences, 406 Babbidge Road, Unit 1020, Storrs, CT 06269-1020, United States; Haskins Laboratories, 300 George Street, Suite 900, New Haven, CT 06511, United States; Connecticut Institute for the Brain and Cognitive Sciences, 337 Mansfield Road, Unit 1272, Storrs, CT 06269-1085, United States.
University of Connecticut, Department of Speech, Language, and Hearing Sciences, 850 Bolton Road, Unit 1085, Storrs, CT 06269-1085, United States; Haskins Laboratories, 300 George Street, Suite 900, New Haven, CT 06511, United States; Connecticut Institute for the Brain and Cognitive Sciences, 337 Mansfield Road, Unit 1272, Storrs, CT 06269-1085, United States.
Brain Lang. 2017 Feb;165:33-44. doi: 10.1016/j.bandl.2016.11.001. Epub 2016 Nov 27.
The speech stream simultaneously carries information about talker identity and linguistic content, and the same acoustic property (e.g., voice-onset-time, or VOT) may be used for both purposes. Separable neural networks for processing talker identity and phonetic content have been identified, but it is unclear how a singular acoustic property is parsed by the neural system for talker identification versus phonetic processing. In the current study, listeners were exposed to two talkers with characteristically different VOTs. Subsequently, brain activation was measured using fMRI as listeners performed a phonetic categorization task on these stimuli. Right temporoparietal regions previously implicated in talker identification showed sensitivity to the match between VOT variant and talker, whereas left posterior temporal regions showed sensitivity to the typicality of phonetic exemplars, regardless of talker typicality. Taken together, these results suggest that neural systems for voice recognition capture talker-specific phonetic variation.
语音流同时携带有关说话者身份和语言内容的信息,并且相同的声学属性(例如,语音起始时间,或VOT)可用于这两个目的。已经确定了用于处理说话者身份和语音内容的可分离神经网络,但尚不清楚神经系统如何针对说话者识别与语音处理来解析单一的声学属性。在当前的研究中,让听众接触具有特征性不同VOT的两个说话者。随后,当听众对这些刺激执行语音分类任务时,使用功能磁共振成像(fMRI)测量大脑激活情况。先前与说话者识别有关的右侧颞顶区域显示出对VOT变体与说话者之间匹配的敏感性,而左侧颞后区域显示出对语音样本典型性的敏感性,无论说话者的典型性如何。综上所述,这些结果表明用于语音识别的神经系统捕捉到了特定于说话者的语音变化。