Department of Neurological Surgery, Department of Physiology, and Center for Integrative Neuroscience, University of California, San Francisco, CA 94143, USA.
Science. 2014 Feb 28;343(6174):1006-10. doi: 10.1126/science.1245994. Epub 2014 Jan 30.
During speech perception, linguistic elements such as consonants and vowels are extracted from a complex acoustic speech signal. The superior temporal gyrus (STG) participates in high-order auditory processing of speech, but how it encodes phonetic information is poorly understood. We used high-density direct cortical surface recordings in humans while they listened to natural, continuous speech to reveal the STG representation of the entire English phonetic inventory. At single electrodes, we found response selectivity to distinct phonetic features. Encoding of acoustic properties was mediated by a distributed population response. Phonetic features could be directly related to tuning for spectrotemporal acoustic cues, some of which were encoded in a nonlinear fashion or by integration of multiple cues. These findings demonstrate the acoustic-phonetic representation of speech in human STG.