Cognitive Neuroimaging Unit, NeuroSpin Center, Gif/Yvette, France; Sagol School of Neuroscience, Tel-Aviv University, Tel-Aviv, Israel; Language and Brain Lab, Sagol School of Neuroscience and School of Education, Tel-Aviv University, Tel-Aviv, Israel.
Sagol School of Neuroscience, Tel-Aviv University, Tel-Aviv, Israel; Department of Psychology and Center for Neural Science, New York University, New York, NY, United States.
Neuroimage. 2021 Feb 1;226:117499. doi: 10.1016/j.neuroimage.2020.117499. Epub 2020 Oct 24.
One of the central tasks of the human auditory system is to extract sound features from incoming acoustic signals that are most critical for speech perception. Specifically, phonological features and phonemes are the building blocks for more complex linguistic entities, such as syllables, words and sentences. Previous ECoG and EEG studies showed that various regions in the superior temporal gyrus (STG) exhibit selective responses to specific phonological features. However, electrical activity recorded by ECoG or EEG grids reflects average responses of large neuronal populations and is therefore limited in providing insights into activity patterns of single neurons. Here, we recorded spiking activity from 45 units in the STG from six neurosurgical patients who performed a listening task with phoneme stimuli. Fourteen units showed significant responsiveness to the stimuli. Using a Naïve-Bayes model, we find that single-cell responses to phonemes are governed by manner-of-articulation features and are organized according to sonority with two main clusters for sonorants and obstruents. We further find that 'neural similarity' (i.e. the similarity of evoked spiking activity between pairs of phonemes) is comparable to the 'perceptual similarity' (i.e. to what extent two phonemes are judged as sounding similar) based on perceptual confusion, assessed behaviorally in healthy subjects. Thus, phonemes that were perceptually similar also had similar neural responses. Taken together, our findings indicate that manner-of-articulation is the dominant organization dimension of phoneme representations at the single-cell level, suggesting a remarkable consistency across levels of analyses, from the single neuron level to that of large neuronal populations and behavior.
人类听觉系统的核心任务之一是从传入的声学信号中提取对语音感知最重要的声音特征。具体来说,音韵特征和音位是更复杂语言实体(如音节、单词和句子)的构建块。以前的 ECoG 和 EEG 研究表明,上颞回(STG)的各个区域对特定的音韵特征表现出选择性反应。然而,ECoG 或 EEG 网格记录的电活动反映了大量神经元群体的平均反应,因此在提供对单个神经元活动模式的见解方面存在局限性。在这里,我们从六名接受神经外科手术的患者的 STG 中记录了 45 个单位的尖峰活动,这些患者在进行语音刺激的听力任务时。有 14 个单位对刺激表现出显著的反应。使用朴素贝叶斯模型,我们发现,对音位的单细胞反应受发音方式特征的控制,并根据响亮度进行组织,有两个主要的响音和闭塞音簇。我们进一步发现,“神经相似性”(即音位之间诱发的尖峰活动的相似性)与“感知相似性”(即两个音位听起来相似的程度)相当,基于健康受试者的行为评估的感知混淆。因此,感知上相似的音位也具有相似的神经反应。总的来说,我们的发现表明,发音方式是音位表示在单细胞水平上的主要组织维度,这表明在从单个神经元水平到大型神经元群体和行为的分析水平上存在显著的一致性。