Department of Psychology, Rutgers, The State University of New Jersey, Piscataway, NJ, 08854, USA.
Sci Rep. 2023 Jul 10;13(1):11172. doi: 10.1038/s41598-023-37982-5.
Vocal communication requires the formation of acoustic categories to enable invariant representations of sounds despite superficial variations. Humans form acoustic categories for speech phonemes, enabling the listener to recognize words independent of speakers; animals can also discriminate speech phonemes. We investigated the neural mechanisms of this process using electrophysiological recordings from the zebra finch secondary auditory area, caudomedial nidopallium (NCM), during passive exposure to human speech stimuli consisting of two naturally spoken words produced by multiple speakers. Analysis of neural distance and decoding accuracy showed improvements in neural discrimination between word categories over the course of exposure, and this improved representation transferred to the same words by novel speakers. We conclude that NCM neurons formed generalized representations of word categories independent of speaker-specific variations that became more refined over the course of passive exposure. The discovery of this dynamic encoding process in NCM suggests a general processing mechanism for forming categorical representations of complex acoustic signals that humans share with other animals.
声音交流需要形成声学范畴,以便在不考虑表面变化的情况下对声音进行不变的表示。人类会对言语音素形成声学范畴,从而使听众能够独立于说话者识别单词;动物也可以区分言语音素。我们使用电生理学记录技术,在斑马雀的次要听觉区、尾侧中纹状体(NCM)中进行研究,在被动暴露于人类语音刺激期间,这些刺激由多个说话者说出的两个自然说出的单词组成。对神经距离和解码准确性的分析表明,在暴露过程中,神经对单词范畴的区分能力有所提高,并且这种改进的表示可以转移到由新说话者说出的相同单词上。我们得出的结论是,NCM 神经元形成了与说话者特定变化无关的、通用的单词范畴表示,并且在被动暴露过程中变得更加精细。在 NCM 中发现的这种动态编码过程表明,对于形成人类与其他动物共有的复杂声音信号的范畴表示,存在一种通用的处理机制。