Redwood Center for Theoretical Neuroscience, University of California, Berkeley, California, United States of America.
PLoS Comput Biol. 2012;8(7):e1002594. doi: 10.1371/journal.pcbi.1002594. Epub 2012 Jul 12.
We have developed a sparse mathematical representation of speech that minimizes the number of active model neurons needed to represent typical speech sounds. The model learns several well-known acoustic features of speech such as harmonic stacks, formants, onsets and terminations, but we also find more exotic structures in the spectrogram representation of sound such as localized checkerboard patterns and frequency-modulated excitatory subregions flanked by suppressive sidebands. Moreover, several of these novel features resemble neuronal receptive fields reported in the Inferior Colliculus (IC), as well as auditory thalamus and cortex, and our model neurons exhibit the same tradeoff in spectrotemporal resolution as has been observed in IC. To our knowledge, this is the first demonstration that receptive fields of neurons in the ascending mammalian auditory pathway beyond the auditory nerve can be predicted based on coding principles and the statistical properties of recorded sounds.
我们已经开发出一种语音的稀疏数学表示,它可以最小化表示典型语音所需的活动模型神经元的数量。该模型学习了语音的几个众所周知的声学特征,如谐波堆栈、共振峰、起始和终止,但我们也在声音的频谱图表示中发现了更奇特的结构,如局部棋盘格模式和受抑制侧带环绕的调频兴奋子区域。此外,其中一些新特征类似于下丘(IC)以及听觉丘脑和皮层中报告的神经元感受野,我们的模型神经元在频谱时间分辨率上表现出与 IC 中观察到的相同的权衡。据我们所知,这是首次证明基于编码原理和记录声音的统计特性,可以预测超越听神经的哺乳动物上行听觉通路中神经元的感受野。