Department of Complexity Science and Engineering, The University of Tokyo, Japan.
Network. 2009;20(4):253-67. doi: 10.3109/09548980903447751.
Sparse coding and its related theories have been successful to explain various response properties of early stages of sensory information processing such as primary visual cortex and peripheral auditory system, which suggests that the emergence of such properties results from adaptation of the nerve system to natural stimuli. The present study continues this line of research in a higher stage of auditory processing, focusing on harmonic structures that are often found in behaviourally important natural sound like animal vocalization. It has been physiologically shown that monkey primary auditory cortices (A1) have neurons with response properties capturing such harmonic structures: their response and modulation peaks are often found at frequencies that are harmonically related to each other. We hypothesize that such relations emerge from sparse coding of harmonic natural sounds. Our simulation shows that similar harmonic relations emerge from frequency-domain sparse codes of harmonic sounds, namely, piano performance and human speech. Moreover, the modulatory behaviours can be explained by competitive interactions of model neurons that capture partially common harmonic structures.
稀疏编码及其相关理论成功地解释了早期感觉信息处理(如初级视觉皮层和外围听觉系统)的各种反应特性,这表明这些特性的出现是神经系统对自然刺激的适应。本研究在听觉处理的更高阶段继续这一研究,重点是在行为上重要的自然声音(如动物叫声)中经常出现的谐波结构。生理学研究表明,猴子初级听觉皮层(A1)中的神经元具有捕捉这种谐波结构的反应特性:它们的反应和调制峰值通常出现在彼此谐波相关的频率上。我们假设这种关系是从谐波自然声音的稀疏编码中产生的。我们的模拟表明,类似的谐波关系也出现在谐波声音的频域稀疏编码中,即钢琴演奏和人类语音。此外,调制行为可以用捕获部分共同谐波结构的模型神经元的竞争相互作用来解释。