Max Planck Institute for Biological Cybernetics and Bernstein Center for Computational Neuroscience, 72076 Tübingen, Germany.
Proc Natl Acad Sci U S A. 2014 Apr 22;111(16):6104-8. doi: 10.1073/pnas.1322705111. Epub 2014 Apr 7.
Human perception, cognition, and action are laced with seemingly arbitrary mappings. In particular, sound has a strong spatial connotation: Sounds are high and low, melodies rise and fall, and pitch systematically biases perceived sound elevation. The origins of such mappings are unknown. Are they the result of physiological constraints, do they reflect natural environmental statistics, or are they truly arbitrary? We recorded natural sounds from the environment, analyzed the elevation-dependent filtering of the outer ear, and measured frequency-dependent biases in human sound localization. We find that auditory scene statistics reveals a clear mapping between frequency and elevation. Perhaps more interestingly, this natural statistical mapping is tightly mirrored in both ear-filtering properties and in perceived sound location. This suggests that both sound localization behavior and ear anatomy are fine-tuned to the statistics of natural auditory scenes, likely providing the basis for the spatial connotation of human hearing.
人类的感知、认知和行为都交织着看似随意的映射。特别是,声音具有很强的空间内涵:声音有高有低,旋律有升有降,音高系统地影响感知声音的高度。这些映射的起源尚不清楚。它们是生理限制的结果,反映了自然环境的统计数据,还是真的很随意?我们记录了来自环境的自然声音,分析了外耳的高度相关滤波,并测量了人类声音定位中与频率相关的偏差。我们发现,听觉场景统计数据揭示了频率和高度之间的清晰映射。也许更有趣的是,这种自然的统计映射在耳滤波特性和感知声音位置上都得到了紧密的反映。这表明,声音定位行为和耳朵解剖结构都经过了精细调整,以适应自然听觉场景的统计数据,这可能为人类听觉的空间内涵提供了基础。