Department of Electrical Engineering, Technical University of Denmark, Kgs. Lyngby 2800, Denmark; Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Program in Speech and Hearing Biosciences and Technology, Harvard University, Cambridge, MA 02138, USA.
Curr Biol. 2018 May 7;28(9):1405-1418.e10. doi: 10.1016/j.cub.2018.03.049. Epub 2018 Apr 19.
To overcome variability, estimate scene characteristics, and compress sensory input, perceptual systems pool data into statistical summaries. Despite growing evidence for statistical representations in perception, the underlying mechanisms remain poorly understood. One example of such representations occurs in auditory scenes, where background texture appears to be represented with time-averaged sound statistics. We probed the averaging mechanism using "texture steps"-textures containing subtle shifts in stimulus statistics. Although generally imperceptible, steps occurring in the previous several seconds biased texture judgments, indicative of a multi-second averaging window. Listeners seemed unable to willfully extend or restrict this window but showed signatures of longer integration times for temporally variable textures. In all cases the measured timescales were substantially longer than previously reported integration times in the auditory system. Integration also showed signs of being restricted to sound elements attributed to a common source. The results suggest an integration process that depends on stimulus characteristics, integrating over longer extents when it benefits statistical estimation of variable signals and selectively integrating stimulus components likely to have a common cause in the world. Our methodology could be naturally extended to examine statistical representations of other types of sensory signals.
为了克服变异性、估计场景特征和压缩感觉输入,感知系统将数据汇总到统计摘要中。尽管越来越多的证据表明感知中存在统计表示,但潜在的机制仍未得到很好的理解。这种表示的一个例子发生在听觉场景中,其中背景纹理似乎是用时间平均的声音统计数据来表示的。我们使用“纹理步长”来探测平均机制,这些纹理包含刺激统计数据的微妙变化。尽管通常是不可察觉的,但在前几秒钟内发生的步骤会影响纹理判断,表明存在一个多秒的平均窗口。听众似乎无法有意地扩展或限制这个窗口,但对于时间变化的纹理,表现出更长的整合时间的特征。在所有情况下,测量的时间尺度都远远长于以前在听觉系统中报告的整合时间。整合也表现出限制在归因于共同来源的声音元素上的迹象。结果表明,存在一种依赖于刺激特征的整合过程,当它有利于对可变信号的统计估计时,会在更长的时间内进行整合,并选择性地整合可能在现实世界中有共同原因的刺激成分。我们的方法可以自然地扩展到检查其他类型的感觉信号的统计表示。