Chen Bo, Perona Pietro
J Vis. 2015;15(16):9. doi: 10.1167/15.16.9.
Searching for objects among clutter is a key ability of the visual system. Speed and accuracy are the crucial performance criteria. How can the brain trade off these competing quantities for optimal performance in different tasks? Can a network of spiking neurons carry out such computations, and what is its architecture? We propose a new model that takes input from V1-type orientation-selective spiking neurons and detects a target in the shortest time that is compatible with a given acceptable error rate. Subject to the assumption that the output of the primary visual cortex comprises Poisson neurons with known properties, our model is an ideal observer. The model has only five free parameters: the signal-to-noise ratio in a hypercolumn, the costs of false-alarm and false-reject errors versus the cost of time, and two parameters accounting for nonperceptual delays. Our model postulates two gain-control mechanisms--one local to hypercolumns and one global to the visual field--to handle variable scene complexity. Error rate and response time predictions match psychophysics data as we vary stimulus discriminability, scene complexity, and the uncertainty associated with each of these quantities. A five-layer spiking network closely approximates the optimal model, suggesting that known cortical mechanisms are sufficient for implementing visual search efficiently.
在杂乱环境中搜索目标是视觉系统的一项关键能力。速度和准确性是至关重要的性能标准。大脑如何在不同任务中权衡这些相互竞争的因素以实现最优性能?一个脉冲神经元网络能否执行这样的计算,其架构又是怎样的?我们提出了一种新模型,该模型从V1型方向选择性脉冲神经元获取输入,并在与给定可接受错误率兼容的最短时间内检测到目标。基于初级视觉皮层的输出由具有已知特性的泊松神经元组成这一假设,我们的模型是一个理想观察者。该模型仅有五个自由参数:超柱中的信噪比、误报和漏报错误的代价与时间代价的对比,以及两个用于解释非感知延迟的参数。我们的模型假定了两种增益控制机制——一种是超柱局部的,一种是视野全局的——来处理可变的场景复杂性。当我们改变刺激可辨别性、场景复杂性以及与这些量相关的不确定性时,错误率和反应时间预测与心理物理学数据相匹配。一个五层脉冲网络紧密近似最优模型,这表明已知的皮层机制足以有效地实现视觉搜索。