Department of Neurophysics, Institute for Theoretical Physics, University of Bremen, Bremen, Germany.
PLoS Comput Biol. 2012;8(5):e1002520. doi: 10.1371/journal.pcbi.1002520. Epub 2012 May 24.
For processing and segmenting visual scenes, the brain is required to combine a multitude of features and sensory channels. It is neither known if these complex tasks involve optimal integration of information, nor according to which objectives computations might be performed. Here, we investigate if optimal inference can explain contour integration in human subjects. We performed experiments where observers detected contours of curvilinearly aligned edge configurations embedded into randomly oriented distractors. The key feature of our framework is to use a generative process for creating the contours, for which it is possible to derive a class of ideal detection models. This allowed us to compare human detection for contours with different statistical properties to the corresponding ideal detection models for the same stimuli. We then subjected the detection models to realistic constraints and required them to reproduce human decisions for every stimulus as well as possible. By independently varying the four model parameters, we identify a single detection model which quantitatively captures all correlations of human decision behaviour for more than 2000 stimuli from 42 contour ensembles with greatly varying statistical properties. This model reveals specific interactions between edges closely matching independent findings from physiology and psychophysics. These interactions imply a statistics of contours for which edge stimuli are indeed optimally integrated by the visual system, with the objective of inferring the presence of contours in cluttered scenes. The recurrent algorithm of our model makes testable predictions about the temporal dynamics of neuronal populations engaged in contour integration, and it suggests a strong directionality of the underlying functional anatomy.
为了处理和分割视觉场景,大脑需要将多种特征和感觉通道结合起来。目前还不清楚这些复杂任务是否涉及信息的最优整合,也不清楚计算是根据哪些目标进行的。在这里,我们研究了最优推理是否可以解释人类受试者的轮廓整合。我们进行了实验,让观察者检测嵌入在随机定向干扰器中的曲线排列边缘配置的轮廓。我们的框架的关键特征是使用生成过程来创建轮廓,对于该生成过程,可以推导出一类理想的检测模型。这使我们能够将具有不同统计特性的轮廓的人类检测与相同刺激的相应理想检测模型进行比较。然后,我们对检测模型施加现实约束,并要求它们尽可能地复制人类对每个刺激的决策。通过独立地改变四个模型参数,我们确定了一个单一的检测模型,该模型可以定量地捕捉到 42 个轮廓集合中超过 2000 个刺激的人类决策行为的所有相关性,这些集合具有极大不同的统计特性。该模型揭示了边缘之间的特定相互作用,这些相互作用与生理学和心理物理学的独立发现非常吻合。这些相互作用意味着轮廓的统计特性确实是由视觉系统最优地整合的,其目标是推断出杂乱场景中轮廓的存在。我们模型的递归算法对参与轮廓整合的神经元群体的时间动态做出了可测试的预测,并暗示了潜在功能解剖结构的强烈方向性。