Computational Neuroscience Laboratory, The Salk Institute La Jolla, CA, USA.
Front Hum Neurosci. 2011 Aug 9;5:78. doi: 10.3389/fnhum.2011.00078. eCollection 2011.
Incompatible images presented to the two eyes lead to perceptual oscillations in which one image at a time is visible. Early models portrayed this binocular rivalry as involving reciprocal inhibition between monocular representations of images, occurring at an early visual stage prior to binocular mixing. However, psychophysical experiments found conditions where rivalry could also occur at a higher, more abstract level of representation. In those cases, the rivalry was between image representations dissociated from eye-of-origin information, rather than between monocular representations from the two eyes. Moreover, neurophysiological recordings found the strongest rivalry correlate in inferotemporal cortex, a high-level, predominantly binocular visual area involved in object recognition, rather than early visual structures. An unresolved issue is how can the separate identities of the two images be maintained after binocular mixing in order for rivalry to be possible at higher levels? Here we demonstrate that after the two images are mixed, they can be unmixed at any subsequent stage using a physiologically plausible non-linear signal-processing algorithm, non-negative matrix factorization, previously proposed for parsing object parts during object recognition. The possibility that unmixed left and right images can be regenerated at late stages within the visual system provides a mechanism for creating various binocular representations and interactions de novo in different cortical areas for different purposes, rather than inheriting then from early areas. This is a clear example how non-linear algorithms can lead to highly non-intuitive behavior in neural information processing.
向两只眼睛呈现不兼容的图像会导致知觉振荡,在这种情况下,一次只能看到一个图像。早期的模型将这种双眼竞争描述为图像的单眼表示之间的相互抑制,发生在双眼混合之前的早期视觉阶段。然而,心理物理学实验发现,竞争也可能发生在更高、更抽象的表示水平。在这些情况下,竞争是在与眼起源信息分离的图像表示之间发生的,而不是在来自两只眼睛的单眼表示之间发生的。此外,神经生理学记录发现,在参与物体识别的高级、主要是双眼视觉区域——颞下皮质中,存在最强的竞争相关物,而不是早期的视觉结构。一个悬而未决的问题是,在双眼混合之后,两个图像的单独身份如何能够保持,以便在更高的水平上进行竞争?在这里,我们证明,在两个图像混合之后,可以使用一种生理上合理的非线性信号处理算法——非负矩阵分解,在任何后续阶段将它们重新分离,该算法之前被提议用于在物体识别期间解析物体部分。在视觉系统的晚期阶段可以重新生成未混合的左右图像的可能性为在不同的皮层区域为不同的目的创建各种双眼表示和交互提供了一种机制,而不是从早期区域继承它们。这是一个明显的例子,说明非线性算法如何导致神经信息处理中非常非直观的行为。