Grossberg S
Department of Cognitive and Neural Systems, Boston University, Massachusetts 02215.
Percept Psychophys. 1994 Jan;55(1):48-121. doi: 10.3758/bf03206880.
A neural network theory of three-dimensional (3-D) vision, called FACADE theory, is described. The theory proposes a solution of the classical figure-ground problem for biological vision. It does so by suggesting how boundary representations and surface representations are formed within a boundary contour system (BCS) and a feature contour system (FCS). The BCS and FCS interact reciprocally to form 3-D boundary and surface representations that are mutually consistent. Their interactions generate 3-D percepts wherein occluding and occluded object parts are separated, completed, and grouped. The theory clarifies how preattentive processes of 3-D perception and figure-ground separation interact reciprocally with attentive processes of spatial localization, object recognition, and visual search. A new theory of stereopsis is proposed that predicts how cells sensitive to multiple spatial frequencies, disparities, and orientations are combined by context-sensitive filtering, competition, and cooperation to form coherent BCS boundary segmentations. Several factors contribute to figure-ground pop-out, including: boundary contrast between spatially contiguous boundaries, whether due to scenic differences in luminance, color, spatial frequency, or disparity; partially ordered interactions from larger spatial scales and disparities to smaller scales and disparities; and surface filling-in restricted to regions surrounded by a connected boundary. Phenomena such as 3-D pop-out from a 2-D picture, Da Vinci stereopsis, 3-D neon color spreading, completion of partially occluded objects, and figure-ground reversals are analyzed. The BCS and FCS subsystems model aspects of how the two parvocellular cortical processing streams that join the lateral geniculate nucleus to prestriate cortical area V4 interact to generate a multiplexed representation of Form-And-Color-And-DEpth, or FACADE, within area V4. Area V4 is suggested to support figure-ground separation and to interact with cortical mechanisms of spatial attention, attentive object learning, and visual search. Adaptive resonance theory (ART) mechanisms model aspects of how prestriate visual cortex interacts reciprocally with a visual object recognition system in inferotemporal (IT) cortex for purposes of attentive object learning and categorization. Object attention mechanisms of the What cortical processing stream through IT cortex are distinguished from spatial attention mechanisms of the Where cortical processing stream through parietal cortex. Parvocellular BCS and FCS signals interact with the model What stream. Parvocellular FCS and magnocellular motion BCS signals interact with the model Where stream.(ABSTRACT TRUNCATED AT 400 WORDS)
本文描述了一种名为“FACADE理论”的三维视觉神经网络理论。该理论提出了一种针对生物视觉中经典的图形-背景问题的解决方案。它通过提出边界轮廓系统(BCS)和特征轮廓系统(FCS)中边界表示和表面表示是如何形成的来做到这一点。BCS和FCS相互作用,形成相互一致的三维边界和表面表示。它们的相互作用产生三维感知,其中遮挡和被遮挡的物体部分被分离、补全并分组。该理论阐明了三维感知和图形-背景分离的前注意过程如何与空间定位、物体识别和视觉搜索的注意过程相互作用。提出了一种新的立体视觉理论,该理论预测了对多个空间频率、视差和方向敏感的细胞如何通过上下文敏感滤波、竞争和合作组合在一起,以形成连贯的BCS边界分割。有几个因素促成了图形-背景凸显,包括:空间相邻边界之间的边界对比度,无论是由于亮度、颜色、空间频率还是视差的场景差异;从较大空间尺度和视差到较小尺度和视差的部分有序相互作用;以及仅限于由连接边界包围的区域的表面填充。分析了诸如从二维图片中弹出的三维效果、达·芬奇立体视觉、三维霓虹色扩散、部分遮挡物体的补全以及图形-背景反转等现象。BCS和FCS子系统模拟了两条小细胞皮层处理流(它们将外侧膝状体核与纹前皮层区域V4相连)如何相互作用,以在区域V4内生成形式-颜色-深度(FACADE)的多重表示。区域V4被认为支持图形-背景分离,并与空间注意、注意物体学习和视觉搜索的皮层机制相互作用。自适应共振理论(ART)机制模拟了纹前视觉皮层如何与颞下(IT)皮层中的视觉物体识别系统相互作用,以进行注意物体学习和分类。通过IT皮层的“什么”皮层处理流的物体注意机制与通过顶叶皮层的“哪里”皮层处理流的空间注意机制不同。小细胞BCS和FCS信号与模型中的“什么”流相互作用。小细胞FCS和大细胞运动BCS信号与模型中的“哪里”流相互作用。(摘要截选至400字)