Department of Physiology and Pharmacology, State University of New York Downstate Medical Center, Brooklyn, New York, USA.
PLoS One. 2012;7(11):e48216. doi: 10.1371/journal.pone.0048216. Epub 2012 Nov 5.
Hierarchical generative models, such as Bayesian networks, and belief propagation have been shown to provide a theoretical framework that can account for perceptual processes, including feedforward recognition and feedback modulation. The framework explains both psychophysical and physiological experimental data and maps well onto the hierarchical distributed cortical anatomy. However, the complexity required to model cortical processes makes inference, even using approximate methods, very computationally expensive. Thus, existing object perception models based on this approach are typically limited to tree-structured networks with no loops, use small toy examples or fail to account for certain perceptual aspects such as invariance to transformations or feedback reconstruction. In this study we develop a Bayesian network with an architecture similar to that of HMAX, a biologically-inspired hierarchical model of object recognition, and use loopy belief propagation to approximate the model operations (selectivity and invariance). Crucially, the resulting Bayesian network extends the functionality of HMAX by including top-down recursive feedback. Thus, the proposed model not only achieves successful feedforward recognition invariant to noise, occlusions, and changes in position and size, but is also able to reproduce modulatory effects such as illusory contour completion and attention. Our novel and rigorous methodology covers key aspects such as learning using a layerwise greedy algorithm, combining feedback information from multiple parents and reducing the number of operations required. Overall, this work extends an established model of object recognition to include high-level feedback modulation, based on state-of-the-art probabilistic approaches. The methodology employed, consistent with evidence from the visual cortex, can be potentially generalized to build models of hierarchical perceptual organization that include top-down and bottom-up interactions, for example, in other sensory modalities.
分层生成模型,如贝叶斯网络和置信传播,已经被证明提供了一个理论框架,可以解释感知过程,包括前馈识别和反馈调制。该框架解释了心理物理学和生理学实验数据,并很好地映射到分层分布式皮质解剖结构上。然而,建模皮质过程所需的复杂性使得推断,即使使用近似方法,也非常计算密集。因此,基于这种方法的现有对象感知模型通常限于没有循环的树状网络,使用小的玩具示例,或者无法解释某些感知方面,例如对变换或反馈重建的不变性。在这项研究中,我们开发了一个具有类似于 HMAX 的架构的贝叶斯网络,HMAX 是一种受生物启发的对象识别分层模型,并使用有环置信传播来近似模型操作(选择性和不变性)。至关重要的是,所提出的贝叶斯网络通过包括自上而下的递归反馈扩展了 HMAX 的功能。因此,所提出的模型不仅能够实现对噪声、遮挡、位置和大小变化的成功前馈识别不变性,而且还能够再现调制效应,如错觉轮廓完成和注意力。我们新颖而严格的方法涵盖了关键方面,例如使用分层贪婪算法进行学习、组合来自多个父节点的反馈信息以及减少所需操作的数量。总的来说,这项工作将一个成熟的对象识别模型扩展到包括基于最新概率方法的高层反馈调制。所采用的方法与视觉皮层的证据一致,可以潜在地推广到建立包括自上而下和自下而上相互作用的分层感知组织模型,例如在其他感觉模态中。