• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

颞叶和顶叶网络在不变视觉识别中的直接和间接合作。

Direct and Indirect Cooperation between Temporal and Parietal Networks for Invariant Visual Recognition.

出版信息

J Cogn Neurosci. 1992 Winter;4(1):35-57. doi: 10.1162/jocn.1992.4.1.35.

DOI:10.1162/jocn.1992.4.1.35
PMID:23967856
Abstract

A new type of biologically inspired multilayered network is proposed to model the properties of the primate visual system with respect to invariant visual recognition (IVR). This model is based on 10 major neurobiological and psychological constraints. The first five constraints shape the architecture and properties of the network. 1. The network model has a Y-like double-branched multilayered architecture, with one input (the retina) and two parallel outputs, the "What" and the "Where," which model, respectively, the temporal pathway, specialized for "object" identification, and the parietal pathway specialized for "spatial" localization. 2. Four processing layers are sufficient to model the main functional steps of primate visual system that transform the retinal information into prototypes (object-centered reference frame) in the "What" branch and into an oculomotor command in the "Where" branch. 3. The distribution of receptive field sizes within and between the two functional pathways provides an appropriate tradeoff between discrimination and invariant recognition capabilities. 4. The two outputs are represented by a population coding: the ocular command is computed as a population vector in the "Where" branch and the prototypes are coded in a "semidistributed" way in the "What" branch. In the intermediate associative steps, processing units learn to associate prototypes (through feedback connections) to component features (through feedforward ones). 5. The basic processing units of the network do not model single cells but model the local neuronal circuits that combine different information flows organized in separate cortical layers. Such a biologically constrained model shows shift-invariant and size-invariant capabilities that resemble those of humans (psychological constraints): 6. During the Learning session, a set of patterns (26 capital letters and 2 geometric figures) are presented to the network: a single presentation of each pattern in one position (at the center) and with one size is sufficient to learn the corresponding prototypes (internal representations). These patterns are thus presented in widely varying new sizes and positions during the Recognition session: 7. The "What" branch of the network succeeds in immediate recognition for patterns presented in the central zone of the retina with the learned size. 8. The recognition by the "What" branch is resistant to changes in size within a limited range of variation related to the distribution of receptive field (RF) sizes in the successive processing steps of this pathway. 9. Even when ocular movements are not allowed, the recognition capabilities of the "What" branch are unaffected by changing positions around the learned one. This significant shift-invariance of the "What" branch is also related to the distribution of RF sizes. 10. When varying both sizes and locations, the "What" and the "Where" branches cooperate for recognition: the location coding in the "Where" branch can command, under the control of the "What" branch, an ocular movement efficient to reset peripheral patterns toward the central zone of the retina until successful recognition. This model results in predictions about anatomical connections and physiological interactions between temporal and parietal cortices.

摘要

提出了一种新型的受生物启发的多层网络,用于对灵长类视觉系统的不变视觉识别(IVR)特性进行建模。该模型基于 10 个主要的神经生物学和心理学约束条件。前五个约束条件塑造了网络的架构和特性。1. 网络模型具有 Y 形的双层多层架构,有一个输入(视网膜)和两个并行输出,“What”和“Where”,分别用于模拟专门用于“对象”识别的时间通路和专门用于“空间”定位的顶叶通路。2. 四个处理层足以模拟灵长类视觉系统的主要功能步骤,将视网膜信息转换为“ What”分支中的原型(以对象为中心的参考系)和“ Where”分支中的眼球运动命令。3. 两个功能通路内和通路之间的感受野大小分布提供了辨别力和不变识别能力之间的适当折衷。4. 两个输出由群体编码表示:眼球运动命令在“ Where”分支中计算为群体向量,原型在“ What”分支中以“半分布式”方式编码。在中间联想步骤中,处理单元通过反馈连接学习将原型(通过反馈连接)与组件特征(通过前馈连接)相关联。5. 网络的基本处理单元不模拟单个细胞,而是模拟将组织在不同皮层层中的不同信息流组合在一起的局部神经元回路。这样一种受生物约束的模型显示出类似于人类的移位不变性和大小不变性能力(心理约束):6. 在学习过程中,将一组模式(26 个大写字母和 2 个几何图形)呈现给网络:每个模式在一个位置(中心)和一个大小上的单一呈现足以学习相应的原型(内部表示)。因此,在识别过程中,这些模式以广泛变化的新大小和位置呈现:7. 网络的“ What”分支能够成功识别在学习大小的视网膜中央区域呈现的模式。8. 该“ What”分支的识别不受限于与该途径的连续处理步骤中的感受野(RF)大小分布相关的变化范围内的大小变化的影响。9. 即使不允许眼球运动,“ What”分支的识别能力也不受围绕学习位置的变化的影响。这种“ What”分支的显著移位不变性也与 RF 大小的分布有关。10. 当大小和位置同时变化时,“ What”和“ Where”分支协同进行识别:“ Where”分支中的位置编码可以在“ What”分支的控制下,命令眼球运动有效地将外围模式重置到视网膜的中央区域,直到成功识别。该模型对颞叶和顶叶皮质之间的解剖连接和生理相互作用产生了预测。

相似文献

1
Direct and Indirect Cooperation between Temporal and Parietal Networks for Invariant Visual Recognition.颞叶和顶叶网络在不变视觉识别中的直接和间接合作。
J Cogn Neurosci. 1992 Winter;4(1):35-57. doi: 10.1162/jocn.1992.4.1.35.
2
How does the brain rapidly learn and reorganize view-invariant and position-invariant object representations in the inferotemporal cortex?大脑如何在后颞叶皮层中快速学习和重新组织不变视图和不变位置的物体表示?
Neural Netw. 2011 Dec;24(10):1050-61. doi: 10.1016/j.neunet.2011.04.004. Epub 2011 Apr 22.
3
View-invariant object category learning, recognition, and search: how spatial and object attention are coordinated using surface-based attentional shrouds.视图不变物体类别学习、识别与搜索:基于表面的注意力罩如何协调空间和物体注意力。
Cogn Psychol. 2009 Feb;58(1):1-48. doi: 10.1016/j.cogpsych.2008.05.001. Epub 2008 Jul 23.
4
A hybrid learning network for shift, orientation, and scaling invariant pattern recognition.一种用于平移、旋转和尺度不变模式识别的混合学习网络。
Network. 2001 Nov;12(4):493-512.
5
Invariant visual object recognition: a model, with lighting invariance.不变视觉对象识别:一种具有光照不变性的模型。
J Physiol Paris. 2006 Jul-Sep;100(1-3):43-62. doi: 10.1016/j.jphysparis.2006.09.004. Epub 2006 Oct 30.
6
Brain mechanisms for invariant visual recognition and learning.用于不变视觉识别与学习的脑机制
Behav Processes. 1994 Dec;33(1-2):113-38. doi: 10.1016/0376-6357(94)90062-0. Epub 2002 May 31.
7
On the road to invariant recognition: explaining tradeoff and morph properties of cells in inferotemporal cortex using multiple-scale task-sensitive attentive learning.在不变识别的道路上:使用多尺度任务敏感注意学习解释下颞叶皮质中细胞的权衡和形态特性。
Neural Netw. 2011 Dec;24(10):1036-49. doi: 10.1016/j.neunet.2011.04.001. Epub 2011 Apr 22.
8
Learning optimized features for hierarchical models of invariant object recognition.为不变物体识别的层次模型学习优化特征。
Neural Comput. 2003 Jul;15(7):1559-88. doi: 10.1162/089976603321891800.
9
Invariant object recognition in the visual system with novel views of 3D objects.视觉系统中具有三维物体新视角的不变物体识别
Neural Comput. 2002 Nov;14(11):2585-96. doi: 10.1162/089976602760407982.
10
Invariant global motion recognition in the dorsal visual system: a unifying theory.背侧视觉系统中的不变全局运动识别:一种统一理论。
Neural Comput. 2007 Jan;19(1):139-69. doi: 10.1162/neco.2007.19.1.139.