Maersk Mc-Kinney Moller Institute, University of Southern Denmark, Campusvej 55, Odense M 5230, Denmark.
IEEE Trans Pattern Anal Mach Intell. 2013 Aug;35(8):1847-71. doi: 10.1109/TPAMI.2012.272.
Computational modeling of the primate visual system yields insights of potential relevance to some of the challenges that computer vision is facing, such as object recognition and categorization, motion detection and activity recognition, or vision-based navigation and manipulation. This paper reviews some functional principles and structures that are generally thought to underlie the primate visual cortex, and attempts to extract biological principles that could further advance computer vision research. Organized for a computer vision audience, we present functional principles of the processing hierarchies present in the primate visual system considering recent discoveries in neurophysiology. The hierarchical processing in the primate visual system is characterized by a sequence of different levels of processing (on the order of 10) that constitute a deep hierarchy in contrast to the flat vision architectures predominantly used in today's mainstream computer vision. We hope that the functional description of the deep hierarchies realized in the primate visual system provides valuable insights for the design of computer vision algorithms, fostering increasingly productive interaction between biological and computer vision research.
灵长类视觉系统的计算建模为计算机视觉面临的一些挑战提供了潜在的见解,例如目标识别和分类、运动检测和活动识别,或基于视觉的导航和操作。本文综述了一些普遍认为是灵长类视觉皮层基础的功能原理和结构,并试图提取可能进一步推动计算机视觉研究的生物学原理。针对计算机视觉受众,我们根据神经生理学的最新发现,提出了灵长类视觉系统中处理层次的功能原理。灵长类视觉系统的分层处理的特点是一系列不同的处理层次(大约 10 个),与当今主流计算机视觉中主要使用的扁平视觉架构相比构成了一个深层层次结构。我们希望灵长类视觉系统中实现的深层层次结构的功能描述为计算机视觉算法的设计提供了有价值的见解,促进了生物和计算机视觉研究之间日益富有成效的互动。