论内侧几何学在人类视觉中的作用。

On the role of medial geometry in human vision.

作者信息

Kimia Benjamin B

机构信息

LEMS, Division of Engineering, Brown University, Providence, RI 02912, USA.

出版信息

J Physiol Paris. 2003 Mar-May;97(2-3):155-90. doi: 10.1016/j.jphysparis.2003.09.003.

DOI:10.1016/j.jphysparis.2003.09.003

PMID:14766140

Abstract

A key challenge underlying theories of vision is how the spatially restricted, retinotopically represented feature analysis can be integrated to form abstract, coordinate-free object models. A resolution likely depends on the use of intermediate-level representations which can on the one hand be populated by local features and on the other hand be used as atomic units underlying the formation of, and interaction with, object hypotheses. The precise structure of this intermediate representation derives from the varied requirements of a range of visual tasks which motivate a significant role for incorporating a geometry of visual form. The need to integrate input from features capturing surface properties such as texture, shading, motion, color, etc., as well as from features capturing surface discontinuities such as silhouettes, T-junctions, etc., implies a geometry which captures both regional and boundary aspects. Curves, as a geometric model of boundaries, have been extensively used as an intermediate representation in computational, perceptual, and physiological studies, while the use of the medial axis (MA) has been popular mainly in computer vision as a geometric region-based model of the interior of closed boundaries. We extend the traditional model of the MA to represent images, where each MA segment represents a region of the image which we call a visual fragment. We present a unified theory of perceptual grouping and object recognition where through various sequences of transformations of the MA representation, visual fragments are grouped in various configurations to form object hypotheses, and are related to stored models. The mechanisms underlying both the computation and the transformation of the MA is a lateral wave propagation model. Recent psychophysical experiments depicting contrast sensitivity map peaks at the medial axes of stimuli, and experiments on perceptual filling-in, and brightness induction and modulation, are consistent with both the use of an MA representation and a propagation-based scheme. Also, recent neurophysiological recordings in V1 correlate with the MA hypothesis and a horizontal propagation scheme. This evidence supports a geometric computational paradigm for processing sensory data where both dynamic in-plane propagation and feedforward-feedback connections play an integral role.

摘要

视觉理论的一个关键挑战在于，如何将空间受限的、视网膜拓扑表示的特征分析进行整合，以形成抽象的、无坐标的对象模型。一种解决方案可能依赖于使用中间层表示，这种表示一方面可以由局部特征填充，另一方面可以用作形成对象假设并与对象假设进行交互的原子单元。这种中间表示的精确结构源于一系列视觉任务的不同要求，这些要求促使在纳入视觉形式的几何结构方面发挥重要作用。整合来自捕捉表面属性（如纹理、阴影、运动、颜色等）的特征以及来自捕捉表面不连续性（如轮廓、T 型连接等）的特征的输入需求，意味着一种既捕捉区域又捕捉边界方面的几何结构。曲线作为边界的几何模型，在计算、感知和生理研究中被广泛用作中间表示，而中轴线（MA）的使用主要在计算机视觉中作为基于几何区域的封闭边界内部模型而流行。我们扩展了传统的中轴线模型来表示图像，其中每个中轴线段表示图像的一个区域，我们称之为视觉片段。我们提出了一种统一的感知分组和对象识别理论，通过中轴线表示的各种变换序列，视觉片段以各种配置进行分组以形成对象假设，并与存储的模型相关联。中轴线计算和变换的基础机制是横向波传播模型。最近描述刺激中轴线处对比敏感度图峰值的心理物理学实验、感知填充实验以及亮度诱导和调制实验，都与使用中轴线表示和基于传播的方案一致。此外，最近在 V1 区的神经生理学记录与中轴线假设和水平传播方案相关。这一证据支持了一种用于处理感官数据的几何计算范式，其中动态平面内传播和前馈 - 反馈连接都起着不可或缺的作用。

相似文献

On the role of medial geometry in human vision.论内侧几何学在人类视觉中的作用。

J Physiol Paris. 2003 Mar-May;97(2-3):155-90. doi: 10.1016/j.jphysparis.2003.09.003.

Texture segmentation in human perception: a combined modeling and fMRI study.人类感知中的纹理分割：一项结合建模与功能磁共振成像的研究。

Neuroscience. 2008 Feb 6;151(3):730-6. doi: 10.1016/j.neuroscience.2007.11.040. Epub 2007 Dec 4.

How position dependent is visual object recognition?视觉物体识别在多大程度上依赖于位置？

Trends Cogn Sci. 2008 Mar;12(3):114-22. doi: 10.1016/j.tics.2007.12.006. Epub 2008 Feb 11.

Cortical connections and early visual function: intra- and inter-columnar processing.

J Physiol Paris. 2003 Mar-May;97(2-3):191-208. doi: 10.1016/j.jphysparis.2003.09.004.

Geometry and the visual brain.

J Physiol Paris. 2003 Mar-May;97(2-3):99-103. doi: 10.1016/j.jphysparis.2003.09.012.

Feedforward and recurrent processing in scene segmentation: electroencephalography and functional magnetic resonance imaging.场景分割中的前馈和循环处理：脑电图与功能磁共振成像

J Cogn Neurosci. 2008 Nov;20(11):2097-109. doi: 10.1162/jocn.2008.20142.

A model for learning topographically organized parts-based representations of objects in visual cortex: topographic nonnegative matrix factorization.一种用于在视觉皮层中学习物体的基于部分的拓扑组织表示的模型：拓扑非负矩阵分解。

Neural Comput. 2009 Sep;21(9):2605-33. doi: 10.1162/neco.2009.03-08-722.

Neural mechanisms for the robust representation of junctions.

Neural Comput. 2004 May;16(5):1013-37. doi: 10.1162/089976604773135087.

The neurogeometry of pinwheels as a sub-Riemannian contact structure.作为次黎曼接触结构的风车状神经元几何结构

J Physiol Paris. 2003 Mar-May;97(2-3):265-309. doi: 10.1016/j.jphysparis.2003.10.010.

Texture segregation by visual cortex: perceptual grouping, attention, and learning.视觉皮层的纹理分离：知觉分组、注意力与学习

Vision Res. 2007 Nov;47(25):3173-211. doi: 10.1016/j.visres.2007.07.013. Epub 2007 Sep 27.

引用本文的文献

Hiding the Rabbit: Using a genetic algorithm to investigate shape guidance in visual search.隐藏兔子：使用遗传算法研究视觉搜索中的形状引导。

J Vis. 2022 Jan 4;22(1):7. doi: 10.1167/jov.22.1.7.

Skeletal representations of shape in the human visual cortex.人类视觉皮层中的形状骨骼表示。

Neuropsychologia. 2022 Jan 7;164:108092. doi: 10.1016/j.neuropsychologia.2021.108092. Epub 2021 Nov 18.

Interaction of surface pattern and contour shape in the tilt after effects evoked by symmetry.对称性诱发的倾斜后效中表面图形与轮廓形状的相互作用。

Sci Rep. 2021 Apr 13;11(1):8024. doi: 10.1038/s41598-021-87429-y.

Skeletal descriptions of shape provide unique perceptual information for object recognition.骨骼描述的形状为物体识别提供了独特的感知信息。

Sci Rep. 2019 Jun 27;9(1):9359. doi: 10.1038/s41598-019-45268-y.

Skeletal representations of shape in human vision: Evidence for a pruned medial axis model.人类视觉中形状的骨骼表示：修剪后的中轴模型的证据。

J Vis. 2019 Jun 3;19(6):6. doi: 10.1167/19.6.6.

Surface diagnosticity predicts the high-level representation of regular and irregular object shape in human vision.表面诊断性预测人类视觉中规则和不规则物体形状的高级表征。

Atten Percept Psychophys. 2019 Jul;81(5):1589-1608. doi: 10.3758/s13414-019-01698-4.

Representations of regular and irregular shapes by deep Convolutional Neural Networks, monkey inferotemporal neurons and human judgments.深度卷积神经网络、猴子下颞叶神经元和人类判断对规则和不规则形状的表示。

PLoS Comput Biol. 2018 Oct 26;14(10):e1006557. doi: 10.1371/journal.pcbi.1006557. eCollection 2018 Oct.

Integration of objects and space in perception and memory.感知和记忆中的物体与空间整合。

Nat Neurosci. 2017 Oct 26;20(11):1493-1503. doi: 10.1038/nn.4657.

Shape beyond recognition: form-derived directionality and its effects on visual attention and motion perception.面目全非：形态衍生的方向性及其对视知觉和运动知觉的影响。

J Exp Psychol Gen. 2014 Feb;143(1):434-54. doi: 10.1037/a0032353. Epub 2013 Apr 8.

Medial axis shape coding in macaque inferotemporal cortex.恒河猴下颞叶皮层中的中轴形状编码。

Neuron. 2012 Jun 21;74(6):1099-113. doi: 10.1016/j.neuron.2012.04.029.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

论内侧几何学在人类视觉中的作用。

On the role of medial geometry in human vision.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献