Suppr超能文献

人类立体视觉的计算理论。

A computational theory of human stereo vision.

作者信息

Marr D, Poggio T

出版信息

Proc R Soc Lond B Biol Sci. 1979 May 23;204(1156):301-28. doi: 10.1098/rspb.1979.0029.

Abstract

An algorithm is proposed for solving the stereoscopic matching problem. The algorithm consists of five steps: (1) Each image is filtered at different orientations with bar masks of four sizes that increase with eccentricity; the equivalent filters are one or two octaves wide. (2) Zero-crossings in the filtered images, which roughly correspond to edges, are localized. Positions of the ends of lines and edges are also found. (3) For each mask orientation and size, matching takes place between pairs of zero-crossings or terminationss of the same sign in the two images, for a range of disparities up to about the width of the mask's central region. (4) Wide masks can control vergence movements, thus causing small masks to come into correspondence. (5) When a correspondence is achieved, it is stored in a dynamic buffer, called the 2 1/2-D sketch. It is shown that this proposal provides a theoretical framework for most existing psychophysical and neurophysiological data about stereopsis. Several critical experimental predictions are also made, for instance about the size of Panum's area under various conditions. The results of such experiments would tell us whether, for example, cooperativity is necessary for the matching process.

摘要

提出了一种用于解决立体匹配问题的算法。该算法由五个步骤组成:(1)使用四种尺寸的条形掩模以不同方向对每幅图像进行滤波,这些尺寸随离心率增加;等效滤波器的宽度为一个或两个倍频程。(2)对滤波后的图像中的零交叉点进行定位,这些零交叉点大致对应于边缘。还找出线条和边缘的端点位置。(3)对于每个掩模方向和尺寸,在两幅图像中具有相同符号的零交叉点或终止点对之间进行匹配,匹配视差范围可达掩模中心区域宽度左右。(4)宽掩模可控制辐辏运动,从而使小掩模实现对应。(5)当实现对应时,将其存储在一个称为2.5维草图的动态缓冲区中。结果表明,该提议为大多数现有的关于立体视觉的心理物理学和神经生理学数据提供了一个理论框架。还做出了几个关键的实验预测,例如关于在各种条件下潘诺区的大小。此类实验的结果将告诉我们,例如,匹配过程是否需要协同作用。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验