Suppr超能文献

三维大小感知的心理几何。

Mental geometry of three-dimensional size perception.

机构信息

Graduate Center for Vision Research, State University of New York, New York, NY.

出版信息

J Vis. 2020 Aug 3;20(8):14. doi: 10.1167/jov.20.8.14.

Abstract

Judging the poses, sizes, and shapes of objects accurately is necessary for organisms and machines to operate successfully in the world. Retinal images of three-dimensional objects are mapped by the rules of projective geometry and preserve the invariants of that geometry. Since Plato, it has been debated whether geometry is innate to the human brain, and Poincare and Einstein thought it worth examining whether formal geometry arises from experience with the world. We examine if humans have learned to exploit projective geometry to estimate sizes and aspects of three-dimensional shape that are related to relative lengths and aspect ratios. Numerous studies have examined size invariance as a function of physical distance, which changes scale on the retina. However, it is surprising that possible constancy or inconstancy of relative size seems not to have been investigated for object pose, which changes retinal image size differently along different axes. We show systematic underestimation of length for extents pointing toward or away from the observer, both for static objects and dynamically rotating objects. Observers do correct for projected shortening according to the optimal back-transform, obtained by inverting the projection function, but the correction is inadequate by a multiplicative factor. The clue is provided by the greater underestimation for longer objects, and the observation that they seem to be more slanted toward the observer. Adding a multiplicative factor for perceived slant in the back-transform model provides good fits to the corrections used by observers. We quantify the slant illusion with two different slant matching measurements, and use a dynamic demonstration to show that the slant illusion perceptually dominates length nonrigidity. In biological and mechanical objects, distortions of shape are manifold, and changes in aspect ratio and relative limb sizes are functionally important. Our model shows that observers try to retain invariance of these aspects of shape to three-dimensional rotation by correcting retinal image distortions due to perspective projection, but the corrections can fall short. We discuss how these results imply that humans have internalized particular aspects of projective geometry through evolution or learning, and if humans assume that images are preserving the continuity, collinearity, and convergence invariances of projective geometry, that would simply explain why illusions such as Ames' chair appear cohesive despite being a projection of disjointed elements, and thus supplement the generic viewpoint assumption.

摘要

准确判断物体的姿势、大小和形状对于生物体和机器在世界上成功运作是必要的。三维物体的视网膜图像是按照投影几何的规则映射的,并保留了该几何的不变量。自柏拉图以来,人们一直在争论几何是否是人类大脑的固有属性,彭加勒和爱因斯坦认为值得研究形式几何是否源于对世界的经验。我们研究人类是否已经学会利用投影几何来估计与相对长度和纵横比相关的三维形状的大小和方面。许多研究已经研究了大小不变性作为物理距离的函数,而物理距离会在视网膜上改变比例。然而,令人惊讶的是,对于物体姿势(它沿着不同的轴改变视网膜图像的大小),似乎没有研究相对大小的恒定性或不恒定性。我们发现,对于指向观察者或远离观察者的范围,无论是静态物体还是动态旋转物体,都会系统地低估长度。观察者确实会根据通过反转投影函数获得的最佳反向变换来校正投影缩短,但校正不足一个乘法因子。线索是较长物体的低估程度更大,并且观察到它们似乎更向观察者倾斜。在反向变换模型中添加一个用于感知倾斜的乘法因子,可以很好地拟合观察者使用的校正。我们使用两种不同的倾斜匹配测量来量化倾斜错觉,并使用动态演示来显示倾斜错觉在感知上主导了长度非刚性。在生物和机械物体中,形状的扭曲是多方面的,并且纵横比和相对肢体大小的变化在功能上很重要。我们的模型表明,观察者试图通过校正由于透视投影而导致的视网膜图像失真来保持对三维旋转的形状这些方面的不变性,但校正可能不足。我们讨论了这些结果如何暗示人类通过进化或学习内化了投影几何的某些方面,如果人类假设图像保持了投影几何的连续性、共线性和收敛不变性,那么这将简单地解释为什么像 Ames 椅子这样的错觉看起来很有凝聚力,尽管它是不连贯元素的投影,因此补充了通用视角假设。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e7ea/7438683/744131563e18/jovi-20-8-14-s1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验