Ullman S
Perception. 1984;13(3):255-74. doi: 10.1068/p130255.
The human visual system can extract 3-D shape information of unfamiliar moving objects from their projected transformations. Computational studies of this capacity have established that 3-D shape can be extracted correctly from a brief presentation, provided the moving objects are rigid. The human visual system requires a longer temporal extension, but it can cope with considerable deviations from rigidity. It is shown how the 3-D structure of rigid as well as nonrigid objects can be recovered by maintaining an internal model of the viewed object and modifying it at each instant by the minimal nonrigid change that is sufficient to account for the observed transformation. The results of applying this incremental rigidity scheme to rigid and nonrigid objects in motion are described and compared with human perception.
人类视觉系统能够从陌生移动物体的投影变换中提取其三维形状信息。对这种能力的计算研究已证实,只要移动物体是刚性的,就能从简短的呈现中正确提取三维形状。人类视觉系统需要更长的时间跨度,但它能够应对与刚性有相当大偏差的情况。本文展示了如何通过维持所观察物体的内部模型,并在每个瞬间通过足以解释所观察到的变换的最小非刚性变化来修改该模型,从而恢复刚性和非刚性物体的三维结构。描述了将这种增量刚性方案应用于运动中的刚性和非刚性物体的结果,并与人类感知进行了比较。