van den Berg A V, Brenner E
Erasmus University Rotterdam, Faculty of Medicine, The Netherlands.
Vision Res. 1994 Aug;34(16):2153-67. doi: 10.1016/0042-6989(94)90324-7.
The retinal flow during normal locomotion contains components due to rotation and translation of the observer. The translatory part of the flow-pattern is informative of heading, because it radiates outward from the direction of heading. However, it is not directly accessible from the retinal flow. Nevertheless, humans can perceive their direction of heading from the compound retinal flow without need for extra-retinal signals that indicate the rotation. Two classes of models have been proposed to explain the visual decomposition of the retinal flow into its constituent parts. One type relies on local operations to remove the rotational part of the flow field. The other type explicitly determines the direction and magnitude of the rotation from the global retinal flow, for subsequent removal. According to the former model, nearby points are most reliable for estimating one's heading. In the latter type of model the quality of the heading estimate depends on the accuracy with which the ego-rotation is determined and is therefore most reliable when based on the most distant points. We report that subjects underestimate the eccentricity of heading, relative to the fixated point in the ground plane, when the visible range of the ground plane is reduced. Moreover we find that in perception of heading, humans can tolerate more noise than the optimal observer (in the least squares sense) would do if only using optic flow. The latter finding argues against both schemes because ultimately both classes of model are limited in their noise tolerance to that of the optimal observer, which uses all information available in the optic flow. Apparently humans use more information than is present in the optic flow. Both aspects of human performance are consistent with the use of static depth information in addition to the optic flow to select the most distant points. Processing of the flow of these selected points provides the most reliable estimate of the ego-rotation. Subsequent estimates of the heading direction, obtained from the translatory component of the flow, are robust with respect to noise. In such a scheme heading estimates are subject to systematic errors, similar to those reported, if the most distant points are not much further away than the fixation point, because the ego-rotation is underestimated.
正常运动过程中的视网膜流动包含由于观察者的旋转和平移而产生的成分。流动模式的平移部分提供了关于前进方向的信息,因为它从前进方向向外辐射。然而,它不能直接从视网膜流动中获取。尽管如此,人类可以从复合视网膜流动中感知自己的前进方向,而无需额外的视网膜信号来指示旋转。已经提出了两类模型来解释视网膜流动分解为其组成部分的视觉过程。一类模型依靠局部运算来去除流场的旋转部分。另一类模型则从全局视网膜流动中明确确定旋转的方向和大小,以便随后去除。根据前一种模型,附近的点对于估计前进方向最为可靠。在后一种模型中,前进方向估计的质量取决于自我旋转确定的准确性,因此基于最远点时最为可靠。我们报告称,当地面平面的可见范围缩小时,相对于地平面上的注视点,受试者会低估前进方向的偏心率。此外,我们发现,在前进方向感知中,人类能够容忍比仅使用光流的最优观察者(在最小二乘意义上)更多的噪声。后一个发现与这两种模型都相悖,因为最终这两类模型在噪声容忍度上都局限于使用光流中所有可用信息的最优观察者的噪声容忍度。显然,人类使用的信息比光流中存在的信息更多。人类表现的这两个方面都与除了光流之外还使用静态深度信息来选择最远点一致。对这些选定点的流动进行处理可提供最可靠的自我旋转估计。随后从流动的平移分量获得的前进方向估计对噪声具有鲁棒性。在这样的方案中,如果最远点不比注视点远很多,前进方向估计会出现类似所报告的系统误差,因为自我旋转被低估了。