• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于运动的结构分割:建模与心理物理学

Segmentation in structure from motion: modeling and psychophysics.

作者信息

Caudek C, Rubin N

机构信息

Center for Neural Science, New York University, 4 Washington Pl., New York, NY 10003, USA.

出版信息

Vision Res. 2001 Sep;41(21):2715-32. doi: 10.1016/s0042-6989(01)00163-8.

DOI:10.1016/s0042-6989(01)00163-8
PMID:11587722
Abstract

Much work has been done on the question of how the visual system extracts the three-dimensional (3D) structure and motion of an object from two-dimensional (2D) motion information, a problem known as 'Structure from Motion', or SFM. Much less is known, however, about the human ability to recover structure and motion when the optic flow field arises from multiple objects, although observations of this ability date as early as Ullman's well-known two-cylinders stimulus [The interpretation of visual motion (1979)]. In the presence of multiple objects, the SFM problem is further aggravated by the need to solve the segmentation problem, i.e. deciding which motion signal belongs to which object. Here, we present a model for how the human visual system solves the combined SFM and segmentation problems, which we term SSFM, concurrently. The model is based on computation of a simple scalar property of the optic flow field known as def, which was previously shown to be used by human observers in SFM. The def values of many triplets of moving dots are computed, and the identification of multiple objects the image is based on detecting multiple peaks in the histogram of def values. In five experiments, we show that human SSFM performance is consistent with the predictions of the model. We compare the predictions of our model to those of other theoretical approaches, in particular those that use a rigidity hypothesis, and discuss the validity of each approach as a model for human SSFM.

摘要

关于视觉系统如何从二维(2D)运动信息中提取物体的三维(3D)结构和运动这一问题,已经开展了大量研究工作,该问题被称为“从运动中恢复结构”,即SFM。然而,对于当光流场由多个物体产生时人类恢复结构和运动的能力,我们所知甚少,尽管对这种能力的观察可以追溯到乌尔曼著名的双圆柱刺激实验[《视觉运动的解释》(1979年)]。在存在多个物体的情况下,由于需要解决分割问题,即确定哪个运动信号属于哪个物体,SFM问题进一步加剧。在此,我们提出了一个关于人类视觉系统如何同时解决SFM和分割这两个组合问题的模型,我们将其称为SSFM。该模型基于对光流场一种简单标量属性的计算,这种属性被称为def,之前的研究表明人类观察者在SFM中会使用它。计算许多移动点三元组的def值,并基于检测def值直方图中的多个峰值来识别图像中的多个物体。在五个实验中,我们表明人类的SSFM表现与该模型的预测一致。我们将我们模型的预测与其他理论方法的预测进行了比较,特别是那些使用刚性假设的方法,并讨论了每种方法作为人类SSFM模型的有效性。

相似文献

1
Segmentation in structure from motion: modeling and psychophysics.基于运动的结构分割:建模与心理物理学
Vision Res. 2001 Sep;41(21):2715-32. doi: 10.1016/s0042-6989(01)00163-8.
2
A unified probabilistic model of the perception of three-dimensional structure from optic flow.一种基于光流感知三维结构的统一概率模型。
Biol Cybern. 2007 Dec;97(5-6):461-77. doi: 10.1007/s00422-007-0183-z. Epub 2007 Nov 7.
3
A new theory of structure-from-motion perception.一种关于运动结构感知的新理论。
J Vis. 2009 Oct 26;9(11):23.1-20. doi: 10.1167/9.11.23.
4
Contributions of form, motion and task to biological motion perception.形态、运动和任务对生物运动感知的作用。
J Vis. 2009 Mar 31;9(3):28.1-11. doi: 10.1167/9.3.28.
5
Restricted ability to recover three-dimensional global motion from one-dimensional local signals: theoretical observations.从一维局部信号恢复三维全局运动的能力受限:理论观察
Vision Res. 1995 Feb;35(4):569-78. doi: 10.1016/0042-6989(94)00136-a.
6
Complex interactions between spatial, orientation, and motion cues for biological motion perception across visual space.视觉空间中生物运动感知的空间、方向和运动线索之间的复杂相互作用。
J Vis. 2013 Feb 6;13(2):8. doi: 10.1167/13.2.8.
7
Short-term temporal recruitment in structure from motion.
Vision Res. 2002 May;42(10):1213-23. doi: 10.1016/s0042-6989(02)00052-4.
8
The stationarity hypothesis: an allocentric criterion in visual perception.平稳性假设:视觉感知中的一种以客体为中心的标准。
Vision Res. 2001 Oct;41(23):3023-37. doi: 10.1016/s0042-6989(01)00190-0.
9
Restricted ability to recover three-dimensional global motion from one-dimensional motion signals: psychophysical observations.从一维运动信号中恢复三维全局运动的能力受限:心理物理学观察
Vision Res. 1995 Feb;35(4):463-76. doi: 10.1016/0042-6989(94)00135-9.
10
A method for the real-time rendering of formless dot field structure-from-motion stimuli.一种用于实时渲染无形点场运动结构刺激的方法。
J Vis. 2008 May 23;8(5):8.1-8. doi: 10.1167/8.5.8.

引用本文的文献

1
Gestalts at threshold could reveal Gestalts as predictions.阈限上的格式塔可以揭示格式塔作为预测。
Sci Rep. 2021 Sep 15;11(1):18308. doi: 10.1038/s41598-021-97878-0.
2
Seeing via Miniature Eye Movements: A Dynamic Hypothesis for Vision.通过微小眼动来看见:视觉的动态假说。
Front Comput Neurosci. 2012 Nov 8;6:89. doi: 10.3389/fncom.2012.00089. eCollection 2012.
3
Bayesian modeling of perceived surface slant from actively-generated and passively-observed optic flow.基于主动生成和被动观测光流的感知表面斜度的贝叶斯建模。
PLoS One. 2011 Apr 14;6(4):e18731. doi: 10.1371/journal.pone.0018731.