• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Picture perception reveals mental geometry of 3D scene inferences.图片感知揭示了 3D 场景推理的心理几何。
Proc Natl Acad Sci U S A. 2018 Jul 24;115(30):7807-7812. doi: 10.1073/pnas.1804873115. Epub 2018 Jul 9.
2
Complexity of mental geometry for 3D pose perception.用于3D姿势感知的心理几何学复杂性。
Vision Res. 2024 Sep;222:108438. doi: 10.1016/j.visres.2024.108438. Epub 2024 Jun 8.
3
Mental geometry of perceiving 3D size in pictures.知觉图片中三维大小的心理几何
J Vis. 2020 Oct 1;20(10):4. doi: 10.1167/jov.20.10.4.
4
Mental geometry of three-dimensional size perception.三维大小感知的心理几何。
J Vis. 2020 Aug 3;20(8):14. doi: 10.1167/jov.20.8.14.
5
Why pictures look right when viewed from the wrong place.为什么从错误的位置看图片时它们看起来是正确的。
Nat Neurosci. 2005 Oct;8(10):1401-10. doi: 10.1038/nn1553. Epub 2005 Sep 18.
6
Gestalt-like constraints produce veridical (Euclidean) percepts of 3D indoor scenes.类格式塔约束产生三维室内场景的真实(欧几里得)感知。
Vision Res. 2016 Sep;126:264-277. doi: 10.1016/j.visres.2015.09.011. Epub 2015 Nov 3.
7
Ponzo’s Illusion in 3D: Perspective Gradients Dominate Differences in Retinal Size.三维空间中的庞佐错觉:透视梯度主导视网膜大小差异。
Multisens Res. 2016;29(4-5):421-38. doi: 10.1163/22134808-00002522.
8
An Efficient 3D Human Pose Retrieval and Reconstruction from 2D Image-Based Landmarks.基于二维图像特征点的高效三维人体姿态检索与重建。
Sensors (Basel). 2021 Apr 1;21(7):2415. doi: 10.3390/s21072415.
9
Perceived depth of 3-D objects in 3-D scenes.三维场景中三维物体的感知深度。
Perception. 2001;30(6):681-92. doi: 10.1068/p3087.
10
Multiple Photographs of a Perspective Scene Reveal the Principles of Picture Perception.一幅透视场景的多张照片揭示了图像感知的原理。
Vision (Basel). 2018 Jun 26;2(3):26. doi: 10.3390/vision2030026.

引用本文的文献

1
Generative Phenomenology of form perception: Perceptograms and cortical models for amblyopic phantom percepts.形式感知的生成现象学:弱视幻视的感知图与皮层模型
bioRxiv. 2025 Jun 25:2025.06.24.661078. doi: 10.1101/2025.06.24.661078.
2
Drawing as a versatile cognitive tool.绘画作为一种多功能的认知工具。
Nat Rev Psychol. 2023 Sep;2(9):556-568. doi: 10.1038/s44159-023-00212-w. Epub 2023 Jul 17.
3
Perceptual transitions between object rigidity and non-rigidity: Competition and cooperation among motion energy, feature tracking, and shape-based priors.物体刚性和非刚性之间的知觉转换:运动能量、特征跟踪和基于形状的先验之间的竞争与合作。
J Vis. 2024 Feb 1;24(2):3. doi: 10.1167/jov.24.2.3.
4
Perception of 3D shape integrates intuitive physics and analysis-by-synthesis.对三维形状的感知综合了直观物理和分析综合。
Nat Hum Behav. 2024 Feb;8(2):320-335. doi: 10.1038/s41562-023-01759-7. Epub 2023 Nov 23.
5
Mental geometry of perceiving 3D size in pictures.知觉图片中三维大小的心理几何
J Vis. 2020 Oct 1;20(10):4. doi: 10.1167/jov.20.10.4.
6
Mental geometry of three-dimensional size perception.三维大小感知的心理几何。
J Vis. 2020 Aug 3;20(8):14. doi: 10.1167/jov.20.8.14.

本文引用的文献

1
The lawful imprecision of human surface tilt estimation in natural scenes.人类在自然场景中对表面倾斜估计的合法不精确性。
Elife. 2018 Jan 31;7:e31448. doi: 10.7554/eLife.31448.
2
Visual perception as retrospective Bayesian decoding from high- to low-level features.从高级特征到低级特征的回溯贝叶斯解码的视觉感知。
Proc Natl Acad Sci U S A. 2017 Oct 24;114(43):E9115-E9124. doi: 10.1073/pnas.1706906114. Epub 2017 Oct 9.
3
Virtual slant explains perceived slant, distortion, and motion in pictorial scenes.虚拟倾斜解释了在图像场景中感知到的倾斜、变形和运动。
Perception. 2013;42(3):253-70. doi: 10.1068/p7328.
4
The perceptual basis of common photographic practice.常见摄影实践的感知基础。
J Vis. 2012 May 25;12(5):8. doi: 10.1167/12.5.8.
5
Cardinal rules: visual orientation perception reflects knowledge of environmental statistics.首要规则:视觉方向知觉反映了对环境统计数据的了解。
Nat Neurosci. 2011 Jun 5;14(7):926-32. doi: 10.1038/nn.2831.
6
Perspective-based illusory movement in a flat billboard--an explanation.平面广告牌中基于视角的虚幻运动——一种解释
Perception. 2010;39(8):1086-93. doi: 10.1068/p5990.
7
Is perceptual space inherently non-Euclidean?感知空间本质上是非欧几里得的吗?
J Math Psychol. 2009 Apr 1;53(2):86-91. doi: 10.1016/j.jmp.2008.12.006.
8
Is pictorial perception robust? The effect of the observer vantage point on the perceived depth structure of linear-perspective images.图像感知是否稳健?观察者视角对线性透视图像中感知深度结构的影响。
Perception. 2008;37(1):106-25. doi: 10.1068/p5657.
9
Fundamental failures of shape constancy resulting from cortical anisotropy.由皮质各向异性导致的形状恒常性的基本缺陷。
J Neurosci. 2007 Nov 14;27(46):12540-5. doi: 10.1523/JNEUROSCI.4496-07.2007.
10
Why pictures look right when viewed from the wrong place.为什么从错误的位置看图片时它们看起来是正确的。
Nat Neurosci. 2005 Oct;8(10):1401-10. doi: 10.1038/nn1553. Epub 2005 Sep 18.

图片感知揭示了 3D 场景推理的心理几何。

Picture perception reveals mental geometry of 3D scene inferences.

机构信息

Graduate Center for Vision Research, College of Optometry, State University of New York, New York, NY 10036.

Graduate Center for Vision Research, College of Optometry, State University of New York, New York, NY 10036

出版信息

Proc Natl Acad Sci U S A. 2018 Jul 24;115(30):7807-7812. doi: 10.1073/pnas.1804873115. Epub 2018 Jul 9.

DOI:10.1073/pnas.1804873115
PMID:29987008
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6065005/
Abstract

Pose estimation of objects in real scenes is critically important for biological and machine visual systems, but little is known of how humans infer 3D poses from 2D retinal images. We show unexpectedly remarkable agreement in the 3D poses different observers estimate from pictures. We further show that all observers apply the same inferential rule from all viewpoints, utilizing the geometrically derived back-transform from retinal images to actual 3D scenes. Pose estimations are altered by a fronto-parallel bias, and by image distortions that appear to tilt the ground plane. We used pictures of single sticks or pairs of joined sticks taken from different camera angles. Observers viewed these from five directions, and matched the perceived pose of each stick by rotating an arrow on a horizontal touchscreen. The projection of each 3D stick to the 2D picture, and then onto the retina, is described by an invertible trigonometric expression. The inverted expression yields the back-projection for each object pose, camera elevation, and observer viewpoint. We show that a model that uses the back-projection, modulated by just two free parameters, explains 560 pose estimates per observer. By considering changes in retinal image orientations due to position and elevation of limbs, the model also explains perceived limb poses in a complex scene of two bodies lying on the ground. The inferential rules simply explain both perceptual invariance and dramatic distortions in poses of real and pictured objects, and show the benefits of incorporating projective geometry of light into mental inferences about 3D scenes.

摘要

真实场景中物体的姿态估计对于生物和机器视觉系统至关重要,但人类如何从 2D 视网膜图像推断 3D 姿态知之甚少。我们发现,不同观察者从图像中估计的 3D 姿态惊人地一致。我们进一步表明,所有观察者都从所有视角应用相同的推理规则,利用从视网膜图像到实际 3D 场景的几何反变换。姿态估计受到正面平行偏差和图像扭曲的影响,这些扭曲似乎使地面平面倾斜。我们使用从不同摄像机角度拍摄的单个棒或成对连接棒的图片。观察者从五个方向观看这些图片,并通过在水平触摸屏上旋转箭头来匹配每个棒的感知姿态。每个 3D 棒在 2D 图片上的投影,然后在视网膜上的投影,由可反转的三角函数表达式描述。反转的表达式为每个物体姿态、摄像机高度和观察者视点生成反向投影。我们表明,使用反向投影并仅通过两个自由参数进行调制的模型可以解释每个观察者的 560 个姿态估计。通过考虑由于四肢位置和高度引起的视网膜图像方向的变化,该模型还可以解释在地面上躺着两个身体的复杂场景中感知到的肢体姿态。推理规则简单地解释了真实和图像物体姿态的感知不变性和显著扭曲,并展示了将光的投影几何纳入对 3D 场景的心理推断的好处。