Three-dimensional pose discrimination in natural images of humans.

Authors

Zhu Hongru, Yuille Alan, Kersten Daniel

Affiliations

Department of Cognitive Science, Johns Hopkins University.

Department of Psychology, University of Minnesota Twin Cities.

Publication information

Cogsci. 2021 Jul;43:223-229.

Abstract

Perceiving 3D structure in natural images is an immense computational challenge for the visual system. While many previous studies focused on the perception of rigid 3D objects, we applied a novel method to a common set of non-rigid objects: static images of the human body in the natural world. We investigated to what extent the human ability to interpret 3D poses in natural images depends on the typicality of the underlying 3D pose and the informativeness of the viewpoint. Using a novel two-alternative forced-choice (2AFC) pose matching task, we measured how well subjects could match a target natural pose image with one of two synthetic comparison body images shown from a different viewpoint: one was rendered with the same 3D pose parameters as the target, while the other was a distractor rendered with noise added to the joint angles. We found that performance for typical poses was measurably better than for atypical poses; however, we found no significant difference between informative and less informative viewpoints. Further comparisons of 2D and 3D pose matching models on the same task showed that 3D body knowledge is particularly important when interpreting images of atypical poses. These results suggest that the human ability to interpret 3D poses depends on pose typicality but not on viewpoint informativeness, and that humans probably use prior knowledge of 3D pose structures.
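
The structure of the 2AFC pose matching task can be illustrated with a small simulation. The sketch below is not the authors' stimuli or models: the toy skeleton, the orthographic projection, the noise levels, and every function name (joints_from_angles, project, pick_closest, run_trials) are assumptions made for illustration only. It builds a random pose, creates a distractor by adding Gaussian noise to the joint angles, and compares a matcher that works on image-plane joint positions with one that works on a noisy estimate of the 3D joints.

```python
import numpy as np


def joints_from_angles(angles):
    """Toy forward kinematics (an assumption, not a real body model).

    angles has shape (n_bones, 2) holding (azimuth, elevation) per bone.
    Returns joint positions of shape (n_bones + 1, 3) for a single chain
    of unit-length bones starting at the origin.
    """
    az, el = angles[:, 0], angles[:, 1]
    bones = np.stack(
        [np.cos(el) * np.cos(az), np.sin(el), np.cos(el) * np.sin(az)], axis=1
    )
    return np.vstack([np.zeros(3), np.cumsum(bones, axis=0)])


def project(joints, view_deg):
    """Rotate about the vertical (y) axis, then project orthographically to 2D."""
    t = np.deg2rad(view_deg)
    rot = np.array(
        [[np.cos(t), 0.0, np.sin(t)], [0.0, 1.0, 0.0], [-np.sin(t), 0.0, np.cos(t)]]
    )
    return (joints @ rot.T)[:, :2]


def pick_closest(target, cand_a, cand_b):
    """Return 0 if cand_a is at least as close to target as cand_b, else 1."""
    dist_a = np.linalg.norm(target - cand_a)
    dist_b = np.linalg.norm(target - cand_b)
    return 0 if dist_a <= dist_b else 1


def run_trials(n_trials=500, noise_sd=0.3, recovery_sd=0.2,
               view_change_deg=60.0, seed=0):
    """Simulate 2AFC trials; return (accuracy_2d, accuracy_3d).

    noise_sd: std. dev. (radians) of the noise added to the distractor's angles.
    recovery_sd: std. dev. of the error in the hypothetical 3D estimate of
    the target pose used by the 3D matcher.
    """
    rng = np.random.default_rng(seed)
    correct_2d = 0
    correct_3d = 0
    for _ in range(n_trials):
        angles = rng.uniform(-np.pi / 2, np.pi / 2, size=(8, 2))
        distractor_angles = angles + rng.normal(0.0, noise_sd, size=angles.shape)
        match_3d = joints_from_angles(angles)
        distractor_3d = joints_from_angles(distractor_angles)

        # 2D matcher: compares image-plane joints across the viewpoint change.
        target_2d = project(match_3d, 0.0)
        correct_2d += pick_closest(
            target_2d,
            project(match_3d, view_change_deg),
            project(distractor_3d, view_change_deg),
        ) == 0

        # 3D matcher: compares a noisy 3D estimate of the target with the
        # candidates' 3D joints, which do not depend on the viewpoint.
        target_3d_est = match_3d + rng.normal(0.0, recovery_sd, size=match_3d.shape)
        correct_3d += pick_closest(target_3d_est, match_3d, distractor_3d) == 0
    return correct_2d / n_trials, correct_3d / n_trials


if __name__ == "__main__":
    acc_2d, acc_3d = run_trials()
    print(f"2D matcher accuracy: {acc_2d:.2f}")
    print(f"3D matcher accuracy: {acc_3d:.2f}")
```

In this toy setup the matching candidate always occupies slot 0, whereas a real experiment would randomize its position. Any numbers the simulation prints depend entirely on the assumed parameters and say nothing about the results reported in the abstract.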
