Recovering 3D human pose from monocular images.

Author information

Agarwal Ankur, Triggs Bill

Affiliation

INRIA Rhône-Alpes, 665, Avenue de l'Europe, 38330 Montbonnot, France.

Publication information

IEEE Trans Pattern Anal Mach Intell. 2006 Jan;28(1):44-58. doi: 10.1109/TPAMI.2006.21.

DOI:10.1109/TPAMI.2006.21

PMID:16402618

Abstract

We describe a learning-based method for recovering 3D human body pose from single images and monocular image sequences. Our approach requires neither an explicit body model nor prior labeling of body parts in the image. Instead, it recovers pose by direct nonlinear regression against shape descriptor vectors extracted automatically from image silhouettes. For robustness against local silhouette segmentation errors, silhouette shape is encoded by histogram-of-shape-contexts descriptors. We evaluate several different regression methods: ridge regression, Relevance Vector Machine (RVM) regression, and Support Vector Machine (SVM) regression over both linear and kernel bases. The RVMs provide much sparser regressors without compromising performance, and kernel bases give a small but worthwhile improvement in performance. The loss of depth and limb labeling information often makes the recovery of 3D pose from single silhouettes ambiguous. To handle this, the method is embedded in a novel regressive tracking framework, using dynamics from the previous state estimate together with a learned regression value to disambiguate the pose. We show that the resulting system tracks long sequences stably. For realism and good generalization over a wide range of viewpoints, we train the regressors on images resynthesized from real human motion capture data. The method is demonstrated for several representations of full body pose, both quantitatively on independent but similar test data and qualitatively on real image sequences. Mean angular errors of 4-6 degrees are obtained for a variety of walking motions.
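The regression step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a Gaussian kernel basis (the abstract does not name the kernel), uses ridge regression only (not the RVM or SVM variants), and substitutes synthetic random vectors for the histogram-of-shape-contexts descriptors and pose vectors. All function names, dimensions, and parameter values are hypothetical.

```python
import numpy as np


def gaussian_kernel(A, B, sigma):
    """Pairwise Gaussian kernel values between the rows of A and the rows of B."""
    sq = (A ** 2).sum(axis=1)[:, None] + (B ** 2).sum(axis=1)[None, :] - 2.0 * A @ B.T
    # Clip tiny negative values caused by floating-point rounding.
    return np.exp(-np.maximum(sq, 0.0) / (2.0 * sigma ** 2))


def fit_kernel_ridge(X, Y, sigma, lam):
    """Solve (K + lam * I) W = Y for the kernel weight matrix W."""
    K = gaussian_kernel(X, X, sigma)
    return np.linalg.solve(K + lam * np.eye(len(X)), Y)


def predict_pose(X_train, W, X_new, sigma):
    """Predict pose vectors as a weighted sum of kernel basis functions."""
    return gaussian_kernel(X_new, X_train, sigma) @ W


# Synthetic stand-in data (assumption): 200 "descriptors" of dimension 10
# mapped nonlinearly to 3-dimensional "pose" vectors.
rng = np.random.default_rng(0)
X_train = rng.normal(size=(200, 10))
Y_train = np.tanh(X_train @ rng.normal(size=(10, 3)))

W = fit_kernel_ridge(X_train, Y_train, sigma=3.0, lam=1e-3)
Y_fit = predict_pose(X_train, W, X_train, sigma=3.0)

train_rmse = float(np.sqrt(np.mean((Y_fit - Y_train) ** 2)))
baseline_rmse = float(np.sqrt(np.mean(Y_train ** 2)))  # error of always predicting zero
print(f"training RMSE: {train_rmse:.4f} (zero-predictor baseline: {baseline_rmse:.4f})")
```

The regularization weight `lam` trades fit against smoothness. A sparse method such as the RVM would instead retain only a subset of the training examples as basis centers, which is the sparsity advantage the abstract reports.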

Similar articles

Recovering 3D human body configurations using shape contexts.

IEEE Trans Pattern Anal Mach Intell. 2006 Jul;28(7):1052-62. doi: 10.1109/TPAMI.2006.149.

A model-based approach for estimating human 3D poses in static images.

IEEE Trans Pattern Anal Mach Intell. 2006 Jun;28(6):905-16. doi: 10.1109/TPAMI.2006.110.

Tracking people on a torus.

IEEE Trans Pattern Anal Mach Intell. 2009 Mar;31(3):520-38. doi: 10.1109/TPAMI.2008.101.

A Bayesian framework for extracting human gait using strong prior knowledge.

IEEE Trans Pattern Anal Mach Intell. 2006 Nov;28(11):1738-52. doi: 10.1109/TPAMI.2006.214.

Recovering articulated pose: a comparison of two pre and postimposed constraint methods.

IEEE Trans Pattern Anal Mach Intell. 2006 Jan;28(1):163-8. doi: 10.1109/TPAMI.2006.22.

Make3D: learning 3D scene structure from a single still image.

IEEE Trans Pattern Anal Mach Intell. 2009 May;31(5):824-40. doi: 10.1109/TPAMI.2008.132.

Motion analysis of articulated objects from monocular images.

IEEE Trans Pattern Anal Mach Intell. 2006 Apr;28(4):625-36. doi: 10.1109/TPAMI.2006.78.

Face recognition from a single training image under arbitrary unknown lighting using spherical harmonics.

IEEE Trans Pattern Anal Mach Intell. 2006 Mar;28(3):351-63. doi: 10.1109/TPAMI.2006.53.

A particle filtering framework for joint video tracking and pose estimation.

IEEE Trans Image Process. 2010 Jun;19(6):1625-34. doi: 10.1109/TIP.2010.2043009. Epub 2010 Mar 8.

Cited by

A Systematic Review of Recent Deep Learning Approaches for 3D Human Pose Estimation.

J Imaging. 2023 Dec 12;9(12):275. doi: 10.3390/jimaging9120275.

Multi-Camera-Based Human Activity Recognition for Human-Robot Collaboration in Construction.

Sensors (Basel). 2023 Aug 7;23(15):6997. doi: 10.3390/s23156997.

Pixels2Pose: Super-resolution time-of-flight imaging for 3D pose estimation.

Sci Adv. 2022 Dec 2;8(48):eade0123. doi: 10.1126/sciadv.ade0123. Epub 2022 Nov 30.

An Efficient 3D Human Pose Retrieval and Reconstruction from 2D Image-Based Landmarks.

Sensors (Basel). 2021 Apr 1;21(7):2415. doi: 10.3390/s21072415.

3D Human Pose Estimation with a Catadioptric Sensor in Unconstrained Environments Using an Annealed Particle Filter.

Sensors (Basel). 2020 Dec 7;20(23):6985. doi: 10.3390/s20236985.

Multi-View-Based Pose Estimation and Its Applications on Intelligent Manufacturing.

Sensors (Basel). 2020 Sep 7;20(18):5072. doi: 10.3390/s20185072.

A Review of the Evolution of Vision-Based Motion Analysis and the Integration of Advanced Computer Vision Methods Towards Developing a Markerless System.

Sports Med Open. 2018 Jun 5;4(1):24. doi: 10.1186/s40798-018-0139-y.

Training Classifiers with Shadow Features for Sensor-Based Human Activity Recognition.

Sensors (Basel). 2017 Feb 27;17(3):476. doi: 10.3390/s17030476.

Human Pose Estimation from Monocular Images: A Comprehensive Survey.

Sensors (Basel). 2016 Nov 25;16(12):1966. doi: 10.3390/s16121966.

Forest Walk Methods for Localizing Body Joints from Single Depth Image.

PLoS One. 2015 Sep 24;10(9):e0138328. doi: 10.1371/journal.pone.0138328. eCollection 2015.
