Efficient human pose estimation from single depth images.

Affiliation

Microsoft Research, Cambridge.

Publication

IEEE Trans Pattern Anal Mach Intell. 2013 Dec;35(12):2821-40. doi: 10.1109/TPAMI.2012.241.

Abstract

We describe two new approaches to human pose estimation. Both can quickly and accurately predict the 3D positions of body joints from a single depth image without using any temporal information. The key to both approaches is the use of a large, realistic, and highly varied synthetic set of training images. This allows us to learn models that are largely invariant to factors such as pose, body shape, field-of-view cropping, and clothing. Our first approach employs an intermediate body parts representation, designed so that an accurate per-pixel classification of the parts will localize the joints of the body. The second approach instead directly regresses the positions of body joints. By using simple depth pixel comparison features and parallelizable decision forests, both approaches can run super-real time on consumer hardware. Our evaluation investigates many aspects of our methods, and compares the approaches to each other and to the state of the art. Results on silhouettes suggest broader applicability to other imaging modalities.
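The "simple depth pixel comparison features" mentioned in the abstract are offset depth differences in which the probe offsets are scaled by the inverse depth at the reference pixel, so the feature responds consistently whether the person stands near or far from the camera. The following is a minimal illustrative Python sketch of such a feature, not the authors' implementation: the function names, the millimetre units, the synthetic scene, and the background constant are assumptions made for the example. In the full method, a decision forest would compare the returned value against a learned threshold at each split node.

```python
import numpy as np

# Large constant depth returned when a probe falls outside the image
# or on a background pixel (an assumption for this sketch).
BACKGROUND_DEPTH = 1e6

def probe_depth(depth, px, py):
    """Depth at integer pixel (px, py); out-of-bounds or background reads BACKGROUND_DEPTH."""
    h, w = depth.shape
    if 0 <= px < w and 0 <= py < h and depth[py, px] > 0:
        return float(depth[py, px])
    return BACKGROUND_DEPTH

def depth_comparison_feature(depth, x, y, u, v):
    """
    Depth-invariant pixel comparison feature:
        f_{u,v}(I, x) = d_I(x + u / d_I(x)) - d_I(x + v / d_I(x))
    The offsets u and v are scaled by the inverse depth at the reference
    pixel, so the probe pattern covers roughly the same body area
    regardless of the subject's distance from the camera.
    """
    d = float(depth[y, x])
    if d <= 0:
        return 0.0  # reference pixel is background; no meaningful feature
    ux, uy = int(round(x + u[0] / d)), int(round(y + u[1] / d))
    vx, vy = int(round(x + v[0] / d)), int(round(y + v[1] / d))
    return probe_depth(depth, ux, uy) - probe_depth(depth, vx, vy)

if __name__ == "__main__":
    # Synthetic scene (depth in mm): a flat wall at 2 m with a nearer
    # "body" region at 1.5 m, purely for illustration.
    depth = np.full((240, 320), 2000.0)
    depth[100:200, 140:180] = 1500.0
    value = depth_comparison_feature(depth, x=160, y=150,
                                     u=(60000.0, 0.0), v=(0.0, -60000.0))
    threshold = 100.0  # a tree node would branch on value > threshold
    print(value, value > threshold)
```

Because the feature is just two array reads and a subtraction per pixel, it can be evaluated independently for every pixel and every tree, which is what makes the decision forests parallelizable and fast enough for super-real-time operation on consumer hardware.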
