School of Computing, National University of Computer and Emerging Sciences, Islamabad 44000, Pakistan.
Gokhale Method Institute, Stanford, CA 94305, USA.
Sensors (Basel). 2021 Apr 1;21(7):2415. doi: 10.3390/s21072415.
We propose an efficient and novel architecture for 3D articulated human pose retrieval and reconstruction from 2D landmarks extracted from a 2D synthetic image, an annotated 2D image, a real RGB image, or even a hand-drawn sketch. Given 2D joint positions in a single image, we devise a data-driven framework to infer the corresponding 3D human pose. To this end, we first normalize 3D human poses from a Motion Capture (MoCap) dataset by eliminating translation, orientation, and skeleton-size discrepancies, and then build a normalized 2D pose space by projecting a subset of joints of the normalized 3D poses onto 2D image planes under a variety of virtual cameras. With this approach, we not only transform the 3D pose space into a normalized 2D pose space but also resolve the 2D-3D cross-domain retrieval task efficiently. The proposed architecture searches the MoCap dataset for poses that are close to a given 2D query pose in a feature space built from specific joint sets. The retrieved poses are then used to construct a weak-perspective camera and a final 3D pose that minimizes the reconstruction error under this camera model. To estimate the unknown camera parameters, we introduce a two-fold nonlinear method: we exploit the retrieved similar poses and the viewing directions at which the MoCap dataset was sampled to minimize the projection error. Finally, we evaluate our approach thoroughly on a large number of heterogeneous 2D examples: synthetically generated 2D poses, 2D images with ground truth, a variety of real Internet images, and, as a proof of concept, 2D hand-drawn sketches of human poses. We conduct a pool of experiments for a quantitative study on the PARSE dataset. We also show that the proposed system yields competitive, convincing results in comparison to other state-of-the-art methods.
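As an illustration of the normalization and virtual-camera stage, the following is a minimal sketch assuming poses are given as (J, 3) NumPy arrays; the joint indices ROOT, L_HIP, R_HIP, the choice of y as the vertical axis, and the azimuth/elevation sampling grid are assumptions made for illustration, since the abstract does not fix the exact joint set or virtual-camera sampling:

```python
import numpy as np

ROOT, L_HIP, R_HIP = 0, 1, 4  # hypothetical joint indices; dataset-specific

def normalize_pose(pose3d):
    """Remove translation, global orientation, and skeleton-size differences."""
    p = pose3d - pose3d[ROOT]                    # translation: center at the root joint
    hips = p[L_HIP] - p[R_HIP]
    theta = np.arctan2(hips[2], hips[0])         # hip-axis angle in the ground plane
    c, s = np.cos(theta), np.sin(theta)
    R = np.array([[c, 0.0, s],                   # rotation about the vertical (y) axis
                  [0.0, 1.0, 0.0],
                  [-s, 0.0, c]])
    p = p @ R.T                                  # orientation: align hips with the x axis
    return p / np.linalg.norm(p, axis=1).max()   # skeleton size: unit maximal joint reach

def virtual_projections(pose3d, n_azimuth=12, n_elevation=3):
    """Project a normalized 3D pose onto 2D image planes from a grid of virtual views."""
    views = []
    for az in np.linspace(0.0, 2.0 * np.pi, n_azimuth, endpoint=False):
        for el in np.linspace(-np.pi / 6, np.pi / 6, n_elevation):
            ca, sa, ce, se = np.cos(az), np.sin(az), np.cos(el), np.sin(el)
            Ry = np.array([[ca, 0, sa], [0, 1, 0], [-sa, 0, ca]])
            Rx = np.array([[1, 0, 0], [0, ce, -se], [0, se, ce]])
            views.append((pose3d @ (Rx @ Ry).T)[:, :2])  # drop depth: orthographic 2D pose
    return np.stack(views)                               # (n_azimuth * n_elevation, J, 2)
```

Stacking these projections over the whole MoCap dataset yields the normalized 2D pose space in which a 2D query can be matched by nearest-neighbor search over the chosen joint subsets.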
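The two-fold camera estimation can be pictured as follows: a minimal sketch assuming a weak-perspective model x ≈ s · (R X)[:, :2] + t, in which a first, discrete stage scores the sampled MoCap viewing directions and a second stage refines the best candidate with generic nonlinear least squares. The function names and the rotation-vector parameterization are illustrative and not taken from the paper:

```python
import numpy as np
from scipy.optimize import least_squares
from scipy.spatial.transform import Rotation

def project_weak(params, X):
    """Weak-perspective projection; params = [s, rx, ry, rz, tx, ty]."""
    s, rvec, t = params[0], params[1:4], params[4:6]
    R = Rotation.from_rotvec(rvec).as_matrix()
    return s * (X @ R.T)[:, :2] + t

def fit_camera(X, x2d, sampled_rotations):
    """Two-fold fit: discrete initialization from sampled views, then refinement."""
    best, best_err = None, np.inf
    for R in sampled_rotations:                  # stage 1: score MoCap viewing directions
        P = (X @ R.T)[:, :2]
        Pc = P - P.mean(axis=0)
        xc = x2d - x2d.mean(axis=0)
        s = (Pc.ravel() @ xc.ravel()) / (Pc.ravel() @ Pc.ravel() + 1e-12)
        err = np.linalg.norm(s * Pc - xc)        # residual after optimal scale
        if err < best_err:
            t = x2d.mean(axis=0) - s * P.mean(axis=0)
            rvec = Rotation.from_matrix(R).as_rotvec()
            best, best_err = np.concatenate([[s], rvec, t]), err
    # stage 2: nonlinear refinement of scale, rotation, and translation
    res = least_squares(lambda p: (project_weak(p, X) - x2d).ravel(), best)
    return res.x
```

Given the fitted camera, the final 3D pose can then be taken as the retrieved candidate (or a combination of candidates) whose weak-perspective projection best matches the query landmarks, consistent with the reconstruction-error criterion described in the abstract.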