MonoCap: Monocular Human Motion Capture using a CNN Coupled with a Geometric Prior.

Authors

Zhou Xiaowei, Zhu Menglong, Pavlakos Georgios, Leonardos Spyridon, Derpanis Konstantinos G, Daniilidis Kostas

Publication information

IEEE Trans Pattern Anal Mach Intell. 2019 Apr;41(4):901-914. doi: 10.1109/TPAMI.2018.2816031. Epub 2018 Mar 15.

DOI:10.1109/TPAMI.2018.2816031

Abstract

Recovering 3D full-body human pose is a challenging problem with many applications. It has been successfully addressed by motion capture systems with body worn markers and multiple cameras. In this paper, we address the more challenging case of not only using a single camera but also not leveraging markers: going directly from 2D appearance to 3D geometry. Deep learning approaches have shown remarkable abilities to discriminatively learn 2D appearance features. The missing piece is how to integrate 2D, 3D, and temporal information to recover 3D geometry and account for the uncertainties arising from the discriminative model. We introduce a novel approach that treats 2D joint locations as latent variables whose uncertainty distributions are given by a deep fully convolutional neural network. The unknown 3D poses are modeled by a sparse representation and the 3D parameter estimates are realized via an Expectation-Maximization algorithm, where it is shown that the 2D joint location uncertainties can be conveniently marginalized out during inference. Extensive evaluation on benchmark datasets shows that the proposed approach achieves greater accuracy over state-of-the-art baselines. Notably, the proposed approach does not require synchronized 2D-3D data for training and is applicable to "in-the-wild" images, which is demonstrated with the MPII dataset.

Similar articles

Real-Time 3D Hand Pose Estimation with 3D Convolutional Neural Networks.

IEEE Trans Pattern Anal Mach Intell. 2019 Apr;41(4):956-970. doi: 10.1109/TPAMI.2018.2827052. Epub 2018 Apr 16.

Capturing Complex 3D Human Motions with Kernelized Low-Rank Representation from Monocular RGB Camera.

Sensors (Basel). 2017 Sep 3;17(9):2019. doi: 10.3390/s17092019.

An Efficient 3D Human Pose Retrieval and Reconstruction from 2D Image-Based Landmarks.

Sensors (Basel). 2021 Apr 1;21(7):2415. doi: 10.3390/s21072415.

3D Human Pose Machines with Self-Supervised Learning.

IEEE Trans Pattern Anal Mach Intell. 2020 May;42(5):1069-1082. doi: 10.1109/TPAMI.2019.2892452. Epub 2019 Jan 14.

Fusing information from multiple 2D depth cameras for 3D human pose estimation in the operating room.

Int J Comput Assist Radiol Surg. 2019 Nov;14(11):1871-1879. doi: 10.1007/s11548-019-02044-7. Epub 2019 Aug 6.

Structure from Articulated Motion: Accurate and Stable Monocular 3D Reconstruction without Training Data.

Sensors (Basel). 2019 Oct 22;19(20):4603. doi: 10.3390/s19204603.

MeshLifter: Weakly Supervised Approach for 3D Human Mesh Reconstruction from a Single 2D Pose Based on Loop Structure.

Sensors (Basel). 2020 Jul 30;20(15):4257. doi: 10.3390/s20154257.

Classification of CT brain images based on deep learning networks.

Comput Methods Programs Biomed. 2017 Jan;138:49-56. doi: 10.1016/j.cmpb.2016.10.007. Epub 2016 Oct 20.

Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields.

IEEE Trans Pattern Anal Mach Intell. 2016 Oct;38(10):2024-39. doi: 10.1109/TPAMI.2015.2505283. Epub 2015 Dec 3.

Cited by

A Comprehensive Methodological Survey of Human Activity Recognition Across Diverse Data Modalities.

Sensors (Basel). 2025 Jun 27;25(13):4028. doi: 10.3390/s25134028.

Sensing In Exergames for Efficacy and Motion Quality: Scoping Review of Recent Publications.

JMIR Serious Games. 2024 Nov 5;12:e52153. doi: 10.2196/52153.

Effective evaluation of HGcnMLP method for markerless 3D pose estimation of musculoskeletal diseases patients based on smartphone monocular video.

Front Bioeng Biotechnol. 2024 Jan 9;11:1335251. doi: 10.3389/fbioe.2023.1335251. eCollection 2023.

The analysis of infrared high-speed motion capture system on motion aesthetics of aerobics athletes under biomechanics analysis.

PLoS One. 2023 May 25;18(5):e0286313. doi: 10.1371/journal.pone.0286313. eCollection 2023.

G2O-Pose: Real-Time Monocular 3D Human Pose Estimation Based on General Graph Optimization.

Sensors (Basel). 2022 Oct 30;22(21):8335. doi: 10.3390/s22218335.

3D Pose Estimation and Tracking in Handball Actions Using a Monocular Camera.

J Imaging. 2022 Nov 10;8(11):308. doi: 10.3390/jimaging8110308.

The reliability and validity of gait analysis system using 3D markerless pose estimation algorithms.

Front Bioeng Biotechnol. 2022 Aug 10;10:857975. doi: 10.3389/fbioe.2022.857975. eCollection 2022.

Design of a Resident Physical Fitness Data Monitoring System Based on the Sensor and Fuzzy Algorithm.

Comput Intell Neurosci. 2022 Aug 1;2022:1742807. doi: 10.1155/2022/1742807. eCollection 2022.

Top-Down System for Multi-Person 3D Absolute Pose Estimation from Monocular Videos.

Sensors (Basel). 2022 May 28;22(11):4109. doi: 10.3390/s22114109.

Convolutional neural network in upper limb functional motion analysis after stroke.

PeerJ. 2020 Oct 9;8:e10124. doi: 10.7717/peerj.10124. eCollection 2020.
