College of Information Engineering, Capital Normal University, Beijing 100048, China.
Sensors (Basel). 2024 Jul 8;24(13):4422. doi: 10.3390/s24134422.
Three-dimensional human pose estimation focuses on generating 3D pose sequences from 2D videos. It has enormous potential in human-robot interaction, remote sensing, virtual reality, and computer vision. Existing state-of-the-art methods primarily explore spatial or temporal encoding to achieve 3D pose inference. However, these architectures exploit the independent effects of spatial and temporal cues on 3D pose estimation while neglecting their spatial-temporal synergy. To address this issue, this paper proposes a novel 3D pose estimation method with a dual-adaptive spatial-temporal former (DASTFormer) and an additional supervised training strategy. The DASTFormer contains attention-adaptive (AtA) and pure-adaptive (PuA) modes, which enhance pose inference from 2D to 3D by adaptively learning spatial-temporal effects, considering both their cooperative and independent influences. In addition, an additional supervised training scheme with a batch variance loss is proposed. Unlike the common training strategy, it performs a two-round parameter update on the same batch of data. This not only better explores the potential relationship between spatial-temporal encoding and 3D poses, but also alleviates the batch-size limitations that GPU memory imposes on transformer-based frameworks. Extensive experimental results show that the proposed method significantly outperforms most state-of-the-art approaches on the Human3.6M and HumanEva datasets.
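The two-round parameter update on the same batch can be illustrated with a minimal sketch: a toy linear model trained with plain NumPy gradient descent, where each batch is used for two consecutive updates. The model, the mean-squared-error loss, and the learning rate here are illustrative assumptions; the paper's actual DASTFormer losses (including the batch variance loss) are not specified in this abstract.

```python
import numpy as np

def mse_grad(w, X, y):
    # Gradient of the mean-squared error for a linear model y_hat = X @ w.
    return 2.0 * X.T @ (X @ w - y) / len(y)

def two_round_step(w, X, y, lr=0.1):
    """Perform TWO parameter updates on the SAME batch, echoing the
    paper's additional supervised training (loss details hypothetical)."""
    w = w - lr * mse_grad(w, X, y)   # round 1: standard supervised update
    w = w - lr * mse_grad(w, X, y)   # round 2: second pass on the same batch
    return w

# Small fixed, well-conditioned design matrix as a stand-in for a batch
# of 2D pose features; the targets play the role of 3D pose supervision.
X = np.array([[1., 0., 0.],
              [0., 1., 0.],
              [0., 0., 1.],
              [1., 1., 1.]])
w_true = np.array([1.0, -2.0, 0.5])
y = X @ w_true

w = np.zeros(3)
for _ in range(200):          # 200 batches -> 400 parameter updates
    w = two_round_step(w, X, y)
print(w)
```

Because each batch drives two optimizer steps, an effectively larger number of updates is extracted from the same GPU-resident data, which is one plausible reading of how this strategy eases batch-size constraints.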