• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于单张图像或视频序列的鲁棒 3D 人体姿态估计。

Robust 3D Human Pose Estimation from Single Images or Video Sequences.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2019 May;41(5):1227-1241. doi: 10.1109/TPAMI.2018.2828427. Epub 2018 Apr 19.

DOI:10.1109/TPAMI.2018.2828427
PMID:29993907
Abstract

We propose a method for estimating 3D human poses from single images or video sequences. The task is challenging because: (a) many 3D poses can have similar 2D pose projections which makes the lifting ambiguous, and (b) current 2D joint detectors are not accurate which can cause big errors in 3D estimates. We represent 3D poses by a sparse combination of bases which encode structural pose priors to reduce the lifting ambiguity. This prior is strengthened by adding limb length constraints. We estimate the 3D pose by minimizing an L norm measurement error between the 2D pose and the 3D pose because it is less sensitive to inaccurate 2D poses. We modify our algorithm to output K 3D pose candidates for an image, and for videos, we impose a temporal smoothness constraint to select the best sequence of 3D poses from the candidates. We demonstrate good results on 3D pose estimation from static images and improved performance by selecting the best 3D pose from the K proposals. Our results on video sequences also show improvements (over static images) of roughly 15%.

摘要

我们提出了一种从单张图像或视频序列中估计 3D 人体姿势的方法。这个任务具有挑战性,原因在于:(a) 许多 3D 姿势可能具有相似的 2D 姿势投影,这使得提升过程变得模糊;(b) 目前的 2D 关节探测器不够精确,这可能会导致 3D 估计的误差很大。我们通过稀疏组合基来表示 3D 姿势,这些基编码了结构姿势先验,以减少提升的模糊性。通过添加肢体长度约束,进一步增强了这个先验。我们通过最小化 2D 姿势和 3D 姿势之间的 L 范数测量误差来估计 3D 姿势,因为它对不准确的 2D 姿势不太敏感。我们修改了我们的算法,为图像输出 K 个 3D 姿势候选,对于视频,我们施加一个时间平滑约束,从候选中选择最佳的 3D 姿势序列。我们在静态图像的 3D 姿势估计方面取得了良好的效果,并通过从 K 个提案中选择最佳的 3D 姿势来提高性能。我们在视频序列上的结果也显示出(相对于静态图像)约 15%的改进。

相似文献

1
Robust 3D Human Pose Estimation from Single Images or Video Sequences.基于单张图像或视频序列的鲁棒 3D 人体姿态估计。
IEEE Trans Pattern Anal Mach Intell. 2019 May;41(5):1227-1241. doi: 10.1109/TPAMI.2018.2828427. Epub 2018 Apr 19.
2
3D Human Pose Machines with Self-Supervised Learning.基于自监督学习的 3D 人体姿态估计
IEEE Trans Pattern Anal Mach Intell. 2020 May;42(5):1069-1082. doi: 10.1109/TPAMI.2019.2892452. Epub 2019 Jan 14.
3
LCR-Net++: Multi-Person 2D and 3D Pose Detection in Natural Images.LCR-Net++:自然图像中的多人 2D 和 3D 姿态检测。
IEEE Trans Pattern Anal Mach Intell. 2020 May;42(5):1146-1161. doi: 10.1109/TPAMI.2019.2892985. Epub 2019 Jan 14.
4
A self-supervised spatio-temporal attention network for video-based 3D infant pose estimation.基于视频的 3D 婴儿姿态估计的自监督时空注意网络。
Med Image Anal. 2024 Aug;96:103208. doi: 10.1016/j.media.2024.103208. Epub 2024 May 18.
5
Joint albedo estimation and pose tracking from video.视频中的联合反射率估计和姿态跟踪。
IEEE Trans Pattern Anal Mach Intell. 2013 Jul;35(7):1674-89. doi: 10.1109/TPAMI.2012.249.
6
A model-based approach for estimating human 3D poses in static images.一种用于估计静态图像中人体三维姿态的基于模型的方法。
IEEE Trans Pattern Anal Mach Intell. 2006 Jun;28(6):905-16. doi: 10.1109/TPAMI.2006.110.
7
Weakly Supervised Adversarial Learning for 3D Human Pose Estimation from Point Clouds.基于点云的弱监督对抗学习三维人体姿态估计
IEEE Trans Vis Comput Graph. 2020 May;26(5):1851-1859. doi: 10.1109/TVCG.2020.2973076. Epub 2020 Feb 13.
8
An Efficient 3D Human Pose Retrieval and Reconstruction from 2D Image-Based Landmarks.基于二维图像特征点的高效三维人体姿态检索与重建。
Sensors (Basel). 2021 Apr 1;21(7):2415. doi: 10.3390/s21072415.
9
Human Joint Angle Estimation Using Deep Learning-Based Three-Dimensional Human Pose Estimation for Application in a Real Environment.基于深度学习的三维人体姿态估计的人体关节角度估计及其在真实环境中的应用。
Sensors (Basel). 2024 Jun 13;24(12):3823. doi: 10.3390/s24123823.
10
Learning to Augment Poses for 3D Human Pose Estimation in Images and Videos.学习增强图像和视频中的 3D 人体姿态估计的姿态。
IEEE Trans Pattern Anal Mach Intell. 2023 Aug;45(8):10012-10026. doi: 10.1109/TPAMI.2023.3243400. Epub 2023 Jun 30.

引用本文的文献

1
Improved convolutional neural network for precise exercise posture recognition and intelligent health indicator prediction.用于精确运动姿势识别和智能健康指标预测的改进卷积神经网络。
Sci Rep. 2025 Jul 1;15(1):21309. doi: 10.1038/s41598-025-01854-x.
2
Natural scenes reveal diverse representations of 2D and 3D body pose in the human brain.自然场景在人类大脑中揭示了 2D 和 3D 身体姿势的多种表现形式。
Proc Natl Acad Sci U S A. 2024 Jun 11;121(24):e2317707121. doi: 10.1073/pnas.2317707121. Epub 2024 Jun 3.
3
How do people think about the implementation of speech and video recognition technology in emergency medical practice?
人们如何看待语音和视频识别技术在急诊医疗实践中的应用?
PLoS One. 2022 Sep 23;17(9):e0275280. doi: 10.1371/journal.pone.0275280. eCollection 2022.
4
Center point to pose: Multiple views 3D human pose estimation for multi-person.中心点姿态:多人多角度三维人体姿态估计
PLoS One. 2022 Sep 13;17(9):e0274450. doi: 10.1371/journal.pone.0274450. eCollection 2022.
5
LHPE-nets: A lightweight 2D and 3D human pose estimation model with well-structural deep networks and multi-view pose sample simplification method.LHPE-nets:一种具有良好结构深度网络和多视图姿态样本简化方法的轻量级 2D 和 3D 人体姿态估计模型。
PLoS One. 2022 Feb 23;17(2):e0264302. doi: 10.1371/journal.pone.0264302. eCollection 2022.
6
PGNet: Pipeline Guidance for Human Key-Point Detection.PGNet:人体关键点检测的流水线引导
Entropy (Basel). 2020 Mar 24;22(3):369. doi: 10.3390/e22030369.