• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于单目视频的多人 3D 绝对姿态估计的自顶向下系统。

Top-Down System for Multi-Person 3D Absolute Pose Estimation from Monocular Videos.

机构信息

Université Clermont-Auvergne, CNRS, Mines de Saint-Étienne, Clermont-Auvergne-INP, LIMOS, 63000 Clermont-Ferrand, France.

Alqualsadi Research Team, Rabat IT Center, ENSIAS, Mohammed V University in Rabat, Rabat 10112, Morocco.

出版信息

Sensors (Basel). 2022 May 28;22(11):4109. doi: 10.3390/s22114109.

DOI:10.3390/s22114109
PMID:35684728
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9185275/
Abstract

Two-dimensional (2D) multi-person pose estimation and three-dimensional (3D) root-relative pose estimation from a monocular RGB camera have made significant progress recently. Yet, real-world applications require depth estimations and the ability to determine the distances between people in a scene. Therefore, it is necessary to recover the 3D absolute poses of several people. However, this is still a challenge when using cameras from single points of view. Furthermore, the previously proposed systems typically required a significant amount of resources and memory. To overcome these restrictions, we herein propose a real-time framework for multi-person 3D absolute pose estimation from a monocular camera, which integrates a human detector, a 2D pose estimator, a 3D root-relative pose reconstructor, and a root depth estimator in a top-down manner. The proposed system, called Root-GAST-Net, is based on modified versions of GAST-Net and RootNet networks. The efficiency of the proposed Root-GAST-Net system is demonstrated through quantitative and qualitative evaluations on two benchmark datasets, Human3.6M and MuPoTS-3D. On all evaluated metrics, our experimental results on the MuPoTS-3D dataset outperform the current state-of-the-art by a significant margin, and can run in real-time at 15 fps on the Nvidia GeForce GTX 1080.

摘要

从单目 RGB 相机进行二维(2D)多人姿态估计和三维(3D)根相对姿态估计最近取得了显著进展。然而,实际应用需要深度估计和确定场景中人与人之间距离的能力。因此,有必要恢复多个人的 3D 绝对姿态。但是,当使用来自单点视角的相机时,这仍然是一个挑战。此外,以前提出的系统通常需要大量的资源和内存。为了克服这些限制,我们提出了一种从单目相机实时进行多人 3D 绝对姿态估计的框架,该框架自上而下集成了人体探测器、2D 姿态估计器、3D 根相对姿态重构器和根深度估计器。该系统称为 Root-GAST-Net,它基于 GAST-Net 和 RootNet 网络的修改版本。通过在 Human3.6M 和 MuPoTS-3D 两个基准数据集上进行定量和定性评估,证明了所提出的 Root-GAST-Net 系统的效率。在所有评估指标上,我们在 MuPoTS-3D 数据集上的实验结果明显优于当前的最先进技术,并且可以在 Nvidia GeForce GTX 1080 上以 15 fps 的实时速度运行。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8cc/9185275/639ab6ccb6db/sensors-22-04109-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8cc/9185275/6326233257e9/sensors-22-04109-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8cc/9185275/791736008e30/sensors-22-04109-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8cc/9185275/519f79469e62/sensors-22-04109-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8cc/9185275/386513d83c06/sensors-22-04109-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8cc/9185275/639ab6ccb6db/sensors-22-04109-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8cc/9185275/6326233257e9/sensors-22-04109-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8cc/9185275/791736008e30/sensors-22-04109-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8cc/9185275/519f79469e62/sensors-22-04109-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8cc/9185275/386513d83c06/sensors-22-04109-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8cc/9185275/639ab6ccb6db/sensors-22-04109-g005.jpg

相似文献

1
Top-Down System for Multi-Person 3D Absolute Pose Estimation from Monocular Videos.基于单目视频的多人 3D 绝对姿态估计的自顶向下系统。
Sensors (Basel). 2022 May 28;22(11):4109. doi: 10.3390/s22114109.
2
Dual Networks Based 3D Multi-Person Pose Estimation From Monocular Video.基于双网络的单目视频3D多人姿态估计
IEEE Trans Pattern Anal Mach Intell. 2023 Feb;45(2):1636-1651. doi: 10.1109/TPAMI.2022.3170353. Epub 2023 Jan 6.
3
A Unified Deep Framework for Joint 3D Pose Estimation and Action Recognition from a Single RGB Camera.基于单目 RGB 相机的联合 3D 姿态估计和动作识别的统一深度框架。
Sensors (Basel). 2020 Mar 25;20(7):1825. doi: 10.3390/s20071825.
4
3D Human Pose Machines with Self-Supervised Learning.基于自监督学习的 3D 人体姿态估计
IEEE Trans Pattern Anal Mach Intell. 2020 May;42(5):1069-1082. doi: 10.1109/TPAMI.2019.2892452. Epub 2019 Jan 14.
5
LCR-Net++: Multi-Person 2D and 3D Pose Detection in Natural Images.LCR-Net++:自然图像中的多人 2D 和 3D 姿态检测。
IEEE Trans Pattern Anal Mach Intell. 2020 May;42(5):1146-1161. doi: 10.1109/TPAMI.2019.2892985. Epub 2019 Jan 14.
6
HDPose: Post-Hierarchical Diffusion with Conditioning for 3D Human Pose Estimation.HDPose:基于条件化的后分层扩散方法用于三维人体姿态估计
Sensors (Basel). 2024 Jan 26;24(3):829. doi: 10.3390/s24030829.
7
Center point to pose: Multiple views 3D human pose estimation for multi-person.中心点姿态:多人多角度三维人体姿态估计
PLoS One. 2022 Sep 13;17(9):e0274450. doi: 10.1371/journal.pone.0274450. eCollection 2022.
8
Human Joint Angle Estimation Using Deep Learning-Based Three-Dimensional Human Pose Estimation for Application in a Real Environment.基于深度学习的三维人体姿态估计的人体关节角度估计及其在真实环境中的应用。
Sensors (Basel). 2024 Jun 13;24(12):3823. doi: 10.3390/s24123823.
9
Fast and Robust Multi-Person 3D Pose Estimation and Tracking From Multiple Views.快速稳健的多人三维姿态估计与多视角跟踪。
IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):6981-6992. doi: 10.1109/TPAMI.2021.3098052. Epub 2022 Sep 14.
10
Capturing Complex 3D Human Motions with Kernelized Low-Rank Representation from Monocular RGB Camera.基于单目 RGB 相机的核化低秩表示从捕获复杂的 3D 人体运动。
Sensors (Basel). 2017 Sep 3;17(9):2019. doi: 10.3390/s17092019.

引用本文的文献

1
Robust skeletal motion tracking using temporal and spatial synchronization of two video streams.利用两个视频流的时间和空间同步进行稳健的骨骼运动跟踪。
PLoS One. 2025 Aug 7;20(8):e0328969. doi: 10.1371/journal.pone.0328969. eCollection 2025.
2
A Systematic Review of Recent Deep Learning Approaches for 3D Human Pose Estimation.近期用于三维人体姿态估计的深度学习方法的系统综述。
J Imaging. 2023 Dec 12;9(12):275. doi: 10.3390/jimaging9120275.
3
Intelligent Sensors for Human Motion Analysis.用于人体运动分析的智能传感器。

本文引用的文献

1
VoxelTrack: Multi-Person 3D Human Pose Estimation and Tracking in the Wild.体素追踪:野外多人三维人体姿态估计与跟踪。
IEEE Trans Pattern Anal Mach Intell. 2023 Feb;45(2):2613-2626. doi: 10.1109/TPAMI.2022.3163709. Epub 2023 Jan 6.
2
Westdrive X LoopAR: An Open-Access Virtual Reality Project in Unity for Evaluating User Interaction Methods during Takeover Requests.西驱 X 循环 AR:Unity 中的一个开放获取虚拟现实项目,用于评估接管请求过程中的用户交互方法。
Sensors (Basel). 2021 Mar 8;21(5):1879. doi: 10.3390/s21051879.
3
The RoScan Thermal 3D Body Scanning System: Medical Applicability and Benefits for Unobtrusive Sensing and Objective Diagnosis.
Sensors (Basel). 2022 Jun 30;22(13):4952. doi: 10.3390/s22134952.
罗氏 Scan 热三维人体扫描系统:用于非侵入式感应和客观诊断的医学适用性和益处。
Sensors (Basel). 2020 Nov 20;20(22):6656. doi: 10.3390/s20226656.
4
Exploiting Ru-Induced Lattice Strain in CoRu Nanoalloys for Robust Bifunctional Hydrogen Production.利用钴钌纳米合金中钌诱导的晶格应变实现高效双功能析氢
Angew Chem Int Ed Engl. 2021 Feb 8;60(6):3290-3298. doi: 10.1002/anie.202013985. Epub 2020 Dec 8.
5
Passive Detection and Imaging of Human Body Radiation Using an Uncooled Field-Effect Transistor-Based THz Detector.利用基于场效应晶体管的太赫兹探测器被动探测和人体辐射成像。
Sensors (Basel). 2020 Jul 22;20(15):4087. doi: 10.3390/s20154087.
6
LCR-Net++: Multi-Person 2D and 3D Pose Detection in Natural Images.LCR-Net++:自然图像中的多人 2D 和 3D 姿态检测。
IEEE Trans Pattern Anal Mach Intell. 2020 May;42(5):1146-1161. doi: 10.1109/TPAMI.2019.2892985. Epub 2019 Jan 14.
7
Classification of Human Daily Activities Using Ensemble Methods Based on Smartphone Inertial Sensors.基于智能手机惯性传感器的人类日常活动分类的集成方法。
Sensors (Basel). 2018 Nov 26;18(12):4132. doi: 10.3390/s18124132.
8
Structured-Light-Based System for Shape Measurement of the Human Body in Motion.基于结构光的人体运动形态测量系统。
Sensors (Basel). 2018 Aug 27;18(9):2827. doi: 10.3390/s18092827.
9
MonoCap: Monocular Human Motion Capture using a CNN Coupled with a Geometric Prior.单目人体运动捕捉:使用结合几何先验的卷积神经网络进行单目人体运动捕捉。
IEEE Trans Pattern Anal Mach Intell. 2019 Apr;41(4):901-914. doi: 10.1109/TPAMI.2018.2816031. Epub 2018 Mar 15.
10
Sparse Representation for 3D Shape Estimation: A Convex Relaxation Approach.基于稀疏表示的三维形状估计:一种凸松弛方法。
IEEE Trans Pattern Anal Mach Intell. 2017 Aug;39(8):1648-1661. doi: 10.1109/TPAMI.2016.2605097. Epub 2016 Sep 1.