• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于实时应用的二维/三维人体姿态自动估计的统一端到端 YOLOv5-HR-TCM 框架。

Unified End-to-End YOLOv5-HR-TCM Framework for Automatic 2D/3D Human Pose Estimation for Real-Time Applications.

机构信息

Faculty of Engineering Technology, Hung Vuong University, Viet Tri City 35100, Vietnam.

Department of Intelligent Computer Systems, Czestochowa University of Technology, 42-218 Czestochowa, Poland.

出版信息

Sensors (Basel). 2022 Jul 20;22(14):5419. doi: 10.3390/s22145419.

DOI:10.3390/s22145419
PMID:35891099
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9315644/
Abstract

Three-dimensional human pose estimation is widely applied in sports, robotics, and healthcare. In the past five years, the number of CNN-based studies for 3D human pose estimation has been numerous and has yielded impressive results. However, studies often focus only on improving the accuracy of the estimation results. In this paper, we propose a fast, unified end-to-end model for estimating 3D human pose, called YOLOv5-HR-TCM (YOLOv5-HRet-Temporal Convolution Model). Our proposed model is based on the 2D to 3D lifting approach for 3D human pose estimation while taking care of each step in the estimation process, such as person detection, 2D human pose estimation, and 3D human pose estimation. The proposed model is a combination of best practices at each stage. Our proposed model is evaluated on the Human 3.6M dataset and compared with other methods at each step. The method achieves high accuracy, not sacrificing processing speed. The estimated time of the whole process is 3.146 FPS on a low-end computer. In particular, we propose a sports scoring application based on the deviation angle between the estimated 3D human posture and the standard (reference) origin. The average deviation angle evaluated on the Human 3.6M dataset (Protocol #1-Pro #1) is 8.2 degrees.

摘要

三维人体姿态估计广泛应用于体育、机器人和医疗保健领域。在过去的五年中,基于 CNN 的三维人体姿态估计研究数量众多,取得了令人瞩目的成果。然而,这些研究通常只关注提高估计结果的准确性。在本文中,我们提出了一种快速、统一的端到端模型,用于估计三维人体姿态,称为 YOLOv5-HR-TCM(YOLOv5-HRet-时间卷积模型)。我们的模型基于二维到三维提升方法进行三维人体姿态估计,同时考虑到估计过程中的每个步骤,如人体检测、二维人体姿态估计和三维人体姿态估计。该模型是每个阶段最佳实践的组合。我们的模型在 Human 3.6M 数据集上进行评估,并在每个步骤与其他方法进行比较。该方法在不牺牲处理速度的情况下实现了高精度。在低端计算机上,整个过程的估计时间为 3.146 FPS。特别是,我们提出了一种基于估计的三维人体姿态与标准(参考)原点之间偏差角度的运动评分应用。在 Human 3.6M 数据集上评估的平均偏差角度(协议 #1-Pro #1)为 8.2 度。

相似文献

1
Unified End-to-End YOLOv5-HR-TCM Framework for Automatic 2D/3D Human Pose Estimation for Real-Time Applications.用于实时应用的二维/三维人体姿态自动估计的统一端到端 YOLOv5-HR-TCM 框架。
Sensors (Basel). 2022 Jul 20;22(14):5419. doi: 10.3390/s22145419.
2
Detection, segmentation, and 3D pose estimation of surgical tools using convolutional neural networks and algebraic geometry.使用卷积神经网络和代数几何进行手术工具的检测、分割和三维姿态估计。
Med Image Anal. 2021 May;70:101994. doi: 10.1016/j.media.2021.101994. Epub 2021 Feb 7.
3
A self-supervised spatio-temporal attention network for video-based 3D infant pose estimation.基于视频的 3D 婴儿姿态估计的自监督时空注意网络。
Med Image Anal. 2024 Aug;96:103208. doi: 10.1016/j.media.2024.103208. Epub 2024 May 18.
4
Recognition of Forward Head Posture Through 3D Human Pose Estimation With a Graph Convolutional Network: Development and Feasibility Study.基于图卷积网络的 3D 人体姿态估计识别探颈姿势:开发与可行性研究。
JMIR Form Res. 2024 Aug 26;8:e55476. doi: 10.2196/55476.
5
LCR-Net++: Multi-Person 2D and 3D Pose Detection in Natural Images.LCR-Net++:自然图像中的多人 2D 和 3D 姿态检测。
IEEE Trans Pattern Anal Mach Intell. 2020 May;42(5):1146-1161. doi: 10.1109/TPAMI.2019.2892985. Epub 2019 Jan 14.
6
LHPE-nets: A lightweight 2D and 3D human pose estimation model with well-structural deep networks and multi-view pose sample simplification method.LHPE-nets:一种具有良好结构深度网络和多视图姿态样本简化方法的轻量级 2D 和 3D 人体姿态估计模型。
PLoS One. 2022 Feb 23;17(2):e0264302. doi: 10.1371/journal.pone.0264302. eCollection 2022.
7
An Efficient 3D Human Pose Retrieval and Reconstruction from 2D Image-Based Landmarks.基于二维图像特征点的高效三维人体姿态检索与重建。
Sensors (Basel). 2021 Apr 1;21(7):2415. doi: 10.3390/s21072415.
8
A deep learning approach for pose estimation from volumetric OCT data.基于深度学习的体 OCT 数据的姿态估计方法。
Med Image Anal. 2018 May;46:162-179. doi: 10.1016/j.media.2018.03.002. Epub 2018 Mar 10.
9
Top-Down System for Multi-Person 3D Absolute Pose Estimation from Monocular Videos.基于单目视频的多人 3D 绝对姿态估计的自顶向下系统。
Sensors (Basel). 2022 May 28;22(11):4109. doi: 10.3390/s22114109.
10
A Deep Neural Network-based method for estimation of 3D lifting motions.基于深度神经网络的三维提升运动估计方法。
J Biomech. 2019 Feb 14;84:87-93. doi: 10.1016/j.jbiomech.2018.12.022. Epub 2018 Dec 19.

引用本文的文献

1
Post-secondary classroom teaching quality evaluation using small object detection model.基于小目标检测模型的高等院校课堂教学质量评估
Sci Rep. 2024 Mar 9;14(1):5816. doi: 10.1038/s41598-024-56505-4.
2
A Systematic Review of Recent Deep Learning Approaches for 3D Human Pose Estimation.近期用于三维人体姿态估计的深度学习方法的系统综述。
J Imaging. 2023 Dec 12;9(12):275. doi: 10.3390/jimaging9120275.
3
Enhanced 3D Pose Estimation in Multi-Person, Multi-View Scenarios through Unsupervised Domain Adaptation with Dropout Discriminator.

本文引用的文献

1
Human Segmentation and Tracking Survey on Masks for MADS Dataset.MADS 数据集上的口罩人类分割和跟踪调查。
Sensors (Basel). 2021 Dec 16;21(24):8397. doi: 10.3390/s21248397.
2
Toward Modeling Psychomotor Performance in Karate Combats Using Computer Vision Pose Estimation.运用计算机视觉姿势估计技术对空手道搏击进行心理运动表现建模。
Sensors (Basel). 2021 Dec 15;21(24):8378. doi: 10.3390/s21248378.
3
Applications of Pose Estimation in Human Health and Performance across the Lifespan.姿势估计在人类全生命周期健康和表现中的应用。
通过带有随机失活判别器的无监督域适应在多人多视图场景中增强3D姿态估计
Sensors (Basel). 2023 Oct 12;23(20):8406. doi: 10.3390/s23208406.
4
Analysis of the mixed teaching of college physical education based on the health big data and blockchain technology.基于健康大数据与区块链技术的高校体育混合式教学分析
PeerJ Comput Sci. 2023 Jan 20;9:e1206. doi: 10.7717/peerj-cs.1206. eCollection 2023.
5
Optically Non-Contact Cross-Country Skiing Action Recognition Based on Key-Point Collaborative Estimation and Motion Feature Extraction.基于关键点协作估计和运动特征提取的非接触式越野滑雪动作识别。
Sensors (Basel). 2023 Mar 31;23(7):3639. doi: 10.3390/s23073639.
6
Faster Deep Inertial Pose Estimation with Six Inertial Sensors.六惯性传感器的更快深度惯性姿态估计。
Sensors (Basel). 2022 Sep 21;22(19):7144. doi: 10.3390/s22197144.
Sensors (Basel). 2021 Nov 3;21(21):7315. doi: 10.3390/s21217315.
4
A Systematic Review of the Application of Camera-Based Human Pose Estimation in the Field of Sport and Physical Exercise.基于摄像头的人体姿势估计在运动和体育锻炼领域的应用系统综述
Sensors (Basel). 2021 Sep 7;21(18):5996. doi: 10.3390/s21185996.
5
Deep High-Resolution Representation Learning for Visual Recognition.用于视觉识别的深度高分辨率表征学习
IEEE Trans Pattern Anal Mach Intell. 2021 Oct;43(10):3349-3364. doi: 10.1109/TPAMI.2020.2983686. Epub 2021 Sep 2.
6
3D Human Pose Machines with Self-Supervised Learning.基于自监督学习的 3D 人体姿态估计
IEEE Trans Pattern Anal Mach Intell. 2020 May;42(5):1069-1082. doi: 10.1109/TPAMI.2019.2892452. Epub 2019 Jan 14.
7
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.
8
Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments.Human3.6M:自然环境中 3D 人体感应的大规模数据集和预测方法。
IEEE Trans Pattern Anal Mach Intell. 2014 Jul;36(7):1325-39. doi: 10.1109/TPAMI.2013.248.