• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于图像传感器数据的改进卷积位姿机人体位姿估计

Improved Convolutional Pose Machines for Human Pose Estimation Using Image Sensor Data.

机构信息

Guangxi Key Laboratory of Trusted Software, Guilin University of Electronic Technology, Guilin 541004, China.

出版信息

Sensors (Basel). 2019 Feb 10;19(3):718. doi: 10.3390/s19030718.

DOI:10.3390/s19030718
PMID:30744191
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6386920/
Abstract

In recent years, increasing human data comes from image sensors. In this paper, a novel approach combining convolutional pose machines (CPMs) with GoogLeNet is proposed for human pose estimation using image sensor data. The first stage of the CPMs directly generates a response map of each human skeleton's key points from images, in which we introduce some layers from the GoogLeNet. On the one hand, the improved model uses deeper network layers and more complex network structures to enhance the ability of low level feature extraction. On the other hand, the improved model applies a fine-tuning strategy, which benefits the estimation accuracy. Moreover, we introduce the inception structure to greatly reduce parameters of the model, which reduces the convergence time significantly. Extensive experiments on several datasets show that the improved model outperforms most mainstream models in accuracy and training time. The prediction efficiency of the improved model is improved by 1.023 times compared with the CPMs. At the same time, the training time of the improved model is reduced 3.414 times. This paper presents a new idea for future research.

摘要

近年来,越来越多的人类数据来自图像传感器。在本文中,提出了一种结合卷积位姿机(CPMs)和 GoogLeNet 的新方法,用于使用图像传感器数据进行人体姿态估计。CPMs 的第一阶段直接从图像中生成每个人体骨骼关键点的响应图,其中引入了一些来自 GoogLeNet 的层。一方面,改进后的模型使用更深的网络层和更复杂的网络结构来增强低级特征提取的能力。另一方面,改进后的模型应用了微调策略,这有利于提高估计精度。此外,我们引入了 inception 结构,大大减少了模型的参数,显著减少了收敛时间。在几个数据集上的广泛实验表明,改进后的模型在准确性和训练时间方面优于大多数主流模型。与 CPMs 相比,改进后的模型的预测效率提高了 1.023 倍。同时,改进后的模型的训练时间减少了 3.414 倍。本文为未来的研究提供了一个新的思路。

相似文献

1
Improved Convolutional Pose Machines for Human Pose Estimation Using Image Sensor Data.基于图像传感器数据的改进卷积位姿机人体位姿估计
Sensors (Basel). 2019 Feb 10;19(3):718. doi: 10.3390/s19030718.
2
KSL-POSE: A Real-Time 2D Human Pose Estimation Method Based on Modified YOLOv8-Pose Framework.KSL-POSE:一种基于改进 YOLOv8-Pose 框架的实时 2D 人体姿态估计方法。
Sensors (Basel). 2024 Sep 26;24(19):6249. doi: 10.3390/s24196249.
3
Estimating Human Pose Efficiently by Parallel Pyramid Networks.通过并行金字塔网络高效估计人体姿势。
IEEE Trans Image Process. 2021;30:6785-6800. doi: 10.1109/TIP.2021.3097836. Epub 2021 Jul 30.
4
Head and Body Orientation Estimation Using Convolutional Random Projection Forests.基于卷积随机投影森林的头和身体方向估计。
IEEE Trans Pattern Anal Mach Intell. 2019 Jan;41(1):107-120. doi: 10.1109/TPAMI.2017.2784424. Epub 2017 Dec 18.
5
A novel biomedical image indexing and retrieval system via deep preference learning.一种基于深度偏好学习的新型生物医学图像索引和检索系统。
Comput Methods Programs Biomed. 2018 May;158:53-69. doi: 10.1016/j.cmpb.2018.02.003. Epub 2018 Feb 6.
6
Synthesizing Depth Hand Images with GANs and Style Transfer for Hand Pose Estimation.使用 GAN 和风格迁移合成深度手图像进行手姿势估计。
Sensors (Basel). 2019 Jul 1;19(13):2919. doi: 10.3390/s19132919.
7
Application of structured support vector machine backpropagation to a convolutional neural network for human pose estimation.将结构化支持向量机反向传播应用于卷积神经网络进行人体姿态估计。
Neural Netw. 2017 Aug;92:39-46. doi: 10.1016/j.neunet.2017.02.005. Epub 2017 Feb 16.
8
Real-Time 3D Hand Pose Estimation with 3D Convolutional Neural Networks.基于3D卷积神经网络的实时3D手部姿态估计
IEEE Trans Pattern Anal Mach Intell. 2019 Apr;41(4):956-970. doi: 10.1109/TPAMI.2018.2827052. Epub 2018 Apr 16.
9
In-Bed Pose Estimation: Deep Learning With Shallow Dataset.床上姿势估计:基于浅层数据集的深度学习
IEEE J Transl Eng Health Med. 2019 Jan 14;7:4900112. doi: 10.1109/JTEHM.2019.2892970. eCollection 2019.
10
Head Pose Estimation on Top of Haar-Like Face Detection: A Study Using the Kinect Sensor.基于类 Haar 人脸检测的头部姿态估计:一项使用 Kinect 传感器的研究
Sensors (Basel). 2015 Aug 26;15(9):20945-66. doi: 10.3390/s150920945.

引用本文的文献

1
A systematic review of the applications of markerless motion capture (MMC) technology for clinical measurement in rehabilitation.基于无标记运动捕捉(MMC)技术的临床康复测量应用的系统综述。
J Neuroeng Rehabil. 2023 May 2;20(1):57. doi: 10.1186/s12984-023-01186-9.
2
Thermographic Fault Diagnosis of Shaft of BLDC Motor.BLDC 电机轴的热成像故障诊断。
Sensors (Basel). 2022 Nov 5;22(21):8537. doi: 10.3390/s22218537.
3
Student Behavior Recognition System for the Classroom Environment Based on Skeleton Pose Estimation and Person Detection.

本文引用的文献

1
Hyperspectral Image Classification with Capsule Network Using Limited Training Samples.基于受限训练样本的胶囊网络高光谱图像分类
Sensors (Basel). 2018 Sep 18;18(9):3153. doi: 10.3390/s18093153.
2
Action Recognition by an Attention-Aware Temporal Weighted Convolutional Neural Network.基于注意力感知时间加权卷积神经网络的动作识别。
Sensors (Basel). 2018 Jun 21;18(7):1979. doi: 10.3390/s18071979.
3
Human Pose Estimation from Monocular Images: A Comprehensive Survey.单目图像人体姿态估计:全面综述
基于骨骼姿势估计和人员检测的课堂环境学生行为识别系统。
Sensors (Basel). 2021 Aug 6;21(16):5314. doi: 10.3390/s21165314.
4
Solubility Prediction from Molecular Properties and Analytical Data Using an In-phase Deep Neural Network (Ip-DNN).利用同相深度神经网络(Ip-DNN)从分子性质和分析数据预测溶解度
ACS Omega. 2021 May 17;6(22):14278-14287. doi: 10.1021/acsomega.1c01035. eCollection 2021 Jun 8.
5
Accuracy of Monocular Two-Dimensional Pose Estimation Compared With a Reference Standard for Kinematic Multiview Analysis: Validation Study.单目二维姿态估计与运动多角度分析参考标准的准确性比较:验证研究。
JMIR Mhealth Uhealth. 2020 Dec 21;8(12):e19608. doi: 10.2196/19608.
6
Convolutional Neural Networks-Based Object Detection Algorithm by Jointing Semantic Segmentation for Images.基于卷积神经网络的图像语义分割联合的目标检测算法。
Sensors (Basel). 2020 Sep 7;20(18):5080. doi: 10.3390/s20185080.
7
3D Pose Detection of Closely Interactive Humans Using Multi-View Cameras.使用多视角相机进行近距离交互人体的 3D 姿态检测。
Sensors (Basel). 2019 Jun 25;19(12):2831. doi: 10.3390/s19122831.
Sensors (Basel). 2016 Nov 25;16(12):1966. doi: 10.3390/s16121966.
4
Learning long-term dependencies with gradient descent is difficult.使用梯度下降法学习长期依赖关系是困难的。
IEEE Trans Neural Netw. 1994;5(2):157-66. doi: 10.1109/72.279181.