• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于人体姿态估计的分层上下文细化网络

Hierarchical Contextual Refinement Networks for Human Pose Estimation.

作者信息

Nie Xuecheng, Feng Jiashi, Xing Junliang, Xiao Shengtao, Yan Shuicheng

出版信息

IEEE Trans Image Process. 2018 Oct 5. doi: 10.1109/TIP.2018.2872628.

DOI:10.1109/TIP.2018.2872628
PMID:30296223
Abstract

Predicting human pose in the wild is a challenging problem due to high flexibility of joints and possible occlusion. Existing approaches generally tackle the difficulties either by holistic prediction or multi-stage processing, which suffer from poor performance for locating challenging joints or high computational cost. In this paper, we propose a new Hierarchical Contextual Refinement Network (HCRN) to robustly predict human poses in an efficient manner, where human body joints of different complexities are processed at different layers in a context hierarchy. Different from existing approaches, our proposed model predicts positions of joints from easy to difficult in a single stage through effectively exploiting informative contexts provided in the previous layer. Such approach offers two appealing advantages over state-of-the-arts: (1) more accurate than predicting all the joints together and (2) more efficient than multi-stage processing methods. We design a Contextual Refinement Unit (CRU) to implement the proposed model, which enables auto-diffusion of joint detection results to effectively transfer informative context from easy joints to difficult ones. In this way, difficult joints can be reliably detected even in presence of occlusion or severe distracting factors. Multiple CRUs are organized into a tree-structured hierarchy which is end-to-end trainable and does not require processing joints for multiple iterations. Comprehensive experiments evaluate the efficacy and efficiency of the proposed HCRN model to improve well-established baselines and achieve new state-of-the-art on multiple human pose estimation benchmarks.

摘要

由于关节的高度灵活性和可能的遮挡,在自然环境中预测人体姿态是一个具有挑战性的问题。现有方法通常通过整体预测或多阶段处理来解决这些困难,但在定位具有挑战性的关节时性能较差,或者计算成本较高。在本文中,我们提出了一种新的分层上下文细化网络(HCRN),以高效的方式稳健地预测人体姿态,其中不同复杂度的人体关节在上下文层次结构的不同层进行处理。与现有方法不同,我们提出的模型通过有效利用前一层提供的信息上下文,在单个阶段从易到难预测关节的位置。这种方法相对于现有技术具有两个吸引人的优点:(1)比一起预测所有关节更准确;(2)比多阶段处理方法更高效。我们设计了一个上下文细化单元(CRU)来实现所提出的模型,它能够使关节检测结果自动扩散,从而有效地将信息上下文从简单关节传递到困难关节。通过这种方式,即使在存在遮挡或严重干扰因素的情况下,也能可靠地检测到困难关节。多个CRU被组织成一个树形结构层次,该层次是端到端可训练的,并且不需要对关节进行多次迭代处理。综合实验评估了所提出的HCRN模型的有效性和效率,以改进成熟的基线,并在多个人体姿态估计基准上达到新的最先进水平。

相似文献

1
Hierarchical Contextual Refinement Networks for Human Pose Estimation.用于人体姿态估计的分层上下文细化网络
IEEE Trans Image Process. 2018 Oct 5. doi: 10.1109/TIP.2018.2872628.
2
Multi-Person Pose Estimation via Multi-Layer Fractal Network and Joints Kinship Pattern.基于多层分形网络与关节关联模式的多人姿态估计。
IEEE Trans Image Process. 2019 Jan;28(1):142-155. doi: 10.1109/TIP.2018.2865666. Epub 2018 Aug 22.
3
DetPoseNet: Improving Multi-Person Pose Estimation via Coarse-Pose Filtering.DetPoseNet:通过粗粒度姿势过滤提高多人姿势估计。
IEEE Trans Image Process. 2022;31:2782-2795. doi: 10.1109/TIP.2022.3161081. Epub 2022 Apr 4.
4
Relation-Based Associative Joint Location for Human Pose Estimation in Videos.基于关系的关联关节位置估计在视频人体姿态估计中的应用
IEEE Trans Image Process. 2022;31:3973-3986. doi: 10.1109/TIP.2022.3177959. Epub 2022 Jun 9.
5
Multi-Person Pose Estimation Using an Orientation and Occlusion Aware Deep Learning Network.基于方向和遮挡感知的深度学习网络的多人姿态估计。
Sensors (Basel). 2020 Mar 12;20(6):1593. doi: 10.3390/s20061593.
6
Dual Networks Based 3D Multi-Person Pose Estimation From Monocular Video.基于双网络的单目视频3D多人姿态估计
IEEE Trans Pattern Anal Mach Intell. 2023 Feb;45(2):1636-1651. doi: 10.1109/TPAMI.2022.3170353. Epub 2023 Jan 6.
7
Weakly Supervised Adversarial Learning for 3D Human Pose Estimation from Point Clouds.基于点云的弱监督对抗学习三维人体姿态估计
IEEE Trans Vis Comput Graph. 2020 May;26(5):1851-1859. doi: 10.1109/TVCG.2020.2973076. Epub 2020 Feb 13.
8
WPL-Based Constraint for 3D Human Pose Estimation from a Single Depth Image.基于 WPL 的单目深度图像三维人体姿态估计约束。
Sensors (Basel). 2022 Nov 22;22(23):9040. doi: 10.3390/s22239040.
9
ORGM: Occlusion Relational Graphical Model for Human Pose Estimation.ORGM:用于人体姿态估计的遮挡关系图形模型。
IEEE Trans Image Process. 2017 Feb;26(2):927-941. doi: 10.1109/TIP.2016.2639441. Epub 2016 Dec 14.
10
Edge Preserving and Multi-Scale Contextual Neural Network for Salient Object Detection.边缘保持和多尺度上下文神经网络的显著目标检测。
IEEE Trans Image Process. 2018;27(1):121-134. doi: 10.1109/TIP.2017.2756825.

引用本文的文献

1
Analysis of the mixed teaching of college physical education based on the health big data and blockchain technology.基于健康大数据与区块链技术的高校体育混合式教学分析
PeerJ Comput Sci. 2023 Jan 20;9:e1206. doi: 10.7717/peerj-cs.1206. eCollection 2023.
2
Geriatric Care Management System Powered by the IoT and Computer Vision Techniques.由物联网和计算机视觉技术驱动的老年护理管理系统
Healthcare (Basel). 2023 Apr 17;11(8):1152. doi: 10.3390/healthcare11081152.
3
Evaluation of Physical Education Teaching Effect Based on Action Skill Recognition.
基于动作技能识别的体育教学效果评估
Comput Intell Neurosci. 2022 May 31;2022:9489704. doi: 10.1155/2022/9489704. eCollection 2022.
4
Research on Multicamera Photography Image Art in BERT Motion Based on Deep Learning Mode.基于深度学习模式的 BERT 运动多相机摄影图像艺术研究。
Comput Intell Neurosci. 2022 Apr 27;2022:2819269. doi: 10.1155/2022/2819269. eCollection 2022.
5
Research on Application of Sports Training Performance Prediction Based on Convolutional Neural Network.基于卷积神经网络的运动训练性能预测应用研究。
Comput Math Methods Med. 2022 Mar 25;2022:7295833. doi: 10.1155/2022/7295833. eCollection 2022.
6
The Visual Movement Analysis of Physical Education Teaching considering the Generalized Hough Transform Model.基于广义霍夫变换模型的体育教学视向运动分析。
Comput Intell Neurosci. 2022 Mar 9;2022:3675319. doi: 10.1155/2022/3675319. eCollection 2022.
7
A Digital Feature Recognition Technology Used in Ballet Training Action Correction.一种用于芭蕾舞训练动作纠正的数字特征识别技术。
Comput Intell Neurosci. 2022 Feb 25;2022:7953172. doi: 10.1155/2022/7953172. eCollection 2022.