• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种利用知识蒸馏方法在轻量化的自上而下姿态估计网络中稳定性能的有效方法。

An Efficient Approach Using Knowledge Distillation Methods to Stabilize Performance in a Lightweight Top-Down Posture Estimation Network.

机构信息

Department of Electrical and Electronic Engineering, Yonsei University, Seoul 03722, Korea.

Department of Artificial Intelligence Convergence, Kwangju Women's University, Gwangju 62396, Korea.

出版信息

Sensors (Basel). 2021 Nov 17;21(22):7640. doi: 10.3390/s21227640.

DOI:10.3390/s21227640
PMID:34833717
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8623800/
Abstract

Multi-person pose estimation has been gaining considerable interest due to its use in several real-world applications, such as activity recognition, motion capture, and augmented reality. Although the improvement of the accuracy and speed of multi-person pose estimation techniques has been recently studied, limitations still exist in balancing these two aspects. In this paper, a novel knowledge distilled lightweight top-down pose network (KDLPN) is proposed that balances computational complexity and accuracy. For the first time in multi-person pose estimation, a network that reduces computational complexity by applying a "Pelee" structure and shuffles pixels in the dense upsampling convolution layer to reduce the number of channels is presented. Furthermore, to prevent performance degradation because of the reduced computational complexity, knowledge distillation is applied to establish the pose estimation network as a teacher network. The method performance is evaluated on the MSCOCO dataset. Experimental results demonstrate that our KDLPN network significantly reduces 95% of the parameters required by state-of-the-art methods with minimal performance degradation. Moreover, our method is compared with other pose estimation methods to substantiate the importance of computational complexity reduction and its effectiveness.

摘要

多人姿态估计由于在活动识别、运动捕捉和增强现实等多个实际应用中的应用而受到广泛关注。尽管最近已经研究了提高多人姿态估计技术的准确性和速度的方法,但在平衡这两个方面仍然存在局限性。在本文中,提出了一种新颖的知识蒸馏轻量化自上而下的姿态网络 (KDLPN),可以平衡计算复杂度和准确性。在多人姿态估计中,首次提出了一种通过应用“Pelee”结构和在密集上采样卷积层中打乱像素来减少通道数量来降低计算复杂度的网络。此外,为了防止由于计算复杂度降低而导致性能下降,应用知识蒸馏将姿态估计网络建立为教师网络。该方法在 MSCOCO 数据集上进行了性能评估。实验结果表明,我们的 KDLPN 网络在显著减少 95%的参数的同时,性能下降最小。此外,我们的方法与其他姿态估计方法进行了比较,以证明计算复杂度降低的重要性及其有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60d3/8623800/2c8cb6efc902/sensors-21-07640-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60d3/8623800/7febf275dcbc/sensors-21-07640-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60d3/8623800/7412a43761cb/sensors-21-07640-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60d3/8623800/d7d43ade3917/sensors-21-07640-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60d3/8623800/18c9f1c80c78/sensors-21-07640-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60d3/8623800/5fb038164e74/sensors-21-07640-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60d3/8623800/9a87ed3dfb25/sensors-21-07640-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60d3/8623800/c96ebaf679f8/sensors-21-07640-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60d3/8623800/2c8cb6efc902/sensors-21-07640-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60d3/8623800/7febf275dcbc/sensors-21-07640-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60d3/8623800/7412a43761cb/sensors-21-07640-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60d3/8623800/d7d43ade3917/sensors-21-07640-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60d3/8623800/18c9f1c80c78/sensors-21-07640-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60d3/8623800/5fb038164e74/sensors-21-07640-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60d3/8623800/9a87ed3dfb25/sensors-21-07640-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60d3/8623800/c96ebaf679f8/sensors-21-07640-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/60d3/8623800/2c8cb6efc902/sensors-21-07640-g008.jpg

相似文献

1
An Efficient Approach Using Knowledge Distillation Methods to Stabilize Performance in a Lightweight Top-Down Posture Estimation Network.一种利用知识蒸馏方法在轻量化的自上而下姿态估计网络中稳定性能的有效方法。
Sensors (Basel). 2021 Nov 17;21(22):7640. doi: 10.3390/s21227640.
2
An improved lightweight high-resolution network based on multi-dimensional weighting for human pose estimation.基于多维加权的改进型轻量化高分辨率网络的人体姿态估计
Sci Rep. 2023 May 4;13(1):7284. doi: 10.1038/s41598-023-33938-x.
3
Shift Pose: A Lightweight Transformer-like Neural Network for Human Pose Estimation.移位姿势:一种用于人体姿势估计的轻量级类 Transformer 神经网络。
Sensors (Basel). 2022 Sep 25;22(19):7264. doi: 10.3390/s22197264.
4
AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time.AlphaPose:实时的全身区域多人姿态估计和跟踪。
IEEE Trans Pattern Anal Mach Intell. 2023 Jun;45(6):7157-7173. doi: 10.1109/TPAMI.2022.3222784.
5
LHPE-nets: A lightweight 2D and 3D human pose estimation model with well-structural deep networks and multi-view pose sample simplification method.LHPE-nets:一种具有良好结构深度网络和多视图姿态样本简化方法的轻量级 2D 和 3D 人体姿态估计模型。
PLoS One. 2022 Feb 23;17(2):e0264302. doi: 10.1371/journal.pone.0264302. eCollection 2022.
6
Unified End-to-End YOLOv5-HR-TCM Framework for Automatic 2D/3D Human Pose Estimation for Real-Time Applications.用于实时应用的二维/三维人体姿态自动估计的统一端到端 YOLOv5-HR-TCM 框架。
Sensors (Basel). 2022 Jul 20;22(14):5419. doi: 10.3390/s22145419.
7
Cross-View Gait Recognition Method Based on Multi-Teacher Joint Knowledge Distillation.基于多教师联合知识蒸馏的跨视角步态识别方法
Sensors (Basel). 2023 Nov 20;23(22):9289. doi: 10.3390/s23229289.
8
Application of structured support vector machine backpropagation to a convolutional neural network for human pose estimation.将结构化支持向量机反向传播应用于卷积神经网络进行人体姿态估计。
Neural Netw. 2017 Aug;92:39-46. doi: 10.1016/j.neunet.2017.02.005. Epub 2017 Feb 16.
9
Fusion Poser: 3D Human Pose Estimation Using Sparse IMUs and Head Trackers in Real Time.融合定位器:利用稀疏惯性测量单元和实时头部跟踪器进行 3D 人体姿态估计。
Sensors (Basel). 2022 Jun 27;22(13):4846. doi: 10.3390/s22134846.
10
CONet: Crowd and occlusion-aware network for occluded human pose estimation.CONet:用于遮挡人体姿态估计的人群和遮挡感知网络。
Neural Netw. 2024 Apr;172:106109. doi: 10.1016/j.neunet.2024.106109. Epub 2024 Jan 9.

引用本文的文献

1
Enhanced 3D Pose Estimation in Multi-Person, Multi-View Scenarios through Unsupervised Domain Adaptation with Dropout Discriminator.通过带有随机失活判别器的无监督域适应在多人多视图场景中增强3D姿态估计
Sensors (Basel). 2023 Oct 12;23(20):8406. doi: 10.3390/s23208406.

本文引用的文献

1
Student Behavior Recognition System for the Classroom Environment Based on Skeleton Pose Estimation and Person Detection.基于骨骼姿势估计和人员检测的课堂环境学生行为识别系统。
Sensors (Basel). 2021 Aug 6;21(16):5314. doi: 10.3390/s21165314.
2
Nonintrusive Fine-Grained Home Care Monitoring: Characterizing Quality of In-Home Postural Changes Using Bone-Based Human Sensing.非侵入式精细居家护理监测:基于骨骼的人体感应技术对居家姿势变化的特征描述。
Sensors (Basel). 2020 Oct 18;20(20):5894. doi: 10.3390/s20205894.
3
A Survey of Marker-Less Tracking and Registration Techniques for Health & Environmental Applications to Augmented Reality and Ubiquitous Geospatial Information Systems.
健康与环境应用中的无标记跟踪和注册技术在增强现实和普适地理空间信息系统中的调查。
Sensors (Basel). 2020 May 25;20(10):2997. doi: 10.3390/s20102997.
4
OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields.OpenPose:基于部件亲和力字段的实时多人 2D 姿态估计。
IEEE Trans Pattern Anal Mach Intell. 2021 Jan;43(1):172-186. doi: 10.1109/TPAMI.2019.2929257. Epub 2020 Dec 4.
5
Human Pose Estimation from Monocular Images: A Comprehensive Survey.单目图像人体姿态估计:全面综述
Sensors (Basel). 2016 Nov 25;16(12):1966. doi: 10.3390/s16121966.
6
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.
7
Articulated human detection with flexible mixtures of parts.具有灵活部件混合的关节式人体检测。
IEEE Trans Pattern Anal Mach Intell. 2013 Dec;35(12):2878-90. doi: 10.1109/TPAMI.2012.261.