• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于师生框架的软目标训练估计行人姿态方向。

Estimation of Pedestrian Pose Orientation Using Soft Target Training Based on Teacher⁻Student Framework.

机构信息

Department of Computer Engineering, Keimyung University, Daegu 42601, Korea.

出版信息

Sensors (Basel). 2019 Mar 6;19(5):1147. doi: 10.3390/s19051147.

DOI:10.3390/s19051147
PMID:30845772
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6427411/
Abstract

Semi-supervised learning is known to achieve better generalisation than a model learned solely from labelled data. Therefore, we propose a new method for estimating a pedestrian pose orientation using a soft-target method, which is a type of semi-supervised learning method. Because a convolutional neural network (CNN) based pose orientation estimation requires large numbers of parameters and operations, we apply the teacher⁻student algorithm to generate a compressed student model with high accuracy and compactness resembling that of the teacher model by combining a deep network with a random forest. After the teacher model is generated using hard target data, the softened outputs (soft-target data) of the teacher model are used for training the student model. Moreover, the orientation of the pedestrian has specific shape patterns, and a wavelet transform is applied to the input image as a pre-processing step owing to its good spatial frequency localisation property and the ability to preserve both the spatial information and gradient information of an image. For a benchmark dataset considering real driving situations based on a single camera, we used the TUD and KITTI datasets. We applied the proposed algorithm to various driving images in the datasets, and the results indicate that its classification performance with regard to the pose orientation is better than that of other state-of-the-art methods based on a CNN. In addition, the computational speed of the proposed student model is faster than that of other deep CNNs owing to the shorter model structure with a smaller number of parameters.

摘要

半监督学习被认为比仅从标记数据中学习的模型具有更好的泛化能力。因此,我们提出了一种使用软目标方法估计行人姿态方向的新方法,软目标方法是一种半监督学习方法。由于基于卷积神经网络(CNN)的姿态方向估计需要大量的参数和操作,我们应用教师⁻学生算法通过将深度网络与随机森林相结合来生成具有高精度和紧凑性的压缩学生模型,其精度和紧凑性类似于教师模型。在使用硬目标数据生成教师模型之后,使用教师模型的软化输出(软目标数据)来训练学生模型。此外,行人的姿态具有特定的形状模式,并且由于其良好的空间频率定位特性以及能够保留图像的空间信息和梯度信息,因此将小波变换应用于输入图像作为预处理步骤。对于基于单目相机的考虑真实驾驶情况的基准数据集,我们使用了 TUD 和 KITTI 数据集。我们将所提出的算法应用于数据集的各种驾驶图像,结果表明,其关于姿态方向的分类性能优于其他基于 CNN 的最先进方法。此外,由于模型结构较短且参数较少,因此所提出的学生模型的计算速度比其他深度 CNN 更快。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aba/6427411/0e10d618d307/sensors-19-01147-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aba/6427411/7764dcecaf2b/sensors-19-01147-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aba/6427411/9374235c1172/sensors-19-01147-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aba/6427411/55a797b83232/sensors-19-01147-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aba/6427411/e8e400b9781b/sensors-19-01147-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aba/6427411/0e10d618d307/sensors-19-01147-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aba/6427411/7764dcecaf2b/sensors-19-01147-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aba/6427411/9374235c1172/sensors-19-01147-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aba/6427411/55a797b83232/sensors-19-01147-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aba/6427411/e8e400b9781b/sensors-19-01147-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5aba/6427411/0e10d618d307/sensors-19-01147-g005.jpg

相似文献

1
Estimation of Pedestrian Pose Orientation Using Soft Target Training Based on Teacher⁻Student Framework.基于师生框架的软目标训练估计行人姿态方向。
Sensors (Basel). 2019 Mar 6;19(5):1147. doi: 10.3390/s19051147.
2
Real-Time 3D Hand Pose Estimation with 3D Convolutional Neural Networks.基于3D卷积神经网络的实时3D手部姿态估计
IEEE Trans Pattern Anal Mach Intell. 2019 Apr;41(4):956-970. doi: 10.1109/TPAMI.2018.2827052. Epub 2018 Apr 16.
3
WHSP-Net: A Weakly-Supervised Approach for 3D Hand Shape and Pose Recovery from a Single Depth Image.WHSP-Net:一种用于从单张深度图像中恢复三维手部形状和姿态的弱监督方法。
Sensors (Basel). 2019 Aug 31;19(17):3784. doi: 10.3390/s19173784.
4
Esophagus segmentation in CT via 3D fully convolutional neural network and random walk.基于 3D 全卷积神经网络和随机游走的 CT 食管分割。
Med Phys. 2017 Dec;44(12):6341-6352. doi: 10.1002/mp.12593. Epub 2017 Oct 23.
5
A novel end-to-end classifier using domain transferred deep convolutional neural networks for biomedical images.一种使用域转移深度卷积神经网络的新型端到端生物医学图像分类器。
Comput Methods Programs Biomed. 2017 Mar;140:283-293. doi: 10.1016/j.cmpb.2016.12.019. Epub 2017 Jan 6.
6
Road Surface Damage Detection Using Fully Convolutional Neural Networks and Semi-Supervised Learning.基于全卷积神经网络和半监督学习的路面损伤检测
Sensors (Basel). 2019 Dec 12;19(24):5501. doi: 10.3390/s19245501.
7
Accurate Pedestrian Detection by Human Pose Regression.通过人体姿态回归实现精确的行人检测
IEEE Trans Image Process. 2019 Sep 26. doi: 10.1109/TIP.2019.2942686.
8
Semi-Supervised Deep Learning Using Pseudo Labels for Hyperspectral Image Classification.基于伪标签的半监督深度学习在高光谱图像分类中的应用。
IEEE Trans Image Process. 2018 Mar;27(3):1259-1270. doi: 10.1109/TIP.2017.2772836. Epub 2017 Nov 13.
9
Faster R-CNN for Robust Pedestrian Detection Using Semantic Segmentation Network.使用语义分割网络的快速R-CNN用于稳健行人检测
Front Neurorobot. 2018 Oct 5;12:64. doi: 10.3389/fnbot.2018.00064. eCollection 2018.
10
Hierarchical Recurrent Neural Hashing for Image Retrieval With Hierarchical Convolutional Features.基于层次卷积特征的层次递归神经网络哈希图像检索
IEEE Trans Image Process. 2018;27(1):106-120. doi: 10.1109/TIP.2017.2755766.

引用本文的文献

1
Pedestrian POSE estimation using multi-branched deep learning pose net.基于多分支深度学习姿态网络的行人姿态估计
PLoS One. 2025 Jan 24;20(1):e0312177. doi: 10.1371/journal.pone.0312177. eCollection 2025.
2
CYCLOPS: A cyclists' orientation data acquisition system using RGB camera and inertial measurement units (IMU).CYCLOPS:一种使用RGB摄像头和惯性测量单元(IMU)的自行车骑行者定向数据采集系统。
HardwareX. 2024 Apr 18;18:e00534. doi: 10.1016/j.ohx.2024.e00534. eCollection 2024 Jun.
3
A Lightweight Automatic Wildlife Recognition Model Design Method Mitigating Shortcut Learning.

本文引用的文献

1
Driver's Facial Expression Recognition in Real-Time for Safe Driving.实时驾驶员面部表情识别,保障安全驾驶。
Sensors (Basel). 2018 Dec 4;18(12):4270. doi: 10.3390/s18124270.
2
MonoCap: Monocular Human Motion Capture using a CNN Coupled with a Geometric Prior.单目人体运动捕捉:使用结合几何先验的卷积神经网络进行单目人体运动捕捉。
IEEE Trans Pattern Anal Mach Intell. 2019 Apr;41(4):901-914. doi: 10.1109/TPAMI.2018.2816031. Epub 2018 Mar 15.
3
Head and Body Orientation Estimation Using Convolutional Random Projection Forests.基于卷积随机投影森林的头和身体方向估计。
一种减轻捷径学习的轻量级自动野生动物识别模型设计方法
Animals (Basel). 2023 Feb 25;13(5):838. doi: 10.3390/ani13050838.
IEEE Trans Pattern Anal Mach Intell. 2019 Jan;41(1):107-120. doi: 10.1109/TPAMI.2017.2784424. Epub 2017 Dec 18.
4
A Brief Review of Facial Emotion Recognition Based on Visual Information.基于视觉信息的面部情绪识别综述。
Sensors (Basel). 2018 Jan 30;18(2):401. doi: 10.3390/s18020401.
5
Multi-view and 3D deformable part models.多视图和 3D 可变形部件模型。
IEEE Trans Pattern Anal Mach Intell. 2015 Nov;37(11):2232-45. doi: 10.1109/TPAMI.2015.2408347.
6
Classification of Potential Water Bodies Using Landsat 8 OLI and a Combination of Two Boosted Random Forest Classifiers.利用陆地卫星8号OLI和两种增强随机森林分类器组合对潜在水体进行分类
Sensors (Basel). 2015 Jun 11;15(6):13763-77. doi: 10.3390/s150613763.
7
Accurate estimation of human body orientation from RGB-D sensors.基于 RGB-D 传感器的人体姿态精确估计。
IEEE Trans Cybern. 2013 Oct;43(5):1442-52. doi: 10.1109/TCYB.2013.2272636. Epub 2013 Jul 23.
8
X-ray image classification using random forests with local wavelet-based CS-local binary patterns.基于局部小波的 CS-局部二值模式的随机森林 X 射线图像分类。
J Digit Imaging. 2011 Dec;24(6):1141-51. doi: 10.1007/s10278-011-9380-3.