• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于卷积随机投影森林的头和身体方向估计。

Head and Body Orientation Estimation Using Convolutional Random Projection Forests.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2019 Jan;41(1):107-120. doi: 10.1109/TPAMI.2017.2784424. Epub 2017 Dec 18.

DOI:10.1109/TPAMI.2017.2784424
PMID:29990037
Abstract

In this paper, we consider the problem of estimating the head pose and body orientation of a person from a low-resolution image. Under this setting, it is difficult to reliably extract facial features or detect body parts. We propose a convolutional random projection forest (CRPforest) algorithm for these tasks. A convolutional random projection network (CRPnet) is used at each node of the forest. It maps an input image to a high-dimensional feature space using a rich filter bank. The filter bank is designed to generate sparse responses so that they can be efficiently computed by compressive sensing. A sparse random projection matrix can capture most essential information contained in the filter bank without using all the filters in it. Therefore, the CRPnet is fast, e.g., it requires to process an image of pixels, due to the small number of convolutions (e.g., 0.01 percent of a layer of a neural network) at the expense of less than 2 percent accuracy. The overall forest estimates head and body pose well on benchmark datasets, e.g., over 98 percent on the HIIT dataset, while requiring without using a GPU. Extensive experiments on challenging datasets show that the proposed algorithm performs favorably against the state-of-the-art methods in low-resolution images with noise, occlusion, and motion blur.

摘要

在本文中,我们考虑了从低分辨率图像估计人体头部姿势和身体方向的问题。在这种设置下,很难可靠地提取面部特征或检测身体部位。为此,我们提出了一种用于这些任务的卷积随机投影森林(CRPforest)算法。在森林的每个节点都使用卷积随机投影网络(CRPnet)。它使用丰富的滤波器组将输入图像映射到高维特征空间。滤波器组的设计目的是生成稀疏响应,以便可以通过压缩感知有效地计算它们。稀疏随机投影矩阵可以捕获滤波器组中包含的大部分基本信息,而无需使用其中的所有滤波器。因此,CRPnet 速度很快,例如,由于卷积的数量很少(例如,神经网络的一层的 0.01%),因此可以在不使用 GPU 的情况下处理像素图像。基准数据集上的整体森林对头部和身体姿势的估计效果很好,例如,在 HIIT 数据集上的准确率超过 98%,而无需使用 GPU。在具有噪声、遮挡和运动模糊的挑战性数据集上进行的广泛实验表明,与现有方法相比,该算法在低分辨率图像中具有优势。

相似文献

1
Head and Body Orientation Estimation Using Convolutional Random Projection Forests.基于卷积随机投影森林的头和身体方向估计。
IEEE Trans Pattern Anal Mach Intell. 2019 Jan;41(1):107-120. doi: 10.1109/TPAMI.2017.2784424. Epub 2017 Dec 18.
2
OBELISK-Net: Fewer layers to solve 3D multi-organ segmentation with sparse deformable convolutions.OBELISK-Net:稀疏可变形卷积解决三维多器官分割问题,所需层数更少。
Med Image Anal. 2019 May;54:1-9. doi: 10.1016/j.media.2019.02.006. Epub 2019 Feb 13.
3
Estimating Human Pose Efficiently by Parallel Pyramid Networks.通过并行金字塔网络高效估计人体姿势。
IEEE Trans Image Process. 2021;30:6785-6800. doi: 10.1109/TIP.2021.3097836. Epub 2021 Jul 30.
4
Head Pose Estimation on Top of Haar-Like Face Detection: A Study Using the Kinect Sensor.基于类 Haar 人脸检测的头部姿态估计:一项使用 Kinect 传感器的研究
Sensors (Basel). 2015 Aug 26;15(9):20945-66. doi: 10.3390/s150920945.
5
Improved Convolutional Pose Machines for Human Pose Estimation Using Image Sensor Data.基于图像传感器数据的改进卷积位姿机人体位姿估计
Sensors (Basel). 2019 Feb 10;19(3):718. doi: 10.3390/s19030718.
6
Features Guided Face Super-Resolution via Hybrid Model of Deep Learning and Random Forests.基于深度学习和随机森林混合模型的特征引导人脸超分辨率。
IEEE Trans Image Process. 2021;30:4157-4170. doi: 10.1109/TIP.2021.3069554. Epub 2021 Apr 9.
7
Feature Boosting Network For 3D Pose Estimation.特征增强网络的三维姿态估计。
IEEE Trans Pattern Anal Mach Intell. 2020 Feb;42(2):494-501. doi: 10.1109/TPAMI.2019.2894422. Epub 2019 Jan 22.
8
Hedging Deep Features for Visual Tracking.基于深度特征的视觉跟踪的套期保值。
IEEE Trans Pattern Anal Mach Intell. 2019 May;41(5):1116-1130. doi: 10.1109/TPAMI.2018.2828817. Epub 2018 Apr 20.
9
Training Convolutional Neural Networks and Compressed Sensing End-to-End for Microscopy Cell Detection.端到端训练卷积神经网络和压缩感知进行显微镜细胞检测。
IEEE Trans Med Imaging. 2019 Nov;38(11):2632-2641. doi: 10.1109/TMI.2019.2907093. Epub 2019 Mar 25.
10
Fast compressed sensing-based CBCT reconstruction using Barzilai-Borwein formulation for application to on-line IGRT.基于快速压缩感知的 Barzilai-Borwein 公式的 CBCT 重建,用于在线 IGRT 应用。
Med Phys. 2012 Mar;39(3):1207-17. doi: 10.1118/1.3679865.

引用本文的文献

1
Estimation of Pedestrian Pose Orientation Using Soft Target Training Based on Teacher⁻Student Framework.基于师生框架的软目标训练估计行人姿态方向。
Sensors (Basel). 2019 Mar 6;19(5):1147. doi: 10.3390/s19051147.