• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于乘积格拉斯曼流形的加权稀疏编码模型用于基于视频的人体手势识别。

A weighted sparse coding model on product Grassmann manifold for video-based human gesture recognition.

作者信息

Wang Yuping, Zhang Junfei

机构信息

School of Statistics, Capital University of Economics and Business, Beijing, China.

School of Statistics and Mathematics, Central University of Finance and Economics, Beijing, China.

出版信息

PeerJ Comput Sci. 2022 Mar 16;8:e923. doi: 10.7717/peerj-cs.923. eCollection 2022.

DOI:10.7717/peerj-cs.923
PMID:35494827
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9044265/
Abstract

It is a challenging problem to classify multi-dimensional data with complex intrinsic geometry inherent, such as human gesture recognition based on videos. In particular, manifold structure is a good way to characterize intrinsic geometry of multi-dimensional data. The recently proposed sparse coding on Grassmann manifold shows high discriminative power in many visual classification tasks. It represents videos on Grassmann manifold using Singular Value Decomposition (SVD) of the data matrix by vectorizing each image in videos, while vectorization destroys the spatial structure of videos. To keep the spatial structure of videos, they can be represented as the form of data tensor. In this paper, we firstly represent human gesture videos on product Grassmann manifold (PGM) by Higher Order Singular Value Decomposition (HOSVD) of data tensor. Each factor manifold characterizes features of human gesture video from different perspectives and can be understood as appearance, horizontal motion and vertical motion of human gesture video respectively. We then propose a weighted sparse coding model on PGM, where weights can be understood as modeling the importance of factor manifolds. Furthermore, we propose an optimization algorithm for learning coding coefficients by embedding each factor Grassmann manifold into symmetric matrices space. Finally, we give a classification algorithm, and experimental results on three public datasets show that our method is competitive to some relevant excellent methods.

摘要

对具有复杂内在几何结构的多维数据进行分类是一个具有挑战性的问题,例如基于视频的人体手势识别。特别是,流形结构是表征多维数据内在几何结构的一种好方法。最近提出的格拉斯曼流形上的稀疏编码在许多视觉分类任务中显示出很高的判别能力。它通过对视频中的每个图像进行向量化,利用数据矩阵的奇异值分解(SVD)在格拉斯曼流形上表示视频,而向量化破坏了视频的空间结构。为了保持视频的空间结构,可以将它们表示为数据张量的形式。在本文中,我们首先通过数据张量的高阶奇异值分解(HOSVD)在乘积格拉斯曼流形(PGM)上表示人体手势视频。每个因子流形从不同角度表征人体手势视频的特征,分别可以理解为人体手势视频的外观、水平运动和垂直运动。然后,我们在PGM上提出了一种加权稀疏编码模型,其中权重可以理解为对因子流形重要性的建模。此外,我们提出了一种通过将每个因子格拉斯曼流形嵌入对称矩阵空间来学习编码系数的优化算法。最后,我们给出了一种分类算法,在三个公共数据集上的实验结果表明,我们的方法与一些相关的优秀方法相比具有竞争力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b2/9044265/0c51fb76e95e/peerj-cs-08-923-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b2/9044265/e90303f806f3/peerj-cs-08-923-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b2/9044265/da9ba0b6ed39/peerj-cs-08-923-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b2/9044265/102d166b0935/peerj-cs-08-923-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b2/9044265/159050db980f/peerj-cs-08-923-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b2/9044265/c5d6232c4e6b/peerj-cs-08-923-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b2/9044265/19efc1641462/peerj-cs-08-923-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b2/9044265/2e6bd258b10c/peerj-cs-08-923-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b2/9044265/0c51fb76e95e/peerj-cs-08-923-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b2/9044265/e90303f806f3/peerj-cs-08-923-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b2/9044265/da9ba0b6ed39/peerj-cs-08-923-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b2/9044265/102d166b0935/peerj-cs-08-923-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b2/9044265/159050db980f/peerj-cs-08-923-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b2/9044265/c5d6232c4e6b/peerj-cs-08-923-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b2/9044265/19efc1641462/peerj-cs-08-923-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b2/9044265/2e6bd258b10c/peerj-cs-08-923-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02b2/9044265/0c51fb76e95e/peerj-cs-08-923-g008.jpg

相似文献

1
A weighted sparse coding model on product Grassmann manifold for video-based human gesture recognition.基于乘积格拉斯曼流形的加权稀疏编码模型用于基于视频的人体手势识别。
PeerJ Comput Sci. 2022 Mar 16;8:e923. doi: 10.7717/peerj-cs.923. eCollection 2022.
2
Statistical computations on Grassmann and Stiefel manifolds for image and video-based recognition.基于 Grassmann 和 Stiefel 流形的统计计算在图像和视频识别中的应用。
IEEE Trans Pattern Anal Mach Intell. 2011 Nov;33(11):2273-86. doi: 10.1109/TPAMI.2011.52.
3
Dimensionality Reduction on SPD Manifolds: The Emergence of Geometry-Aware Methods.SPD 流形上的降维:几何感知方法的出现。
IEEE Trans Pattern Anal Mach Intell. 2018 Jan;40(1):48-62. doi: 10.1109/TPAMI.2017.2655048. Epub 2017 Jan 18.
4
Image Set Classification Using a Distance-Based Kernel Over Affine Grassmann Manifold.基于仿射格拉斯曼流形上基于距离的核的图像集分类
IEEE Trans Neural Netw Learn Syst. 2021 Mar;32(3):1082-1095. doi: 10.1109/TNNLS.2020.2980059. Epub 2021 Mar 1.
5
Embedding graphs on Grassmann manifold.在 Grassmann 流形上嵌入图。
Neural Netw. 2022 Aug;152:322-331. doi: 10.1016/j.neunet.2022.05.001. Epub 2022 May 10.
6
Supervised Dimensionality Reduction on Grassmannian for Image Set Recognition.用于图像集识别的格拉斯曼流形上的监督降维
Neural Comput. 2019 Jan;31(1):156-175. doi: 10.1162/neco_a_01148. Epub 2018 Nov 21.
7
Classification via Sparse Representation of Steerable Wavelet Frames on Grassmann Manifold: Application to Target Recognition in SAR Image.基于 Grassmann 流形上可操纵子波框架的稀疏表示的分类:在 SAR 图像目标识别中的应用。
IEEE Trans Image Process. 2017 Jun;26(6):2892-2904. doi: 10.1109/TIP.2017.2692524. Epub 2017 Apr 7.
8
Learning Match Kernels on Grassmann Manifolds for Action Recognition.在 Grassmann 流形上学习匹配核进行动作识别。
IEEE Trans Image Process. 2019 Jan;28(1):205-215. doi: 10.1109/TIP.2018.2866688. Epub 2018 Aug 22.
9
Multi-kernel clustering with tensor fusion on Grassmann manifold for high-dimensional genomic data.基于 Grassmann 流形上张量融合的多核聚类在高维基因组数据中的应用。
Methods. 2024 Nov;231:215-225. doi: 10.1016/j.ymeth.2024.09.015. Epub 2024 Oct 11.
10
Dictionary Pair Learning on Grassmann Manifolds for Image Denoising.基于 Grassmann 流形的字典对学习在图像去噪中的应用。
IEEE Trans Image Process. 2015 Nov;24(11):4556-69. doi: 10.1109/TIP.2015.2468172. Epub 2015 Aug 13.

本文引用的文献

1
MFA-Net: Motion Feature Augmented Network for Dynamic Hand Gesture Recognition from Skeletal Data.MFA-Net:基于运动特征增强的骨骼数据动态手势识别网络。
Sensors (Basel). 2019 Jan 10;19(2):239. doi: 10.3390/s19020239.
2
Pixel-level alignment of facial images for high accuracy recognition using ensemble of patches.使用补丁集合进行高精度识别的面部图像像素级对齐。
J Opt Soc Am A Opt Image Sci Vis. 2018 Jul 1;35(7):1149-1159. doi: 10.1364/JOSAA.35.001149.
3
Sparse subspace clustering: algorithm, theory, and applications.稀疏子空间聚类:算法、理论与应用。
IEEE Trans Pattern Anal Mach Intell. 2013 Nov;35(11):2765-81. doi: 10.1109/TPAMI.2013.57.
4
3D convolutional neural networks for human action recognition.三维卷积神经网络的人体动作识别。
IEEE Trans Pattern Anal Mach Intell. 2013 Jan;35(1):221-31. doi: 10.1109/TPAMI.2012.59.
5
Human action recognition by semilatent topic models.基于半潜在主题模型的人类行为识别
IEEE Trans Pattern Anal Mach Intell. 2009 Oct;31(10):1762-74. doi: 10.1109/TPAMI.2009.43.
6
Canonical correlation analysis of video volume tensors for action categorization and detection.用于动作分类与检测的视频体积张量典型相关分析。
IEEE Trans Pattern Anal Mach Intell. 2009 Aug;31(8):1415-28. doi: 10.1109/TPAMI.2008.167.
7
Robust face recognition via sparse representation.基于稀疏表示的鲁棒人脸识别。
IEEE Trans Pattern Anal Mach Intell. 2009 Feb;31(2):210-27. doi: 10.1109/TPAMI.2008.79.