Deep Manifold Structure Transfer for Action Recognition.

Authors

Li Ce, Zhang Baochang, Chen Chen, Ye Qixiang, Han Jungong, Guo Guodong, Ji Rongrong

Publication information

IEEE Trans Image Process. 2019 Apr 25. doi: 10.1109/TIP.2019.2912357.

Abstract

While intrinsic data structure in subspace provides useful information for visual recognition, it has not yet been well studied in deep feature learning for action recognition. In this paper, we introduce a new spatio-temporal manifold network (STMN) that leverages data manifold structures to regularize deep action feature learning, aiming to simultaneously minimize the intra-class variations of the learned deep features and alleviate over-fitting. To this end, the manifold prior is imposed at the top layer of a convolutional neural network (CNN) and is propagated across the convolutional layers during forward-backward propagation. The observed correspondence between manifold structures in the data space and in the feature space validates that the manifold prior can be transferred across CNN layers. STMN theoretically recasts the problem of transferring the data structure prior into deep learning architectures as a projection over the manifold via an embedding method, which can be solved by an Alternating Direction Method of Multipliers and Backward Propagation (ADMM-BP) algorithm. STMN is generic in the sense that it can be plugged into various backbone architectures to learn more discriminative representations for action recognition. Extensive experimental results show that our method achieves performance comparable to or better than state-of-the-art approaches on four benchmark datasets.
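The abstract does not give the STMN objective, so the following is only a minimal sketch of the general idea of manifold regularization, not the authors' ADMM-BP formulation. It assumes a generic graph-Laplacian penalty: a neighbourhood graph is built in the data space, and features that separate neighbouring samples are penalized. All names here (`knn_graph`, `manifold_penalty`, `regularized_loss`, `lam`) are hypothetical and chosen for illustration.

```python
# Illustrative sketch only: a generic graph-Laplacian manifold penalty.
# This is NOT the STMN / ADMM-BP method; every name below is an assumption.

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def knn_graph(points, k):
    """Symmetric 0/1 k-nearest-neighbour adjacency built in the data space."""
    n = len(points)
    w = [[0.0] * n for _ in range(n)]
    for i in range(n):
        order = sorted((j for j in range(n) if j != i),
                       key=lambda j: sq_dist(points[i], points[j]))
        for j in order[:k]:
            w[i][j] = w[j][i] = 1.0
    return w

def manifold_penalty(features, w):
    """0.5 * sum_ij w_ij * ||f_i - f_j||^2, i.e. trace(F^T L F) with L = D - W."""
    n = len(features)
    return 0.5 * sum(w[i][j] * sq_dist(features[i], features[j])
                     for i in range(n) for j in range(n))

def regularized_loss(task_loss, features, w, lam=0.1):
    """Task loss plus a weighted manifold penalty on the learned features."""
    return task_loss + lam * manifold_penalty(features, w)

# Toy data: two tight clusters in the data space.
data = [[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]]
w = knn_graph(data, k=1)

# Features that keep data-space neighbours together vs. features that split them.
feats_consistent = [[0.0], [0.0], [1.0], [1.0]]
feats_scrambled = [[0.0], [1.0], [0.0], [1.0]]

print(manifold_penalty(feats_consistent, w))
print(manifold_penalty(feats_scrambled, w))
```

In this toy setting the penalty is smaller when the feature space preserves the neighbourhood structure of the data space, which is the kind of correspondence the abstract refers to. How STMN actually imposes and propagates the prior is specified in the paper, not here.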

Similar articles

Deep Manifold Learning Combined With Convolutional Neural Networks for Action Recognition.

IEEE Trans Neural Netw Learn Syst. 2018 Sep;29(9):3938-3952. doi: 10.1109/TNNLS.2017.2740318. Epub 2017 Sep 15.

The Structure Transfer Machine Theory and Applications.

IEEE Trans Image Process. 2019 Nov 25. doi: 10.1109/TIP.2019.2954178.

Human Activity Recognition Using Cascaded Dual Attention CNN and Bi-Directional GRU Framework.

J Imaging. 2023 Jun 26;9(7):130. doi: 10.3390/jimaging9070130.

Accelerating Cartesian MRI by domain-transform manifold learning in phase-encoding direction.

Med Image Anal. 2020 Jul;63:101689. doi: 10.1016/j.media.2020.101689. Epub 2020 Mar 30.

Learning Match Kernels on Grassmann Manifolds for Action Recognition.

IEEE Trans Image Process. 2019 Jan;28(1):205-215. doi: 10.1109/TIP.2018.2866688. Epub 2018 Aug 22.

Zero-Shot Learning via Robust Latent Representation and Manifold Regularization.

IEEE Trans Image Process. 2019 Apr;28(4):1824-1836. doi: 10.1109/TIP.2018.2881926. Epub 2018 Nov 16.

A deep dive into understanding tumor foci classification using multiparametric MRI based on convolutional neural network.

Med Phys. 2020 Sep;47(9):4077-4086. doi: 10.1002/mp.14255. Epub 2020 Jun 12.

DLPNet: A deep manifold network for feature extraction of hyperspectral imagery.

Neural Netw. 2020 Sep;129:7-18. doi: 10.1016/j.neunet.2020.05.022. Epub 2020 May 22.

Wasserstein CNN: Learning Invariant Features for NIR-VIS Face Recognition.

IEEE Trans Pattern Anal Mach Intell. 2019 Jul;41(7):1761-1773. doi: 10.1109/TPAMI.2018.2842770. Epub 2018 Jun 1.

Cited by

Failure type and failure level detection of insulators according to monitored leakage current.

Heliyon. 2024 Jul 5;10(14):e34143. doi: 10.1016/j.heliyon.2024.e34143. eCollection 2024 Jul 30.

Multiscale knowledge distillation with attention based fusion for robust human activity recognition.

Sci Rep. 2024 May 30;14(1):12411. doi: 10.1038/s41598-024-63195-5.

Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network.

Comput Intell Neurosci. 2022 Jun 13;2022:6608448. doi: 10.1155/2022/6608448. eCollection 2022.
