学习用于零样本视频分类的关系建模

Learning to Model Relationships for Zero-Shot Video Classification.

作者信息

Gao Junyu, Zhang Tianzhu, Xu Changsheng

出版信息

IEEE Trans Pattern Anal Mach Intell. 2021 Oct;43(10):3476-3491. doi: 10.1109/TPAMI.2020.2985708. Epub 2021 Sep 2.

DOI:10.1109/TPAMI.2020.2985708

Abstract

With the explosive growth of video categories, zero-shot learning (ZSL) in video classification has become a promising research direction in pattern analysis and machine learning. Based on some auxiliary information such as word embeddings and attributes, the key to a robust ZSL method is to transfer the learned knowledge from seen classes to unseen classes, which requires relationship modeling between these concepts (e.g., categories and attributes). However, most existing approaches ignore to model the explicit relationships in an end-to-end manner, resulting in low effectiveness of knowledge transfer. To tackle this problem, we reconsider the video ZSL task as a task-driven message passing process to jointly enjoy several merits including alleviated heterogeneity gap, low domain shift, and robust temporal modeling. Specifically, we propose a prototype-sample GNN (PS-GNN) consisting of a prototype branch and a sample branch to directly and adaptively model all the relationships between category-attribute, category-category, and attribute-attribute. The prototype branch aims to learn robust representations of video categories, which takes as input a set of word-embedding vectors corresponding to the concepts. The sample branch is designed to generate features of a video sample by leveraging its object semantics. With the co-adaption and cooperation between both branches, a unified and robust ZSL framework is achieved. Extensive experiments strongly evidence that PS-GNN obtains favorable performance on five popular video benchmarks consistently.

摘要

随着视频类别的爆炸式增长，视频分类中的零样本学习（ZSL）已成为模式分析和机器学习中一个有前途的研究方向。基于诸如词嵌入和属性等一些辅助信息，一种强大的ZSL方法的关键在于将从已见类别中学到的知识转移到未见类别，这需要对这些概念（如类别和属性）之间的关系进行建模。然而，大多数现有方法忽略了以端到端的方式对显式关系进行建模，导致知识转移的效率低下。为了解决这个问题，我们将视频ZSL任务重新视为一个任务驱动的消息传递过程，以共同具备几个优点，包括减轻异质性差距、低领域转移和强大的时间建模。具体而言，我们提出了一种由原型分支和样本分支组成的原型-样本图神经网络（PS-GNN），以直接和自适应地对类别-属性、类别-类别和属性-属性之间的所有关系进行建模。原型分支旨在学习视频类别的强大表示，它将与概念对应的一组词嵌入向量作为输入。样本分支旨在通过利用视频样本的对象语义来生成其特征。通过两个分支之间的共同适应和协作，实现了一个统一且强大的ZSL框架。大量实验有力地证明，PS-GNN在五个流行的视频基准测试中始终获得良好的性能。

相似文献

Learning to Model Relationships for Zero-Shot Video Classification.学习用于零样本视频分类的关系建模

IEEE Trans Pattern Anal Mach Intell. 2021 Oct;43(10):3476-3491. doi: 10.1109/TPAMI.2020.2985708. Epub 2021 Sep 2.

Visual-guided attentive attributes embedding for zero-shot learning.基于视觉引导的注意力属性嵌入的零样本学习。

Neural Netw. 2021 Nov;143:709-718. doi: 10.1016/j.neunet.2021.07.031. Epub 2021 Aug 11.

Zero-Shot Learning via Robust Latent Representation and Manifold Regularization.基于鲁棒潜在表示和流形正则化的零样本学习。

IEEE Trans Image Process. 2019 Apr;28(4):1824-1836. doi: 10.1109/TIP.2018.2881926. Epub 2018 Nov 16.

Zero-Shot Learning via Attribute Regression and Class Prototype Rectification.基于属性回归和类别原型校正的零样本学习。

IEEE Trans Image Process. 2018 Feb;27(2):637-648. doi: 10.1109/TIP.2017.2745109. Epub 2017 Aug 25.

TransZero++: Cross Attribute-Guided Transformer for Zero-Shot Learning.TransZero++：用于零样本学习的跨属性引导变换器

IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):12844-12861. doi: 10.1109/TPAMI.2022.3229526. Epub 2023 Oct 3.

ZS-VAT: Learning Unbiased Attribute Knowledge for Zero-Shot Recognition Through Visual Attribute Transformer.ZS-VAT：通过视觉属性变换器学习用于零样本识别的无偏属性知识。

IEEE Trans Neural Netw Learn Syst. 2025 Apr;36(4):7025-7036. doi: 10.1109/TNNLS.2024.3386935. Epub 2025 Apr 4.

GNDAN: Graph Navigated Dual Attention Network for Zero-Shot Learning.GNDAN：用于零样本学习的图导航双注意力网络。

IEEE Trans Neural Netw Learn Syst. 2024 Apr;35(4):4516-4529. doi: 10.1109/TNNLS.2022.3155602. Epub 2024 Apr 4.

Multi-label zero-shot learning with graph convolutional networks.基于图卷积网络的多标签零样本学习。

Neural Netw. 2020 Dec;132:333-341. doi: 10.1016/j.neunet.2020.09.010. Epub 2020 Sep 21.

Complementary Attributes: A New Clue to Zero-Shot Learning.互补属性：零样本学习的新线索。

IEEE Trans Cybern. 2021 Mar;51(3):1519-1530. doi: 10.1109/TCYB.2019.2930744. Epub 2021 Feb 17.

Boosting Zero-Shot Learning via Contrastive Optimization of Attribute Representations.

IEEE Trans Neural Netw Learn Syst. 2024 Nov;35(11):16706-16719. doi: 10.1109/TNNLS.2023.3297134. Epub 2024 Oct 29.

学习用于零样本视频分类的关系建模

Learning to Model Relationships for Zero-Shot Video Classification.

作者信息

Gao Junyu, Zhang Tianzhu, Xu Changsheng

出版信息

IEEE Trans Pattern Anal Mach Intell. 2021 Oct;43(10):3476-3491. doi: 10.1109/TPAMI.2020.2985708. Epub 2021 Sep 2.

DOI:10.1109/TPAMI.2020.2985708

PMID:32305892

Abstract

摘要

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

学习用于零样本视频分类的关系建模

Learning to Model Relationships for Zero-Shot Video Classification.

作者信息

出版信息

相似文献

学习用于零样本视频分类的关系建模

Learning to Model Relationships for Zero-Shot Video Classification.

作者信息

出版信息

相似文献