• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

学习用于零样本视频分类的关系建模

Learning to Model Relationships for Zero-Shot Video Classification.

作者信息

Gao Junyu, Zhang Tianzhu, Xu Changsheng

出版信息

IEEE Trans Pattern Anal Mach Intell. 2021 Oct;43(10):3476-3491. doi: 10.1109/TPAMI.2020.2985708. Epub 2021 Sep 2.

DOI:10.1109/TPAMI.2020.2985708
PMID:32305892
Abstract

With the explosive growth of video categories, zero-shot learning (ZSL) in video classification has become a promising research direction in pattern analysis and machine learning. Based on some auxiliary information such as word embeddings and attributes, the key to a robust ZSL method is to transfer the learned knowledge from seen classes to unseen classes, which requires relationship modeling between these concepts (e.g., categories and attributes). However, most existing approaches ignore to model the explicit relationships in an end-to-end manner, resulting in low effectiveness of knowledge transfer. To tackle this problem, we reconsider the video ZSL task as a task-driven message passing process to jointly enjoy several merits including alleviated heterogeneity gap, low domain shift, and robust temporal modeling. Specifically, we propose a prototype-sample GNN (PS-GNN) consisting of a prototype branch and a sample branch to directly and adaptively model all the relationships between category-attribute, category-category, and attribute-attribute. The prototype branch aims to learn robust representations of video categories, which takes as input a set of word-embedding vectors corresponding to the concepts. The sample branch is designed to generate features of a video sample by leveraging its object semantics. With the co-adaption and cooperation between both branches, a unified and robust ZSL framework is achieved. Extensive experiments strongly evidence that PS-GNN obtains favorable performance on five popular video benchmarks consistently.

摘要

随着视频类别的爆炸式增长,视频分类中的零样本学习(ZSL)已成为模式分析和机器学习中一个有前途的研究方向。基于诸如词嵌入和属性等一些辅助信息,一种强大的ZSL方法的关键在于将从已见类别中学到的知识转移到未见类别,这需要对这些概念(如类别和属性)之间的关系进行建模。然而,大多数现有方法忽略了以端到端的方式对显式关系进行建模,导致知识转移的效率低下。为了解决这个问题,我们将视频ZSL任务重新视为一个任务驱动的消息传递过程,以共同具备几个优点,包括减轻异质性差距、低领域转移和强大的时间建模。具体而言,我们提出了一种由原型分支和样本分支组成的原型-样本图神经网络(PS-GNN),以直接和自适应地对类别-属性、类别-类别和属性-属性之间的所有关系进行建模。原型分支旨在学习视频类别的强大表示,它将与概念对应的一组词嵌入向量作为输入。样本分支旨在通过利用视频样本的对象语义来生成其特征。通过两个分支之间的共同适应和协作,实现了一个统一且强大的ZSL框架。大量实验有力地证明,PS-GNN在五个流行的视频基准测试中始终获得良好的性能。

相似文献

1
Learning to Model Relationships for Zero-Shot Video Classification.学习用于零样本视频分类的关系建模
IEEE Trans Pattern Anal Mach Intell. 2021 Oct;43(10):3476-3491. doi: 10.1109/TPAMI.2020.2985708. Epub 2021 Sep 2.
2
Visual-guided attentive attributes embedding for zero-shot learning.基于视觉引导的注意力属性嵌入的零样本学习。
Neural Netw. 2021 Nov;143:709-718. doi: 10.1016/j.neunet.2021.07.031. Epub 2021 Aug 11.
3
Zero-Shot Learning via Robust Latent Representation and Manifold Regularization.基于鲁棒潜在表示和流形正则化的零样本学习。
IEEE Trans Image Process. 2019 Apr;28(4):1824-1836. doi: 10.1109/TIP.2018.2881926. Epub 2018 Nov 16.
4
Zero-Shot Learning via Attribute Regression and Class Prototype Rectification.基于属性回归和类别原型校正的零样本学习。
IEEE Trans Image Process. 2018 Feb;27(2):637-648. doi: 10.1109/TIP.2017.2745109. Epub 2017 Aug 25.
5
TransZero++: Cross Attribute-Guided Transformer for Zero-Shot Learning.TransZero++:用于零样本学习的跨属性引导变换器
IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):12844-12861. doi: 10.1109/TPAMI.2022.3229526. Epub 2023 Oct 3.
6
ZS-VAT: Learning Unbiased Attribute Knowledge for Zero-Shot Recognition Through Visual Attribute Transformer.ZS-VAT:通过视觉属性变换器学习用于零样本识别的无偏属性知识。
IEEE Trans Neural Netw Learn Syst. 2025 Apr;36(4):7025-7036. doi: 10.1109/TNNLS.2024.3386935. Epub 2025 Apr 4.
7
GNDAN: Graph Navigated Dual Attention Network for Zero-Shot Learning.GNDAN:用于零样本学习的图导航双注意力网络。
IEEE Trans Neural Netw Learn Syst. 2024 Apr;35(4):4516-4529. doi: 10.1109/TNNLS.2022.3155602. Epub 2024 Apr 4.
8
Multi-label zero-shot learning with graph convolutional networks.基于图卷积网络的多标签零样本学习。
Neural Netw. 2020 Dec;132:333-341. doi: 10.1016/j.neunet.2020.09.010. Epub 2020 Sep 21.
9
Complementary Attributes: A New Clue to Zero-Shot Learning.互补属性:零样本学习的新线索。
IEEE Trans Cybern. 2021 Mar;51(3):1519-1530. doi: 10.1109/TCYB.2019.2930744. Epub 2021 Feb 17.
10
Boosting Zero-Shot Learning via Contrastive Optimization of Attribute Representations.
IEEE Trans Neural Netw Learn Syst. 2024 Nov;35(11):16706-16719. doi: 10.1109/TNNLS.2023.3297134. Epub 2024 Oct 29.