• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于半监督少样本视频分类的标签无关记忆

Label Independent Memory for Semi-Supervised Few-Shot Video Classification.

作者信息

Zhu Linchao, Yang Yi

出版信息

IEEE Trans Pattern Anal Mach Intell. 2022 Jan;44(1):273-285. doi: 10.1109/TPAMI.2020.3007511. Epub 2021 Dec 7.

DOI:10.1109/TPAMI.2020.3007511
PMID:32750804
Abstract

In this paper, we propose to leverage freely available unlabeled video data to facilitate few-shot video classification. In this semi-supervised few-shot video classification task, millions of unlabeled data are available for each episode during training. These videos can be extremely imbalanced, while they have profound visual and motion dynamics. To tackle the semi-supervised few-shot video classification problem, we make the following contributions. First, we propose a label independent memory (LIM) to cache label related features, which enables a similarity search over a large set of videos. LIM produces a class prototype for few-shot training. This prototype is an aggregated embedding for each class, which is more robust to noisy video features. Second, we integrate a multi-modality compound memory network to capture both RGB and flow information. We propose to store the RGB and flow representation in two separate memory networks, but they are jointly optimized via a unified loss. In this way, mutual communications between the two modalities are leveraged to achieve better classification performance. Third, we conduct extensive experiments on the few-shot Kinetics-100, Something-Something-100 datasets, which validates the effectiveness of leveraging the accessible unlabeled data for few-shot classification.

摘要

在本文中,我们提议利用免费可得的未标注视频数据来促进少样本视频分类。在这个半监督少样本视频分类任务中,训练期间每个情节都有数百万未标注数据可用。这些视频可能极度不平衡,同时它们具有深刻的视觉和运动动态。为了解决半监督少样本视频分类问题,我们做出了以下贡献。首先,我们提出了一个标签无关记忆(LIM)来缓存与标签相关的特征,这使得能够在大量视频上进行相似性搜索。LIM为少样本训练生成一个类原型。这个原型是每个类的聚合嵌入,对有噪声的视频特征更具鲁棒性。其次,我们集成了一个多模态复合记忆网络来捕捉RGB和光流信息。我们提议将RGB和光流表示存储在两个单独的记忆网络中,但它们通过统一损失进行联合优化。通过这种方式,利用两种模态之间的相互通信来实现更好的分类性能。第三,我们在少样本Kinetics - 100、Something - Something - 100数据集上进行了广泛实验,这验证了利用可获取的未标注数据进行少样本分类的有效性。

相似文献

1
Label Independent Memory for Semi-Supervised Few-Shot Video Classification.用于半监督少样本视频分类的标签无关记忆
IEEE Trans Pattern Anal Mach Intell. 2022 Jan;44(1):273-285. doi: 10.1109/TPAMI.2020.3007511. Epub 2021 Dec 7.
2
Multi-label zero-shot human action recognition via joint latent ranking embedding.基于联合潜在排序嵌入的多标签零镜头人体动作识别。
Neural Netw. 2020 Feb;122:1-23. doi: 10.1016/j.neunet.2019.09.029. Epub 2019 Oct 21.
3
Sample-Centric Feature Generation for Semi-Supervised Few-Shot Learning.用于半监督少样本学习的以样本为中心的特征生成
IEEE Trans Image Process. 2022;31:2309-2320. doi: 10.1109/TIP.2022.3154938. Epub 2022 Mar 11.
4
Adaptive Prototypical Networks With Label Words and Joint Representation Learning for Few-Shot Relation Classification.基于标签词和联合表示学习的自适应原型网络用于少样本关系分类
IEEE Trans Neural Netw Learn Syst. 2023 Mar;34(3):1406-1417. doi: 10.1109/TNNLS.2021.3105377. Epub 2023 Feb 28.
5
Ensemble Transductive Propagation Network for Semi-Supervised Few-Shot Learning.用于半监督少样本学习的集成转导传播网络
Entropy (Basel). 2024 Jan 31;26(2):135. doi: 10.3390/e26020135.
6
Semi-supervised few-shot learning approach for plant diseases recognition.用于植物病害识别的半监督少样本学习方法。
Plant Methods. 2021 Jun 27;17(1):68. doi: 10.1186/s13007-021-00770-1.
7
A semi-supervised zero-shot image classification method based on soft-target.基于软目标的半监督零样本图像分类方法。
Neural Netw. 2021 Nov;143:88-96. doi: 10.1016/j.neunet.2021.05.019. Epub 2021 May 25.
8
Learning to Model Relationships for Zero-Shot Video Classification.学习用于零样本视频分类的关系建模
IEEE Trans Pattern Anal Mach Intell. 2021 Oct;43(10):3476-3491. doi: 10.1109/TPAMI.2020.2985708. Epub 2021 Sep 2.
9
Fine-Grained Feature Generation for Generalized Zero-Shot Video Classification.用于广义零样本视频分类的细粒度特征生成
IEEE Trans Image Process. 2023;32:1599-1612. doi: 10.1109/TIP.2023.3247167. Epub 2023 Mar 6.
10
Generalized Few-Shot Video Classification With Video Retrieval and Feature Generation.基于视频检索和特征生成的广义少样本视频分类
IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):8949-8961. doi: 10.1109/TPAMI.2021.3120550. Epub 2022 Nov 7.

引用本文的文献

1
Semantic-aware Video Representation for Few-shot Action Recognition.用于少样本动作识别的语义感知视频表示
IEEE Winter Conf Appl Comput Vis. 2024 Jan;2024:6444-6454. doi: 10.1109/wacv57701.2024.00633. Epub 2024 Apr 9.
2
Coffee With a Hint of Data: Towards Using Data-Driven Approaches in Personalised Long-Term Interactions.带有一丝数据的咖啡:迈向在个性化长期互动中使用数据驱动方法
Front Robot AI. 2021 Sep 28;8:676814. doi: 10.3389/frobt.2021.676814. eCollection 2021.