• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DGIG-Net:用于少样本人类-物体交互的动态图中带图网络

DGIG-Net: Dynamic Graph-in-Graph Networks for Few-Shot Human-Object Interaction.

作者信息

Liu Xiyao, Ji Zhong, Pang Yanwei, Han Jungong, Li Xuelong

出版信息

IEEE Trans Cybern. 2022 Aug;52(8):7852-7864. doi: 10.1109/TCYB.2021.3049537. Epub 2022 Jul 19.

DOI:10.1109/TCYB.2021.3049537
PMID:33566778
Abstract

Few-shot learning (FSL) for human-object interaction (HOI) aims at recognizing various relationships between human actions and surrounding objects only from a few samples. It is a challenging vision task, in which the diversity and interactivity of human actions result in great difficulty to learn an adaptive classifier to catch ambiguous interclass information. Therefore, traditional FSL methods usually perform unsatisfactorily in complex HOI scenes. To this end, we propose dynamic graph-in-graph networks (DGIG-Net), a novel graph prototypes framework to learn a dynamic metric space by embedding a visual subgraph to a task-oriented cross-modal graph for few-shot HOI. Specifically, we first build a knowledge reconstruction graph to learn latent representations for HOI categories by reconstructing the relationship among visual features, which generates visual representations under the category distribution of every task. Then, a dynamic relation graph integrates both reconstructible visual nodes and dynamic task-oriented semantic information to explore a graph metric space for HOI class prototypes, which applies the discriminative information from the similarities among actions or objects. We validate DGIG-Net on multiple benchmark datasets, on which it largely outperforms existing FSL approaches and achieves state-of-the-art results.

摘要

用于人类与物体交互(HOI)的少样本学习(FSL)旨在仅从少量样本中识别人类动作与周围物体之间的各种关系。这是一项具有挑战性的视觉任务,其中人类动作的多样性和交互性导致学习一个自适应分类器以捕捉模糊的类间信息变得极为困难。因此,传统的FSL方法在复杂的HOI场景中通常表现不佳。为此,我们提出了动态图中网络(DGIG-Net),这是一种新颖的图原型框架,通过将视觉子图嵌入到面向任务的跨模态图中,来学习用于少样本HOI的动态度量空间。具体而言,我们首先构建一个知识重建图,通过重建视觉特征之间的关系来学习HOI类别的潜在表示,这会在每个任务的类别分布下生成视觉表示。然后,一个动态关系图整合了可重建的视觉节点和面向任务的动态语义信息,以探索用于HOI类原型的图度量空间,该空间应用来自动作或物体之间相似性的判别信息。我们在多个基准数据集上对DGIG-Net进行了验证,在这些数据集上它大大优于现有的FSL方法,并取得了当前最优的结果。

相似文献

1
DGIG-Net: Dynamic Graph-in-Graph Networks for Few-Shot Human-Object Interaction.DGIG-Net:用于少样本人类-物体交互的动态图中带图网络
IEEE Trans Cybern. 2022 Aug;52(8):7852-7864. doi: 10.1109/TCYB.2021.3049537. Epub 2022 Jul 19.
2
Few-Shot Human-Object Interaction Recognition With Semantic-Guided Attentive Prototypes Network.基于语义引导注意原型网络的少样本人体目标交互识别。
IEEE Trans Image Process. 2021;30:1648-1661. doi: 10.1109/TIP.2020.3046861. Epub 2021 Jan 11.
3
Semantic-Aware Dynamic Generation Networks for Few-Shot Human-Object Interaction Recognition.用于少样本人类-物体交互识别的语义感知动态生成网络
IEEE Trans Neural Netw Learn Syst. 2024 Sep;35(9):12564-12575. doi: 10.1109/TNNLS.2023.3263660. Epub 2024 Sep 3.
4
Multi-label zero-shot learning with graph convolutional networks.基于图卷积网络的多标签零样本学习。
Neural Netw. 2020 Dec;132:333-341. doi: 10.1016/j.neunet.2020.09.010. Epub 2020 Sep 21.
5
A Multitask Latent Feature Augmentation Method for Few-Shot Learning.一种用于少样本学习的多任务潜在特征增强方法。
IEEE Trans Neural Netw Learn Syst. 2024 May;35(5):6976-6990. doi: 10.1109/TNNLS.2022.3213576. Epub 2024 May 2.
6
Knowledge-Guided Multi-Label Few-Shot Learning for General Image Recognition.基于知识引导的通用图像识别的多标签少样本学习。
IEEE Trans Pattern Anal Mach Intell. 2022 Mar;44(3):1371-1384. doi: 10.1109/TPAMI.2020.3025814. Epub 2022 Feb 3.
7
Graph Few-Shot Learning via Restructuring Task Graph.通过重组任务图实现图的少样本学习。
IEEE Trans Neural Netw Learn Syst. 2022 Jun 8;PP. doi: 10.1109/TNNLS.2022.3178849.
8
IPGN: Interactiveness Proposal Graph Network for Human-Object Interaction Detection.IPGN:用于人机交互检测的交互提案图网络。
IEEE Trans Image Process. 2021;30:6583-6593. doi: 10.1109/TIP.2021.3096333. Epub 2021 Jul 21.
9
Information Symmetry Matters: A Modal-Alternating Propagation Network for Few-Shot Learning.信息对称很重要:用于少样本学习的模态交替传播网络。
IEEE Trans Image Process. 2022;31:1520-1531. doi: 10.1109/TIP.2022.3143005. Epub 2022 Jan 28.
10
Cross-modality interaction for few-shot multispectral object detection with semantic knowledge.基于语义知识的少样本多光谱目标检测的跨模态交互。
Neural Netw. 2024 May;173:106156. doi: 10.1016/j.neunet.2024.106156. Epub 2024 Feb 5.

引用本文的文献

1
DepressionMIGNN: A Multiple-Instance Learning-Based Depression Detection Model with Graph Neural Networks.抑郁症MIGNN:一种基于多实例学习和图神经网络的抑郁症检测模型。
Sensors (Basel). 2025 Jul 21;25(14):4520. doi: 10.3390/s25144520.