• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于人机交互检测的分层推理网络

Hierarchical Reasoning Network for Human-Object Interaction Detection.

作者信息

Gao Yiming, Kuang Zhanghui, Li Guanbin, Zhang Wayne, Lin Liang

出版信息

IEEE Trans Image Process. 2021;30:8306-8317. doi: 10.1109/TIP.2021.3093784. Epub 2021 Oct 5.

DOI:10.1109/TIP.2021.3093784
PMID:34587007
Abstract

Human-object interaction detection that aims at detecting <human, verb, object> triplets is critical for the holistic human-centric scene understanding. Existing approaches ignore the modeling of correlations among hierarchical human parts and objects. In this work, we introduce a Hierarchical Reasoning Network (HRNet) to capture relations among human parts at multiple scales (including the holistic human, human region, and human keypoint levels) and objects via a unified graph. In particular, HRNet first constructs one multi-level human parts graph, each level of which consists of human parts at one specific scale, objects, and the unions of human part-object pairs as nodes, and their mutual visual and spatial layout relations as intra-level reasoning. To also capture the relations across scales, we further introduce inter-level reasoning between the nodes of two consecutive levels based on the prior of human body structure. The representations of graph nodes are propagated along intra-level and inter-level reasoning in turn during reasoning. Extensive experiments demonstrate our HRNet obtains new state-of-the-art results on three challenging HICO-DET, V-COCO and HOI-A benchmarks, validating the compelling effectiveness of the proposed method.

摘要

旨在检测<人、动词、物体>三元组的人与物体交互检测对于以人类为中心的整体场景理解至关重要。现有方法忽略了分层人体部位与物体之间相关性的建模。在这项工作中,我们引入了一种分层推理网络(HRNet),通过一个统一的图来捕捉多尺度(包括整体人体、人体区域和人体关键点级别)的人体部位与物体之间的关系。具体而言,HRNet首先构建一个多级人体部位图,其每一级由一个特定尺度的人体部位、物体以及人体部位-物体对的并集作为节点,以及它们相互的视觉和空间布局关系作为级内推理。为了捕捉跨尺度的关系,我们基于人体结构的先验知识,进一步引入了两个连续级别的节点之间的级间推理。在推理过程中,图节点的表示依次沿着级内和级间推理进行传播。大量实验表明,我们的HRNet在三个具有挑战性的HICO-DET、V-COCO和HOI-A基准测试中取得了新的最优结果,验证了所提方法的显著有效性。

相似文献

1
Hierarchical Reasoning Network for Human-Object Interaction Detection.用于人机交互检测的分层推理网络
IEEE Trans Image Process. 2021;30:8306-8317. doi: 10.1109/TIP.2021.3093784. Epub 2021 Oct 5.
2
Learning Human-Object Interaction via Interactive Semantic Reasoning.通过交互式语义推理学习人机交互。
IEEE Trans Image Process. 2021;30:9294-9305. doi: 10.1109/TIP.2021.3125258. Epub 2021 Nov 12.
3
IPGN: Interactiveness Proposal Graph Network for Human-Object Interaction Detection.IPGN:用于人机交互检测的交互提案图网络。
IEEE Trans Image Process. 2021;30:6583-6593. doi: 10.1109/TIP.2021.3096333. Epub 2021 Jul 21.
4
Cascaded Parsing of Human-Object Interaction Recognition.级联解析的人机交互识别。
IEEE Trans Pattern Anal Mach Intell. 2022 Jun;44(6):2827-2840. doi: 10.1109/TPAMI.2021.3049156. Epub 2022 May 5.
5
FGAHOI: Fine-Grained Anchors for Human-Object Interaction Detection.FGAHOI:用于人类与物体交互检测的细粒度锚点
IEEE Trans Pattern Anal Mach Intell. 2024 Apr;46(4):2415-2429. doi: 10.1109/TPAMI.2023.3331738. Epub 2024 Mar 6.
6
Zero-Shot Human-Object Interaction Detection via Similarity Propagation.通过相似性传播实现零样本人类-物体交互检测
IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):17805-17816. doi: 10.1109/TNNLS.2023.3309104. Epub 2024 Dec 2.
7
Toward a Unified Transformer-Based Framework for Scene Graph Generation and Human-Object Interaction Detection.面向场景图生成和人机交互检测的统一基于 Transformer 的框架。
IEEE Trans Image Process. 2023;32:6274-6288. doi: 10.1109/TIP.2023.3330304. Epub 2023 Nov 20.
8
Transferable Interactiveness Knowledge for Human-Object Interaction Detection.可迁移交互知识用于人机交互检测。
IEEE Trans Pattern Anal Mach Intell. 2022 Jul;44(7):3870-3882. doi: 10.1109/TPAMI.2021.3054048. Epub 2022 Jun 3.
9
Multi-Perspective Representation to Part-Based Graph for Group Activity Recognition.基于图的多视角表示的部分群组活动识别。
Sensors (Basel). 2022 Jul 24;22(15):5521. doi: 10.3390/s22155521.
10
HGR-Net: Hierarchical Graph Reasoning Network for Arbitrary Shape Scene Text Detection.HGR-Net:用于任意形状场景文本检测的分层图推理网络。
IEEE Trans Image Process. 2023;32:4142-4155. doi: 10.1109/TIP.2023.3294822. Epub 2023 Jul 20.

引用本文的文献

1
Human-object interaction, connectedness with nature, and life satisfaction: a cross-sectional study.人与物体的互动、与自然的联系以及生活满意度:一项横断面研究。
Front Psychol. 2024 Apr 15;15:1360518. doi: 10.3389/fpsyg.2024.1360518. eCollection 2024.