IEEE Trans Cybern. 2022 Jul;52(7):5961-5972. doi: 10.1109/TCYB.2021.3052522. Epub 2022 Jul 4.
Scene graph generation (SGG) builds on detected objects to predict pairwise visual relations between objects, yielding an abstract description of image content. Existing works have shown that SGG performance improves significantly when the links between objects are given as prior knowledge. Inspired by this observation, in this article we propose a relation-regularized network (R2-Net), which predicts whether a relationship exists between two objects and encodes this relation into object feature refinement for better SGG. Specifically, we first construct an affinity matrix over the detected objects to represent the probability that a relationship exists between each pair. Graph convolution networks (GCNs) over this relation affinity matrix then serve as object encoders, producing relation-regularized object representations. With these relation-regularized features, our R2-Net can effectively refine object labels and generate scene graphs. Extensive experiments on the Visual Genome dataset across three SGG tasks (i.e., predicate classification, scene graph classification, and scene graph detection) demonstrate the effectiveness of the proposed method. Ablation studies further verify the key roles of the proposed components in the performance improvement.
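The core mechanism described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the bilinear pairwise scoring (`W_rel`), the layer sizes, and the single symmetric-normalized GCN layer are all assumptions made for clarity.

```python
import numpy as np

def relation_affinity(feats, W_rel):
    """Hypothetical pairwise scorer: sigmoid of a bilinear score gives
    the probability that a relationship exists between two objects."""
    scores = feats @ W_rel @ feats.T          # (N, N) pairwise scores
    A = 1.0 / (1.0 + np.exp(-scores))         # sigmoid -> probabilities in [0, 1]
    np.fill_diagonal(A, 0.0)                  # no self-relations
    return A

def gcn_layer(feats, A, W):
    """One GCN layer over the affinity matrix: D^{-1/2}(A+I)D^{-1/2} X W
    with a ReLU, producing relation-regularized object features."""
    A_hat = A + np.eye(A.shape[0])            # add self-loops
    d_inv_sqrt = 1.0 / np.sqrt(A_hat.sum(axis=1))
    A_norm = (A_hat * d_inv_sqrt[:, None]) * d_inv_sqrt[None, :]
    return np.maximum(A_norm @ feats @ W, 0.0)

rng = np.random.default_rng(0)
feats = rng.standard_normal((4, 8))           # 4 detected objects, 8-d features
W_rel = rng.standard_normal((8, 8))           # assumed bilinear relation weights
W = rng.standard_normal((8, 8))               # assumed GCN layer weights

A = relation_affinity(feats, W_rel)           # (4, 4) relation affinity matrix
refined = gcn_layer(feats, A, W)              # (4, 8) relation-regularized features
print(refined.shape)
```

The refined features would then feed object-label and predicate classifiers; the paper's actual architecture is richer than this single layer.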