• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

配对关系:用于全景场景图生成的配对网络。

Pair Then Relation: Pair-Net for Panoptic Scene Graph Generation.

作者信息

Wang Jinghao, Wen Zhengyu, Li Xiangtai, Guo Zujin, Yang Jingkang, Liu Ziwei

出版信息

IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):10452-10465. doi: 10.1109/TPAMI.2024.3442301. Epub 2024 Nov 6.

DOI:10.1109/TPAMI.2024.3442301
PMID:39137074
Abstract

Panoptic Scene Graph (PSG) is a challenging task in Scene Graph Generation (SGG) that aims to create a more comprehensive scene graph representation using panoptic segmentation instead of boxes. Compared to SGG, PSG has several challenging problems: pixel-level segment outputs and full relationship exploration (It also considers thing and stuff relation). Thus, current PSG methods have limited performance, which hinders downstream tasks or applications. This work aims to design a novel and strong baseline for PSG. To achieve that, we first conduct an in-depth analysis to identify the bottleneck of the current PSG models, finding that inter-object pair-wise recall is a crucial factor that was ignored by previous PSG methods. Based on this and the recent query-based frameworks, we present a novel framework: Pair then Relation (Pair-Net), which uses a Pair Proposal Network (PPN) to learn and filter sparse pair-wise relationships between subjects and objects. Moreover, we also observed the sparse nature of object pairs for both. Motivated by this, we design a lightweight Matrix Learner within the PPN, which directly learns pair-wised relationships for pair proposal generation. Through extensive ablation and analysis, our approach significantly improves upon leveraging the segmenter solid baseline. Notably, our method achieves over 10% absolute gains compared to our baseline, PSGFormer.

摘要

全景场景图(PSG)是场景图生成(SGG)中的一项具有挑战性的任务,其旨在使用全景分割而非边界框来创建更全面的场景图表示。与SGG相比,PSG存在几个具有挑战性的问题:像素级分割输出和完整关系探索(它还考虑事物与物质的关系)。因此,当前的PSG方法性能有限,这阻碍了下游任务或应用。这项工作旨在为PSG设计一种新颖且强大的基线。为实现这一目标,我们首先进行深入分析以确定当前PSG模型的瓶颈,发现对象间成对召回率是先前PSG方法忽略的一个关键因素。基于此以及最近的基于查询的框架,我们提出了一种新颖的框架:先成对再关系(Pair-Net),它使用成对提议网络(PPN)来学习和过滤主体与对象之间稀疏的成对关系。此外,我们还观察到两者对象对的稀疏性质。受此启发,我们在PPN中设计了一个轻量级矩阵学习器,它直接学习用于生成成对提议的成对关系。通过广泛的消融实验和分析,我们的方法在利用分割器坚实基线的基础上有显著改进。值得注意的是,与我们的基线PSGFormer相比,我们的方法实现了超过10%的绝对增益。

相似文献

1
Pair Then Relation: Pair-Net for Panoptic Scene Graph Generation.配对关系:用于全景场景图生成的配对网络。
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):10452-10465. doi: 10.1109/TPAMI.2024.3442301. Epub 2024 Nov 6.
2
Relation Regularized Scene Graph Generation.关系正则化场景图生成。
IEEE Trans Cybern. 2022 Jul;52(7):5961-5972. doi: 10.1109/TCYB.2021.3052522. Epub 2022 Jul 4.
3
Tracklet Pair Proposal and Context Reasoning for Video Scene Graph Generation.用于视频场景图生成的轨迹对提议和上下文推理。
Sensors (Basel). 2021 May 2;21(9):3164. doi: 10.3390/s21093164.
4
MLMG-SGG: Multilabel Scene Graph Generation With Multigrained Features.MLMG-SGG:基于多粒度特征的多标签场景图生成
IEEE Trans Image Process. 2024;33:1549-1559. doi: 10.1109/TIP.2022.3199089. Epub 2024 Feb 27.
5
Relationship-Incremental Scene Graph Generation by a Divide-and-Conquer Pipeline With Feature Adapter.基于带特征适配器的分治管道的关系增量场景图生成
IEEE Trans Image Process. 2025;34:678-688. doi: 10.1109/TIP.2024.3384096. Epub 2025 Jan 28.
6
Panoptic-PartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation.全景部分Former++:用于全景部分分割的统一解耦视图
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):11087-11103. doi: 10.1109/TPAMI.2024.3453916. Epub 2024 Nov 6.
7
Learning Category- and Instance-Aware Pixel Embedding for Fast Panoptic Segmentation.用于快速全景分割的学习类别和实例感知像素嵌入
IEEE Trans Image Process. 2021;30:6013-6023. doi: 10.1109/TIP.2021.3090522. Epub 2021 Jul 1.
8
SGTR+: End-to-End Scene Graph Generation With Transformer.SGTR+:基于Transformer的端到端场景图生成
IEEE Trans Pattern Anal Mach Intell. 2024 Apr;46(4):2191-2205. doi: 10.1109/TPAMI.2023.3332246. Epub 2024 Mar 6.
9
Object-Centric Representation Learning for Video Scene Understanding.用于视频场景理解的以对象为中心的表示学习
IEEE Trans Pattern Anal Mach Intell. 2024 Dec;46(12):8410-8423. doi: 10.1109/TPAMI.2024.3401409. Epub 2024 Nov 6.
10
Dense Pixel-Level Interpretation of Dynamic Scenes With Video Panoptic Segmentation.基于视频全景分割的动态场景密集像素级解读
IEEE Trans Image Process. 2022;31:5383-5395. doi: 10.1109/TIP.2022.3183440. Epub 2022 Aug 17.