IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):5280-5292. doi: 10.1109/TPAMI.2021.3075676. Epub 2022 Aug 4.
Contextual information plays an important role in solving various image and scene understanding tasks. Prior work has focused on extracting contextual information from an image and using it to infer the properties of objects in the image or to understand the scene behind the image, e.g., context-based object detection, recognition, and semantic segmentation. In this paper, we consider the inverse problem: how to hallucinate the missing contextual information from the properties of standalone objects. We refer to this as object-level scene context prediction. The problem is difficult, as it requires extensive knowledge of the complex and diverse relationships among objects in a scene. We propose a deep neural network that takes as input the properties (i.e., category, shape, and position) of a few standalone objects and predicts an object-level scene layout that compactly encodes the semantics and structure of the scene context in which the given objects reside. Quantitative experiments and user studies demonstrate that our model generates more plausible scene contexts than the baselines. Our model also enables the synthesis of realistic scene images from partial scene layouts. Finally, we validate that the model internally learns features useful for scene recognition and fake scene detection.
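The abstract specifies only the model's interface (object properties in, object-level scene layout out), not its architecture. The PyTorch sketch below is a minimal, hypothetical rendering of that interface, not the authors' method: the names ObjectSpec and ContextPredictor, the 150-category label set, the 64x64 layout grid, and the toy convolutional completion network are all illustrative assumptions. It shows one plausible convention, rasterizing each input object as a shape mask placed at its position in a per-category canvas, then predicting a category for every cell of the full layout.

```python
# Hypothetical sketch of the object-level scene context prediction task.
# NOT the paper's architecture; all names and sizes are assumptions.
from dataclasses import dataclass
import torch
import torch.nn as nn

NUM_CATEGORIES = 150   # assumed size of the semantic label set
GRID = 64              # assumed spatial resolution of the layout map

@dataclass
class ObjectSpec:
    """Properties of one standalone input object."""
    category: int        # semantic class index
    mask: torch.Tensor   # (GRID, GRID) binary shape mask, already placed
                         # at the object's position in the scene canvas

def to_partial_layout(objects: list[ObjectSpec]) -> torch.Tensor:
    """Rasterize the given objects into a partial semantic layout with one
    channel per category; cells not covered by any object stay all-zero."""
    layout = torch.zeros(NUM_CATEGORIES, GRID, GRID)
    for obj in objects:
        layout[obj.category] = torch.maximum(layout[obj.category], obj.mask)
    return layout

class ContextPredictor(nn.Module):
    """Toy stand-in network mapping a partial layout to per-cell category
    scores for a complete object-level scene layout."""
    def __init__(self) -> None:
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(NUM_CATEGORIES, 64, 3, padding=1), nn.ReLU(),
            nn.Conv2d(64, 64, 3, padding=1), nn.ReLU(),
            nn.Conv2d(64, NUM_CATEGORIES, 1),
        )

    def forward(self, partial: torch.Tensor) -> torch.Tensor:
        return self.net(partial)

# Usage: hallucinate the context around a single standalone object.
mask = torch.zeros(GRID, GRID)
mask[20:44, 20:44] = 1.0                 # square "object" at its scene position
obj = ObjectSpec(category=7, mask=mask)  # category index 7 is arbitrary

model = ContextPredictor()
scores = model(to_partial_layout([obj]).unsqueeze(0))  # (1, C, GRID, GRID)
layout = scores.argmax(dim=1)            # (1, GRID, GRID) category per cell
```

Under this reading, the predicted layout is itself an intermediate representation: the abstract's image-synthesis result would correspond to feeding such a completed layout into a separate layout-to-image generator.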