Learning Hierarchical Space Tiling for Scene Modeling, Parsing and Attribute Tagging.

Publication information

IEEE Trans Pattern Anal Mach Intell. 2015 Dec;37(12):2478-91. doi: 10.1109/TPAMI.2015.2424880.

Abstract

A typical scene category contains an enormous number of distinct scene configurations that are composed of objects and regions of varying shapes in different layouts. In this paper, we first propose a representation named hierarchical space tiling (HST) to quantize the huge and continuous scene configuration space. Then, we augment the HST with attributes (nouns and adjectives) to describe the semantics of the objects and regions inside a scene. We present a weakly supervised method for simultaneously learning the scene configurations and attributes from a collection of natural images associated with descriptive text. The precise locations of attributes are unknown in the input and are mapped to the HST nodes through learning. Starting with a full HST, we iteratively estimate the HST model under a learning-by-parsing framework. Given a test image, we compute the most probable parse tree with the associated attributes by dynamic programming. We quantitatively analyze the representative efficiency of HST and show that the learned representation is less ambiguous and has semantically meaningful inner concepts. We apply our model to four tasks: scene classification, attribute recognition, attribute localization, and pixel-wise scene labeling, and show performance improvements as well as higher efficiency.
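To make the dynamic-programming step in the abstract more concrete, here is a minimal Python sketch of finding the best-scoring parse over a recursively split grid. It is an illustration under stated assumptions, not the authors' model: the function `best_parse`, the per-cell `scores` table, the labels, and the `leaf_penalty` term are all hypothetical, and the real HST uses a richer tile dictionary and learned probabilities.

```python
from functools import lru_cache


def best_parse(scores, leaf_penalty=1.0):
    """Return (score, tree) for the best parse of a small grid.

    scores[label][r][c] is a hypothetical log-likelihood of `label` at
    grid cell (r, c). A tile (r0, c0, r1, c1) covers rows r0..r1-1 and
    columns c0..c1-1. Each tile either terminates with a single label
    or is split into two sub-tiles by one horizontal or vertical cut.
    """
    labels = list(scores)
    rows = len(scores[labels[0]])
    cols = len(scores[labels[0]][0])

    def tile_sum(label, r0, c0, r1, c1):
        # Total log-likelihood of assigning one label to the whole tile.
        return sum(scores[label][r][c]
                   for r in range(r0, r1) for c in range(c0, c1))

    @lru_cache(maxsize=None)
    def solve(r0, c0, r1, c1):
        # Option 1: stop here and label the whole tile (terminal node).
        label = max(labels, key=lambda l: tile_sum(l, r0, c0, r1, c1))
        best_score = tile_sum(label, r0, c0, r1, c1) - leaf_penalty
        best_tree = ("leaf", (r0, c0, r1, c1), label)

        # Option 2: cut horizontally and parse both parts independently.
        for r in range(r0 + 1, r1):
            top, bottom = solve(r0, c0, r, c1), solve(r, c0, r1, c1)
            if top[0] + bottom[0] > best_score:
                best_score = top[0] + bottom[0]
                best_tree = ("split", top[1], bottom[1])

        # Option 3: cut vertically and parse both parts independently.
        for c in range(c0 + 1, c1):
            left, right = solve(r0, c0, r1, c), solve(r0, c, r1, c1)
            if left[0] + right[0] > best_score:
                best_score = left[0] + right[0]
                best_tree = ("split", left[1], right[1])

        return best_score, best_tree

    return solve(0, 0, rows, cols)


# Toy 2x2 input: "sky" fits the top row, "ground" fits the bottom row.
toy_scores = {
    "sky": [[0, 0], [-5, -5]],
    "ground": [[-5, -5], [0, 0]],
}
score, tree = best_parse(toy_scores, leaf_penalty=1.0)
print(score, tree)
```

On this toy input the best parse cuts the grid once horizontally, labeling the top tile "sky" and the bottom tile "ground". Memoizing on the tile coordinates is what makes this dynamic programming: each sub-tile is solved once and reused by every larger tile that contains it.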

Similar articles

Hierarchical Scene Parsing by Weakly Supervised Learning with Image Descriptions.

IEEE Trans Pattern Anal Mach Intell. 2019 Mar;41(3):596-610. doi: 10.1109/TPAMI.2018.2799846. Epub 2018 Jan 30.

A Reconfigurable Tangram Model for Scene Representation and Categorization.

IEEE Trans Image Process. 2016 Jan;25(1):150-66. doi: 10.1109/TIP.2015.2498407. Epub 2015 Nov 5.

Single-View 3D Scene Reconstruction and Parsing by Attribute Grammar.

IEEE Trans Pattern Anal Mach Intell. 2018 Mar;40(3):710-725. doi: 10.1109/TPAMI.2017.2689007. Epub 2017 Mar 29.

Weakly-Supervised Image Annotation and Segmentation with Objects and Attributes.

IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2525-2538. doi: 10.1109/TPAMI.2016.2645157. Epub 2016 Dec 26.

Basic level scene understanding: categories, attributes and structures.

Front Psychol. 2013 Aug 29;4:506. doi: 10.3389/fpsyg.2013.00506. eCollection 2013.

On Symbiosis of Attribute Prediction and Semantic Segmentation.

IEEE Trans Pattern Anal Mach Intell. 2021 May;43(5):1620-1635. doi: 10.1109/TPAMI.2019.2956039. Epub 2021 Apr 1.

Unifying Visual Attribute Learning with Object Recognition in a Multiplicative Framework.

IEEE Trans Pattern Anal Mach Intell. 2019 Jul;41(7):1747-1760. doi: 10.1109/TPAMI.2018.2836461. Epub 2018 Jun 4.

Scene Parsing From an MAP Perspective.

IEEE Trans Cybern. 2015 Sep;45(9):1876-86. doi: 10.1109/TCYB.2014.2361489. Epub 2014 Nov 4.

Robust Scene Parsing by Mining Supportive Knowledge From Dataset.

IEEE Trans Neural Netw Learn Syst. 2023 May;34(5):2633-2646. doi: 10.1109/TNNLS.2021.3107194. Epub 2023 May 2.
