Adopting Abstract Images for Semantic Scene Understanding.

Publication information

IEEE Trans Pattern Anal Mach Intell. 2016 Apr;38(4):627-38. doi: 10.1109/TPAMI.2014.2366143.

Abstract

Relating visual information to its linguistic semantic meaning remains an open and challenging area of research. The semantic meaning of images depends on the presence of objects, their attributes and their relations to other objects. But precisely characterizing this dependence requires extracting complex visual information from an image, which is in general a difficult and yet unsolved problem. In this paper, we propose studying semantic information in abstract images created from collections of clip art. Abstract images provide several advantages over real images. They allow for the direct study of how to infer high-level semantic information, since they remove the reliance on noisy low-level object, attribute and relation detectors, or the tedious hand-labeling of real images. Importantly, abstract images also make it possible to generate sets of semantically similar scenes. Finding analogous sets of real images that are semantically similar would be nearly impossible. We create 1,002 sets of 10 semantically similar abstract images with corresponding written descriptions. We thoroughly analyze this dataset to discover semantically important features, the relations of words to visual features and methods for measuring semantic similarity. Finally, we study the relation between the saliency and memorability of objects and their semantic importance.
