Weakly-Supervised Image Annotation and Segmentation with Objects and Attributes.

Publication information

IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2525-2538. doi: 10.1109/TPAMI.2016.2645157. Epub 2016 Dec 26.

Abstract

We propose to model complex visual scenes using a non-parametric Bayesian model learned from weakly labelled images abundant on media sharing sites such as Flickr. Given weak image-level annotations of objects and attributes without locations or associations between them, our model aims to learn the appearance of object and attribute classes as well as their association on each object instance. Once learned, given an image, our model can be deployed to tackle a number of vision problems in a joint and coherent manner, including recognising objects in the scene (automatic object annotation), describing objects using their attributes (attribute prediction and association), and localising and delineating the objects (object detection and semantic segmentation). This is achieved by developing a novel Weakly Supervised Markov Random Field Stacked Indian Buffet Process (WS-MRF-SIBP) that models objects and attributes as latent factors and explicitly captures their correlations within and across superpixels. Extensive experiments on benchmark datasets demonstrate that our weakly supervised model significantly outperforms weakly supervised alternatives and is often comparable with existing strongly supervised models on a variety of tasks including semantic segmentation, automatic image annotation and retrieval based on object-attribute associations.
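As a rough illustration of the non-parametric ingredient named in the abstract, the following sketch draws a binary matrix from the standard Indian Buffet Process prior, in which the number of latent factors is not fixed in advance. It is not the WS-MRF-SIBP model: the weak supervision, the Markov random field correlations and the stacking are all omitted. The function names `sample_poisson` and `sample_ibp`, the parameter `alpha`, and the reading of rows as superpixels and columns as object or attribute factors are assumptions made for this example only.

```python
import math
import random


def sample_poisson(rate, rng):
    """Draw a Poisson variate with Knuth's method (adequate for small rates)."""
    threshold = math.exp(-rate)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def sample_ibp(num_items, alpha, seed=0):
    """Draw a binary item-by-factor matrix from the standard IBP prior.

    Hypothetical reading for this page: each row is a superpixel and each
    column is a latent factor (an object or attribute class).
    """
    rng = random.Random(seed)
    rows = []
    factor_counts = []  # number of earlier items that use each factor
    for i in range(1, num_items + 1):
        # Reuse an existing factor with probability (its usage count) / i.
        row = [1 if rng.random() < m / i else 0 for m in factor_counts]
        # Open Poisson(alpha / i) brand-new factors for this item.
        new = sample_poisson(alpha / i, rng)
        row.extend([1] * new)
        factor_counts = [m + z for m, z in zip(factor_counts, row)] + [1] * new
        rows.append(row)
    # Pad earlier rows with zeros so the matrix is rectangular.
    width = len(factor_counts)
    return [r + [0] * (width - len(r)) for r in rows]


if __name__ == "__main__":
    for row in sample_ibp(num_items=6, alpha=2.0, seed=1):
        print(row)
```

Larger values of `alpha` tend to produce more columns, which is the sense in which the number of factors is learned rather than fixed.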
