School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213-3890, USA.
IEEE Trans Image Process. 2012 Mar;21(3):1339-51. doi: 10.1109/TIP.2011.2169269. Epub 2011 Sep 23.
The number of digital images is growing rapidly, and organizing these resources effectively has become an important challenge. As a way to facilitate image categorization and retrieval, automatic image annotation has received much research attention. Given the large number of unlabeled images available, it is beneficial to develop an effective mechanism that leverages unlabeled images for large-scale image annotation. Meanwhile, a single image is usually associated with multiple labels, which are inherently correlated with each other. A straightforward approach to image annotation is to decompose the problem into multiple independent single-label problems, but this ignores the underlying correlations among different labels. In this paper, we propose a new inductive algorithm for image annotation that integrates label correlation mining and visual similarity mining into a joint framework. We first construct a graph model from image visual features. A multilabel classifier is then trained by simultaneously uncovering the shared structure common to different labels and the visual-graph-embedded label prediction matrix for image annotation. We show that the globally optimal solution of the proposed framework can be obtained by generalized eigen-decomposition. We apply the framework to both web image annotation and personal album labeling on the NUS-WIDE, MSRA MM 2.0, and Kodak image data sets, evaluated with the AUC metric. Extensive experiments on large-scale image databases collected from the web and personal albums show that the proposed algorithm can exploit both labeled and unlabeled data for image annotation and outperforms competing algorithms.
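To make the two computational ingredients of the abstract concrete, the following Python sketch builds a visual-similarity graph over image features and solves a generalized eigenproblem for a globally optimal solution. This is a minimal illustration, not the authors' released code: the kNN graph construction, the toy objective matrices, and all variable names are assumptions introduced here for illustration only.

```python
# Minimal sketch (not the paper's implementation) of two steps the abstract
# mentions: (1) a visual-similarity graph built from image features and
# (2) a globally optimal solution via generalized eigen-decomposition.
import numpy as np
from scipy.linalg import eigh
from sklearn.neighbors import kneighbors_graph

def visual_graph_laplacian(features, k=10):
    """Build a symmetric kNN visual-similarity graph and return its Laplacian."""
    W = kneighbors_graph(features, n_neighbors=k, mode="connectivity").toarray()
    W = np.maximum(W, W.T)            # symmetrize the adjacency matrix
    D = np.diag(W.sum(axis=1))        # degree matrix
    return D - W                      # unnormalized graph Laplacian L = D - W

def solve_generalized_eig(A, B, n_components):
    """Solve A v = lambda B v and keep eigenvectors of the smallest eigenvalues."""
    eigvals, eigvecs = eigh(A, B)     # symmetric-definite generalized problem
    return eigvecs[:, :n_components]

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(200, 64))    # 200 images, 64-d visual features (toy data)
    L = visual_graph_laplacian(X, k=10)
    # Toy stand-in for a joint objective: graph smoothness (L) regularized by a
    # small ridge term, posed as a generalized eigenproblem with B = I.
    A = L + 1e-3 * np.eye(L.shape[0])
    B = np.eye(L.shape[0])
    V = solve_generalized_eig(A, B, n_components=5)
    print(V.shape)                    # (200, 5)
```

In the paper's actual framework, the matrices entering the eigenproblem would be derived from the joint objective coupling label correlations with the visual graph; the identity matrices above are placeholders chosen only to keep the sketch self-contained.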