多视图向量值流形正则化的多标签图像分类。

Multiview vector-valued manifold regularization for multilabel image classification.

出版信息

IEEE Trans Neural Netw Learn Syst. 2013 May;24(5):709-22. doi: 10.1109/TNNLS.2013.2238682.

DOI:10.1109/TNNLS.2013.2238682

Abstract

In computer vision, image datasets used for classification are naturally associated with multiple labels and comprised of multiple views, because each image may contain several objects (e.g., pedestrian, bicycle, and tree) and is properly characterized by multiple visual features (e.g., color, texture, and shape). Currently, available tools ignore either the label relationship or the view complementarily. Motivated by the success of the vector-valued function that constructs matrix-valued kernels to explore the multilabel structure in the output space, we introduce multiview vector-valued manifold regularization (MV(3)MR) to integrate multiple features. MV(3)MR exploits the complementary property of different features and discovers the intrinsic local geometry of the compact support shared by different features under the theme of manifold regularization. We conduct extensive experiments on two challenging, but popular, datasets, PASCAL VOC' 07 and MIR Flickr, and validate the effectiveness of the proposed MV(3)MR for image classification.

摘要

在计算机视觉中，用于分类的图像数据集通常与多个标签相关联，并由多个视图组成，因为每张图像可能包含多个对象（例如行人、自行车和树），并且可以通过多个视觉特征（例如颜色、纹理和形状）进行适当的描述。目前，现有的工具要么忽略标签之间的关系，要么忽略视图之间的互补性。受向量值函数构建矩阵值核以探索输出空间中多标签结构成功的启发，我们引入了多视图向量值流形正则化（MV(3)MR）来集成多个特征。MV(3)MR 利用不同特征的互补性，并在流形正则化的主题下发现不同特征之间共享的紧致支持的内在局部几何结构。我们在两个具有挑战性但流行的数据集 PASCAL VOC' 07 和 MIR Flickr 上进行了广泛的实验，并验证了所提出的 MV(3)MR 对图像分类的有效性。

相似文献

Multiview vector-valued manifold regularization for multilabel image classification.

IEEE Trans Neural Netw Learn Syst. 2013 May;24(5):709-22. doi: 10.1109/TNNLS.2013.2238682.

Multiview matrix completion for multilabel image classification.

IEEE Trans Image Process. 2015 Aug;24(8):2355-68. doi: 10.1109/TIP.2015.2421309. Epub 2015 Apr 9.

Manifold regularized multitask learning for semi-supervised multilabel image classification.

IEEE Trans Image Process. 2013 Feb;22(2):523-36. doi: 10.1109/TIP.2012.2218825. Epub 2012 Sep 13.

Multiview Hessian regularization for image annotation.

IEEE Trans Image Process. 2013 Jul;22(7):2676-87. doi: 10.1109/TIP.2013.2255302. Epub 2013 Mar 28.

On combining multiple features for cartoon character retrieval and clip synthesis.

IEEE Trans Syst Man Cybern B Cybern. 2012 Oct;42(5):1413-27. doi: 10.1109/TSMCB.2012.2192108. Epub 2012 Apr 25.

Intrinsic regression models for manifold-valued data.

Med Image Comput Comput Assist Interv. 2009;12(Pt 2):192-9.

Discriminative shared Gaussian processes for multiview and view-invariant facial expression recognition.

IEEE Trans Image Process. 2015 Jan;24(1):189-204. doi: 10.1109/TIP.2014.2375634. Epub 2014 Nov 26.

Generalization characteristics of complex-valued feedforward neural networks in relation to signal coherence.

IEEE Trans Neural Netw Learn Syst. 2012 Apr;23(4):541-51. doi: 10.1109/TNNLS.2012.2183613.

Multi-label image categorization with sparse factor representation.

IEEE Trans Image Process. 2014 Mar;23(3):1028-37. doi: 10.1109/TIP.2014.2298978.

Effects of magnetic resonance image interpolation on the results of texture-based pattern classification: a phantom study.

Invest Radiol. 2009 Jul;44(7):405-11. doi: 10.1097/RLI.0b013e3181a50a66.

引用本文的文献

A Low-Measurement-Cost-Based Multi-Strategy Hyperspectral Image Classification Scheme.

Sensors (Basel). 2024 Oct 15;24(20):6647. doi: 10.3390/s24206647.

Label recovery and label correlation co-learning for multi-view multi-label classification with incomplete labels.

Appl Intell (Dordr). 2023;53(8):9444-9462. doi: 10.1007/s10489-022-03945-y. Epub 2022 Aug 9.

Multi-Layer Multi-View Classification for Alzheimer's Disease Diagnosis.

Proc AAAI Conf Artif Intell. 2018 Feb;2018:4406-4413.

Biview learning for human posture segmentation from 3D points cloud.

PLoS One. 2014 Jan 20;9(1):e85811. doi: 10.1371/journal.pone.0085811. eCollection 2014.

Multiview locally linear embedding for effective medical image retrieval.

PLoS One. 2013 Dec 13;8(12):e82409. doi: 10.1371/journal.pone.0082409. eCollection 2013.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

多视图向量值流形正则化的多标签图像分类。

Multiview vector-valued manifold regularization for multilabel image classification.

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献