使用生成/判别混合方法进行场景分类。

Scene classification using a hybrid generative/discriminative approach.

作者信息

Bosch Anna, Zisserman Andrew, Muñoz Xavier

机构信息

Computer Vision and Robotics Group, Universitat de Girona, Campus Montilivi, Avenida Lluís Santaló s/n, Girona, Spain.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2008 Apr;30(4):712-27. doi: 10.1109/TPAMI.2007.70716.

DOI:10.1109/TPAMI.2007.70716

PMID:18276975

Abstract

We investigate whether dimensionality reduction using a latent generative model is beneficial for the task of weakly supervised scene classification. In detail we are given a set of labelled images of scenes (e.g. coast, forest, city, river, etc) and our objective is to classify a new image into one of these categories. Our approach consists of first discovering latent "topics" using probabilistic Latent Semantic Analysis (pLSA), a generative model from the statistical text literature here applied to a bag of visual words representation for each image, and subsequently training a multi-way classifier on the topic distribution vector for each image. We compare this approach to that of representing each image by a bag of visual words vector directly, and training a multi-way classifier on these vectors. To this end we introduce a novel vocabulary using dense colour SIFT descriptors, and then investigate the classification performance under changes in the size of the visual vocabulary, the number of latent topics learnt, and the type of discriminative classifier used (k-nearest neighbour or SVM). We achieve superior classification performance to recent publications that have used a bag of visual word representation, in all cases using the authors' own datasets and testing protocols. We also investigate the gain in adding spatial information. We show applications to image retrieval with relevance feedback and to scene classification in videos.

摘要

我们研究使用潜在生成模型进行降维是否有利于弱监督场景分类任务。具体来说，我们有一组带标签的场景图像（如海岸、森林、城市、河流等），我们的目标是将新图像分类到这些类别中的一个。我们的方法包括首先使用概率潜在语义分析（pLSA）发现潜在“主题”，pLSA是一种来自统计文本领域的生成模型，这里应用于每个图像的视觉词袋表示，随后针对每个图像的主题分布向量训练一个多类分类器。我们将这种方法与直接用视觉词袋向量表示每个图像并在这些向量上训练多类分类器的方法进行比较。为此，我们使用密集颜色SIFT描述符引入了一种新颖的词汇表，然后研究在视觉词汇表大小、学习到的潜在主题数量以及所使用的判别分类器类型（k近邻或支持向量机）变化的情况下的分类性能。在所有情况下，使用作者自己的数据集和测试协议，我们实现了比最近使用视觉词袋表示的出版物更好的分类性能。我们还研究了添加空间信息带来的增益。我们展示了其在具有相关反馈的图像检索以及视频场景分类中的应用。

相似文献

Scene classification using a hybrid generative/discriminative approach.使用生成/判别混合方法进行场景分类。

IEEE Trans Pattern Anal Mach Intell. 2008 Apr;30(4):712-27. doi: 10.1109/TPAMI.2007.70716.

A thousand words in a scene.一个场景中有一千个单词。

IEEE Trans Pattern Anal Mach Intell. 2007 Sep;29(9):1575-89. doi: 10.1109/TPAMI.2007.1155.

BM3 E: discriminative density propagation for visual tracking.BM3 E：用于视觉跟踪的判别密度传播

IEEE Trans Pattern Anal Mach Intell. 2007 Nov;29(11):2030-44. doi: 10.1109/TPAMI.2007.1111.

Visual tracker using sequential bayesian learning: discriminative, generative, and hybrid.使用序贯贝叶斯学习的视觉跟踪器：判别式、生成式和混合式。

IEEE Trans Syst Man Cybern B Cybern. 2008 Dec;38(6):1578-91. doi: 10.1109/TSMCB.2008.928226.

A discriminative learning framework with pairwise constraints for video object classification.一种用于视频对象分类的带有成对约束的判别式学习框架。

IEEE Trans Pattern Anal Mach Intell. 2006 Apr;28(4):578-93. doi: 10.1109/TPAMI.2006.65.

Universal and adapted vocabularies for generic visual categorization.用于通用视觉分类的通用和适应性词汇表。

IEEE Trans Pattern Anal Mach Intell. 2008 Jul;30(7):1243-56. doi: 10.1109/TPAMI.2007.70755.

Randomized clustering forests for image classification.用于图像分类的随机聚类森林

IEEE Trans Pattern Anal Mach Intell. 2008 Sep;30(9):1632-46. doi: 10.1109/TPAMI.2007.70822.

Semisupervised learning for a hybrid generative/discriminative classifier based on the maximum entropy principle.基于最大熵原理的混合生成/判别式分类器的半监督学习

IEEE Trans Pattern Anal Mach Intell. 2008 Mar;30(3):424-37. doi: 10.1109/TPAMI.2007.70710.

Dynamosaicing: mosaicing of dynamic scenes.动态拼接：动态场景的拼接。

IEEE Trans Pattern Anal Mach Intell. 2007 Oct;29(10):1789-801. doi: 10.1109/TPAMI.2007.1091.

Discovering thematic objects in image collections and videos.发现图像集合和视频中的主题对象。

IEEE Trans Image Process. 2012 Apr;21(4):2207-19. doi: 10.1109/TIP.2011.2181952. Epub 2011 Dec 26.

引用本文的文献

Remote Sensing Approaches for Meteorological Disaster Monitoring: Recent Achievements and New Challenges.遥感在气象灾害监测中的应用：最新进展与新挑战。

Int J Environ Res Public Health. 2022 Mar 20;19(6):3701. doi: 10.3390/ijerph19063701.

Assessment of Camouflage Effectiveness Based on Perceived Color Difference and Gradient Magnitude.基于颜色差异感知和梯度幅度的伪装效果评估。

Sensors (Basel). 2020 Aug 19;20(17):4672. doi: 10.3390/s20174672.

An efficient image descriptor for image classification and CBIR.一种用于图像分类和基于内容的图像检索的高效图像描述符。

Optik (Stuttg). 2020 Jul;214:164833. doi: 10.1016/j.ijleo.2020.164833. Epub 2020 May 4.

Subjective Ratings of and : Correlations With Statistical Image Properties in Western Oil Paintings.《西方油画中与的主观评分：与统计图像属性的相关性》（此处两个空格部分原文缺失具体内容）

Iperception. 2017 Jun 28;8(3):2041669517715474. doi: 10.1177/2041669517715474. eCollection 2017 May-Jun.

Stacked Predictive Sparse Decomposition for Classification of Histology Sections.用于组织学切片分类的堆叠预测稀疏分解

Int J Comput Vis. 2015 May;113(1):3-18. doi: 10.1007/s11263-014-0790-9. Epub 2014 Dec 23.

Dictionary Pruning with Visual Word Significance for Medical Image Retrieval.基于视觉词重要性的词典剪枝在医学图像检索中的应用

Neurocomputing (Amst). 2016 Feb 12;177:75-88. doi: 10.1016/j.neucom.2015.11.008. Epub 2015 Nov 17.

Pairwise Latent Semantic Association for Similarity Computation in Medical Imaging.用于医学成像中相似性计算的成对潜在语义关联

IEEE Trans Biomed Eng. 2016 May;63(5):1058-1069. doi: 10.1109/TBME.2015.2478028. Epub 2015 Sep 10.

Stacked Predictive Sparse Coding for Classification of Distinct Regions of Tumor Histopathology.用于肿瘤组织病理学不同区域分类的堆叠预测稀疏编码

Proc IEEE Int Conf Comput Vis. 2013:169-176. doi: 10.1109/ICCV.2013.28.

Classification of Tumor Histology via Morphometric Context.通过形态测量背景对肿瘤组织学进行分类。

Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit. 2013 Jun 23;2013. doi: 10.1109/CVPR.2013.286.

Generative-discriminative basis learning for medical imaging.基于生成-判别式的医学影像基础学习。

IEEE Trans Med Imaging. 2012 Jan;31(1):51-69. doi: 10.1109/TMI.2011.2162961. Epub 2011 Jul 25.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用生成/判别混合方法进行场景分类。

Scene classification using a hybrid generative/discriminative approach.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献