自然场景中的局部特征的多尺度空间连接及其场景分类。

Multi-scale spatial concatenations of local features in natural scenes and scene classification.

机构信息

Brain and Behavior Discovery Institute, Georgia Regents University, Augusta, Georgia, United States of America.

出版信息

PLoS One. 2013 Sep 30;8(9):e76393. doi: 10.1371/journal.pone.0076393. eCollection 2013.

DOI:10.1371/journal.pone.0076393

PMID:24098789

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3787016/

Abstract

How does the visual system encode natural scenes? What are the basic structures of natural scenes? In current models of scene perception, there are two broad feature representations, global and local representations. Both representations are useful and have some successes; however, many observations on human scene perception seem to point to an intermediate-level representation. In this paper, we proposed natural scene structures, i.e., multi-scale spatial concatenations of local features, as an intermediate-level representation of natural scenes. To compile the natural scene structures, we first sampled a large number of multi-scale circular scene patches in a hexagonal configuration. We then performed independent component analysis on the patches and classified the independent components into a set of clusters using the K-means method. Finally, we obtained a set of natural scene structures, each of which is characterized by a set of dominant clusters of independent components. We examined a range of statistics of the natural scene structures, compiled from two widely used datasets of natural scenes, and modeled their spatial arrangements at larger spatial scales using adjacency matrices. We found that the natural scene structures include a full range of concatenations of visual features in natural scenes, and can be used to encode spatial information at various scales. We then selected a set of natural scene structures with high information, and used the occurring frequencies and the eigenvalues of the adjacency matrices to classify scenes in the datasets. We found that the performance of this model is comparable to or better than the state-of-the-art models on the two datasets. These results suggest that the natural scene structures are a useful intermediate-level representation of visual scenes for our understanding of natural scene perception.

摘要

视觉系统如何对自然场景进行编码？自然场景的基本结构是什么？在当前的场景感知模型中，存在两种广泛的特征表示，即全局和局部表示。这两种表示都很有用，并且取得了一些成功；然而，许多关于人类场景感知的观察似乎指向一种中间层次的表示。在本文中，我们提出了自然场景结构，即局部特征的多尺度空间串联，作为自然场景的中间层次表示。为了编译自然场景结构，我们首先以六边形配置在大量多尺度圆形场景斑块上进行采样。然后，我们对斑块进行独立成分分析，并使用 K-均值方法将独立成分分类为一组聚类。最后，我们获得了一组自然场景结构，每个结构都由一组独立成分的主导聚类来特征化。我们检查了从两个广泛使用的自然场景数据集编译的自然场景结构的一系列统计信息，并使用邻接矩阵对较大空间尺度上的空间排列进行建模。我们发现自然场景结构包括自然场景中视觉特征的各种串联，并且可以用于编码各种尺度的空间信息。然后，我们选择了一组具有高信息量的自然场景结构，并使用邻接矩阵的出现频率和特征值对数据集中的场景进行分类。我们发现该模型的性能与两个数据集上的最新模型相当或更好。这些结果表明，自然场景结构是理解自然场景感知的一种有用的中间层次视觉场景表示。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f861/3787016/9284f3f4003a/pone.0076393.g001.jpg

相似文献

Multi-scale spatial concatenations of local features in natural scenes and scene classification.

PLoS One. 2013 Sep 30;8(9):e76393. doi: 10.1371/journal.pone.0076393. eCollection 2013.

Robust action recognition using multi-scale spatial-temporal concatenations of local features as natural action structures.

PLoS One. 2012;7(10):e46686. doi: 10.1371/journal.pone.0046686. Epub 2012 Oct 4.

Spatial scene representations formed by self-organizing learning in a hippocampal extension of the ventral visual system.

Eur J Neurosci. 2008 Nov;28(10):2116-27. doi: 10.1111/j.1460-9568.2008.06486.x.

Rapid contextualization of fragmented scene information in the human visual system.

Neuroimage. 2020 Oct 1;219:117045. doi: 10.1016/j.neuroimage.2020.117045. Epub 2020 Jun 12.

The Neural Dynamics of Attentional Selection in Natural Scenes.

J Neurosci. 2016 Oct 12;36(41):10522-10528. doi: 10.1523/JNEUROSCI.1385-16.2016.

From image statistics to scene gist: evoked neural activity reveals transition from low-level natural image structure to scene category.

J Neurosci. 2013 Nov 27;33(48):18814-24. doi: 10.1523/JNEUROSCI.3128-13.2013.

Contributions of low- and high-level properties to neural processing of visual scenes in the human brain.

Philos Trans R Soc Lond B Biol Sci. 2017 Feb 19;372(1714). doi: 10.1098/rstb.2016.0102. Epub 2017 Jan 2.

Natural scene statistics account for the representation of scene categories in human visual cortex.

Neuron. 2013 Sep 4;79(5):1025-34. doi: 10.1016/j.neuron.2013.06.034. Epub 2013 Aug 8.

Disentangling scene content from spatial boundary: complementary roles for the parahippocampal place area and lateral occipital complex in representing real-world scenes.

J Neurosci. 2011 Jan 26;31(4):1333-40. doi: 10.1523/JNEUROSCI.3885-10.2011.

Decoding individual natural scene representations during perception and imagery.

Front Hum Neurosci. 2014 Feb 12;8:59. doi: 10.3389/fnhum.2014.00059. eCollection 2014.

引用本文的文献

A time-critical adaptive approach for visualizing natural scenes on different devices.

PLoS One. 2015 Feb 27;10(2):e0117586. doi: 10.1371/journal.pone.0117586. eCollection 2015.

本文引用的文献

The ventral visual pathway: an expanded neural framework for the processing of object quality.

Trends Cogn Sci. 2013 Jan;17(1):26-49. doi: 10.1016/j.tics.2012.10.011. Epub 2012 Dec 19.

Detecting natural occlusion boundaries using local cues.

J Vis. 2012 Dec 18;12(13):15. doi: 10.1167/12.13.15.

Robust action recognition using multi-scale spatial-temporal concatenations of local features as natural action structures.

PLoS One. 2012;7(10):e46686. doi: 10.1371/journal.pone.0046686. Epub 2012 Oct 4.

Toward a unified theory of visual area V4.

Neuron. 2012 Apr 12;74(1):12-29. doi: 10.1016/j.neuron.2012.03.011.

Learning intermediate-level representations of form and motion from natural movies.

Neural Comput. 2012 Apr;24(4):827-66. doi: 10.1162/NECO_a_00247. Epub 2011 Dec 14.

A hierarchical probabilistic model for rapid object categorization in natural scenes.

PLoS One. 2011;6(5):e20002. doi: 10.1371/journal.pone.0020002. Epub 2011 May 25.

Neural representations for object perception: structure, category, and adaptive coding.

Annu Rev Neurosci. 2011;34:45-67. doi: 10.1146/annurev-neuro-060909-153218.

Emergence of visual saliency from natural scenes via context-mediated probability distributions coding.

PLoS One. 2010 Dec 29;5(12):e15796. doi: 10.1371/journal.pone.0015796.

CENTRIST: A Visual Descriptor for Scene Categorization.

IEEE Trans Pattern Anal Mach Intell. 2011 Aug;33(8):1489-501. doi: 10.1109/TPAMI.2010.224. Epub 2010 Dec 23.

A high-throughput screening approach to discovering good forms of biologically inspired visual representation.

PLoS Comput Biol. 2009 Nov;5(11):e1000579. doi: 10.1371/journal.pcbi.1000579. Epub 2009 Nov 26.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

自然场景中的局部特征的多尺度空间连接及其场景分类。

Multi-scale spatial concatenations of local features in natural scenes and scene classification.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献