Factorial coding of natural images: how effective are linear models in removing higher-order dependencies?

Author information

Bethge Matthias

Affiliation

Redwood Neuroscience Institute, Menlo Park, CA 94025, USA.

Publication information

J Opt Soc Am A Opt Image Sci Vis. 2006 Jun;23(6):1253-68. doi: 10.1364/josaa.23.001253.

DOI: 10.1364/josaa.23.001253

PMID: 16715144

Abstract

The performance of unsupervised learning models for natural images is evaluated quantitatively by means of information theory. We estimate the gain in statistical independence (the multi-information reduction) achieved with independent component analysis (ICA), principal component analysis (PCA), zero-phase whitening, and predictive coding. Predictive coding is translated into the transform coding framework, where it can be characterized by the constraint of a triangular filter matrix. A randomly sampled whitening basis and the Haar wavelet are included in the comparison as well. All of these methods are compared for different patch sizes, ranging from 2×2 to 16×16 pixels. In spite of large differences in the shape of the basis functions, we find only small differences in the multi-information between all decorrelation transforms (5% or less) for all patch sizes. Among the second-order methods, PCA is optimal for small patch sizes and predictive coding performs best for large patch sizes. The extra gain achieved with ICA is always less than 2%. In conclusion, the edge filters found with ICA yield only a surprisingly small improvement in terms of ICA's actual objective, statistical independence.
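
The two terms the abstract leans on, "multi-information" and "triangular filter matrix", can be illustrated with a small sketch. This is not the paper's estimation procedure. It assumes a toy zero-mean Gaussian model of four pixels in a row with correlation rho**|i-j|, for which the multi-information has the closed form 0.5 * (sum_i log C_ii - log det C). The triangular filter is obtained here as the inverse Cholesky factor of the covariance, which is one standard way to build a lower-triangular whitening matrix and may differ from the construction used in the paper. All function names and the values of rho and n are hypothetical.

```python
import math


def cholesky(c):
    """Lower-triangular L with L L^T = c, for symmetric positive-definite c."""
    n = len(c)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(low[i][k] * low[j][k] for k in range(j))
            if i == j:
                low[i][j] = math.sqrt(c[i][i] - s)
            else:
                low[i][j] = (c[i][j] - s) / low[j][j]
    return low


def invert_lower(low):
    """Inverse of a lower-triangular matrix by forward substitution."""
    n = len(low)
    inv = [[0.0] * n for _ in range(n)]
    for col in range(n):
        for i in range(col, n):
            rhs = 1.0 if i == col else 0.0
            s = sum(low[i][k] * inv[k][col] for k in range(col, i))
            inv[i][col] = (rhs - s) / low[i][i]
    return inv


def matmul(a, b):
    """Plain matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def transpose(a):
    return [list(row) for row in zip(*a)]


def gaussian_multi_information(c):
    """Multi-information (nats) of a zero-mean Gaussian with covariance c."""
    low = cholesky(c)
    log_det = 2.0 * sum(math.log(low[i][i]) for i in range(len(c)))
    sum_log_var = sum(math.log(c[i][i]) for i in range(len(c)))
    return 0.5 * (sum_log_var - log_det)


# Toy covariance (assumed): neighbouring pixels are strongly correlated.
rho, n = 0.9, 4
cov = [[rho ** abs(i - j) for j in range(n)] for i in range(n)]

# Triangular whitening filter: output i depends only on pixels 0..i,
# i.e. it is a normalized prediction error given the preceding pixels.
w = invert_lower(cholesky(cov))
cov_out = matmul(matmul(w, cov), transpose(w))

mi_before = gaussian_multi_information(cov)
mi_after = gaussian_multi_information(cov_out)
print(f"multi-information before: {mi_before:.4f} nats, after: {mi_after:.2e} nats")
```

Under this Gaussian assumption any whitening transform, triangular or not, removes all of the multi-information, so the methods could not be told apart. Differences between decorrelation transforms, and any extra gain from ICA, can only come from the non-Gaussian, higher-order structure of real image patches, which is what the abstract quantifies.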

Similar articles

Automatic construction of active appearance models as an image coding problem.

IEEE Trans Pattern Anal Mach Intell. 2004 Oct;26(10):1380-4. doi: 10.1109/TPAMI.2004.77.

Computationally efficient wavelet affine invariant functions for shape recognition.

IEEE Trans Pattern Anal Mach Intell. 2004 Aug;26(8):1095-9. doi: 10.1109/TPAMI.2004.39.

Effective representation using ICA for face recognition robust to local distortion and partial occlusion.

IEEE Trans Pattern Anal Mach Intell. 2005 Dec;27(12):1977-81. doi: 10.1109/TPAMI.2005.242.

Image restoration of arbitrarily warped documents.

IEEE Trans Pattern Anal Mach Intell. 2004 Oct;26(10):1295-306. doi: 10.1109/TPAMI.2004.87.

Projective moment invariants.

IEEE Trans Pattern Anal Mach Intell. 2004 Oct;26(10):1364-7. doi: 10.1109/TPAMI.2004.89.

Comments on "fundamental limits of reconstruction-based superresolution algorithms under local translation".

IEEE Trans Pattern Anal Mach Intell. 2006 May;28(5):846; discussion 847. doi: 10.1109/TPAMI.2006.91.

The long-range saliency of edge- and corner-based salient points.

IEEE Trans Image Process. 2005 Nov;14(11):1701-6. doi: 10.1109/tip.2005.854490.

Symbol recognition via statistical integration of pixel-level constraint histograms: a new descriptor.

IEEE Trans Pattern Anal Mach Intell. 2005 Feb;27(2):278-81. doi: 10.1109/TPAMI.2005.38.

First order error propagation of the Procrustes method for 3D attitude estimation.

IEEE Trans Pattern Anal Mach Intell. 2005 Feb;27(2):221-9. doi: 10.1109/TPAMI.2005.29.

Cited by

Spatio-chromatic information available from different neural layers via Gaussianization.

J Math Neurosci. 2020 Nov 11;10(1):18. doi: 10.1186/s13408-020-00095-8.

On the Sparse Structure of Natural Sounds and Natural Images: Similarities, Differences, and Implications for Neural Coding.

Front Comput Neurosci. 2019 Jun 26;13:39. doi: 10.3389/fncom.2019.00039. eCollection 2019.

Two-Dimensional Hermite Filters Simplify the Description of High-Order Statistics of Natural Images.

Symmetry (Basel). 2016 Sep;8(9). doi: 10.3390/sym8090098. Epub 2016 Sep 21.

Nonlinear Image Representation Using Divisive Normalization.

Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit. 2008;2008:1-8. doi: 10.1109/CVPR.2008.4587821.

Reducing statistical dependencies in natural signals using radial Gaussianization.

Adv Neural Inf Process Syst. 2008;2008:1009-1016.

A simple model of optimal population coding for sensory systems.

PLoS Comput Biol. 2014 Aug 14;10(8):e1003761. doi: 10.1371/journal.pcbi.1003761. eCollection 2014 Aug.

Spatio-chromatic adaptation via higher-order canonical correlation analysis of natural images.

PLoS One. 2014 Feb 12;9(2):e86481. doi: 10.1371/journal.pone.0086481. eCollection 2014.

A distributed code for color in natural scenes derived from center-surround filtered cone signals.

Front Psychol. 2013 Sep 27;4:661. doi: 10.3389/fpsyg.2013.00661. eCollection 2013.

Temporal adaptation enhances efficient contrast gain control on natural images.

PLoS Comput Biol. 2013;9(1):e1002889. doi: 10.1371/journal.pcbi.1002889. Epub 2013 Jan 31.

How sensitive is the human visual system to the local statistics of natural images?

PLoS Comput Biol. 2013;9(1):e1002873. doi: 10.1371/journal.pcbi.1002873. Epub 2013 Jan 24.
