Bethge Matthias
Redwood Neuroscience Institute, Menlo Park, CA 94025, USA.
J Opt Soc Am A Opt Image Sci Vis. 2006 Jun;23(6):1253-68. doi: 10.1364/josaa.23.001253.
The performance of unsupervised learning models for natural images is evaluated quantitatively by means of information theory. We estimate the gain in statistical independence (the multi-information reduction) achieved with independent component analysis (ICA), principal component analysis (PCA), zero-phase whitening, and predictive coding. Predictive coding is translated into the transform coding framework, where it can be characterized by the constraint of a triangular filter matrix. A randomly sampled whitening basis and the Haar wavelet are included in the comparison as well. The comparison of all these methods is carried out for different patch sizes, ranging from 2x2 to 16x16 pixels. In spite of large differences in the shape of the basis functions, we find only small differences in the multi-information between all decorrelation transforms (5% or less) for all patch sizes. Among the second-order methods, PCA is optimal for small patch sizes and predictive coding performs best for large patch sizes. The extra gain achieved with ICA is always less than 2%. In conclusion, the edge filters found with ICA lead to only a surprisingly small improvement in terms of its actual objective.
通过信息论对自然图像无监督学习模型的性能进行定量评估。我们估计了通过独立成分分析(ICA)、主成分分析(PCA)、零相位白化和预测编码实现的统计独立性增益(多信息减少)。预测编码被转化为变换编码框架,在此框架中它可以由三角滤波器矩阵的约束来表征。比较中还包括随机采样的白化基和哈尔小波。所有这些方法针对从2x2到16x16像素的不同图像块大小进行比较。尽管基函数形状差异很大,但我们发现对于所有图像块大小,所有去相关变换之间的多信息差异都很小(5%或更小)。在二阶方法中,PCA对于小图像块大小是最优的,而预测编码对于大图像块大小表现最佳。ICA实现的额外增益始终小于2%。总之,ICA找到的边缘滤波器在实际目标方面仅带来了惊人的小改进。