Suppr超能文献

用于稀疏表示的字典学习算法。

Dictionary learning algorithms for sparse representation.

作者信息

Kreutz-Delgado Kenneth, Murray Joseph F, Rao Bhaskar D, Engan Kjersti, Lee Te-Won, Sejnowski Terrence J

机构信息

Electrical and Computer Engineering, Jacobs School of Engineering, University of California, San Diego, La Jolla, California 92093-0407, USA.

出版信息

Neural Comput. 2003 Feb;15(2):349-96. doi: 10.1162/089976603762552951.

Abstract

Algorithms for data-driven learning of domain-specific overcomplete dictionaries are developed to obtain maximum likelihood and maximum a posteriori dictionary estimates based on the use of Bayesian models with concave/Schur-concave (CSC) negative log priors. Such priors are appropriate for obtaining sparse representations of environmental signals within an appropriately chosen (environmentally matched) dictionary. The elements of the dictionary can be interpreted as concepts, features, or words capable of succinct expression of events encountered in the environment (the source of the measured signals). This is a generalization of vector quantization in that one is interested in a description involving a few dictionary entries (the proverbial "25 words or less"), but not necessarily as succinct as one entry. To learn an environmentally adapted dictionary capable of concise expression of signals generated by the environment, we develop algorithms that iterate between a representative set of sparse representations found by variants of FOCUSS and an update of the dictionary using these sparse representations. Experiments were performed using synthetic data and natural images. For complete dictionaries, we demonstrate that our algorithms have improved performance over other independent component analysis (ICA) methods, measured in terms of signal-to-noise ratios of separated sources. In the overcomplete case, we show that the true underlying dictionary and sparse sources can be accurately recovered. In tests with natural images, learned overcomplete dictionaries are shown to have higher coding efficiency than complete dictionaries; that is, images encoded with an overcomplete dictionary have both higher compression (fewer bits per pixel) and higher accuracy (lower mean square error).

摘要

开发了用于特定领域超完备字典的数据驱动学习算法,以基于使用具有凹/舒尔凹(CSC)负对数先验的贝叶斯模型来获得最大似然和最大后验字典估计。这种先验适用于在适当选择的(环境匹配的)字典中获得环境信号的稀疏表示。字典的元素可以解释为能够简洁表达环境中遇到的事件(测量信号的来源)的概念、特征或单词。这是矢量量化的一种推广,因为人们感兴趣的是涉及少数字典条目(如众所周知的“25个单词或更少”)的描述,但不一定像一个条目那样简洁。为了学习能够简洁表达环境产生的信号的环境适应字典,我们开发了在由FOCUSS变体找到的一组代表性稀疏表示与使用这些稀疏表示更新字典之间进行迭代的算法。使用合成数据和自然图像进行了实验。对于完备字典,我们证明我们的算法在分离源的信噪比方面比其他独立成分分析(ICA)方法具有更好的性能。在超完备情况下,我们表明可以准确恢复真实的基础字典和稀疏源。在自然图像测试中,表明学习到的超完备字典比完备字典具有更高的编码效率;也就是说,用超完备字典编码的图像具有更高的压缩率(每像素位数更少)和更高的精度(更低的均方误差)。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验