用于前景视频编码的时空在线字典学习的稀疏表示

IEEE Trans Image Process. 2016 Oct;25(10):4580-4595. doi: 10.1109/TIP.2016.2594490. Epub 2016 Jul 27.

Classical dictionary learning methods for video coding suffer from high computational complexity and interfered coding efficiency by disregarding its underlying distribution. This paper proposes a spatio-temporal online dictionary learning (STOL) algorithm to speed up the convergence rate of dictionary learning with a guarantee of approximation error. The proposed algorithm incorporates stochastic gradient descents to form a dictionary of pairs of 3D low-frequency and high-frequency spatio-temporal volumes. In each iteration of the learning process, it randomly selects one sample volume and updates the atoms of dictionary by minimizing the expected cost, rather than optimizes empirical cost over the complete training data, such as batch learning methods, e.g., K-SVD. Since the selected volumes are supposed to be independent identically distributed samples from the underlying distribution, decomposition coefficients attained from the trained dictionary are desirable for sparse representation. Theoretically, it is proved that the proposed STOL could achieve better approximation for sparse representation than K-SVD and maintain both structured sparsity and hierarchical sparsity. It is shown to outperform batch gradient descent methods (K-SVD) in the sense of convergence speed and computational complexity, and its upper bound for prediction error is asymptotically equal to the training error. With lower computational complexity, extensive experiments validate that the STOL-based coding scheme achieves performance improvements than H.264/AVC or High Efficiency Video Coding as well as existing super-resolution-based methods in rate-distortion performance and visual quality.

用于视频编码的经典字典学习方法存在计算复杂度高的问题，并且由于忽略其潜在分布而干扰了编码效率。本文提出了一种时空在线字典学习（STOL）算法，以在保证近似误差的情况下加快字典学习的收敛速度。所提出的算法结合了随机梯度下降，以形成由3D低频和高频时空体积对组成的字典。在学习过程的每次迭代中，它随机选择一个样本体积，并通过最小化期望成本来更新字典的原子，而不是像批量学习方法（例如K-SVD）那样在完整的训练数据上优化经验成本。由于所选体积被认为是来自潜在分布的独立同分布样本，因此从训练字典中获得的分解系数对于稀疏表示是理想的。理论上，证明了所提出的STOL在稀疏表示方面比K-SVD能实现更好的近似，并且能同时保持结构稀疏性和层次稀疏性。在收敛速度和计算复杂度方面，它被证明优于批量梯度下降方法（K-SVD），并且其预测误差的上限渐近等于训练误差。具有较低的计算复杂度，大量实验验证了基于STOL的编码方案在率失真性能和视觉质量方面比H.264/AVC或高效视频编码以及现有的基于超分辨率的方法实现了性能提升。

相似文献

Sparse Representation With Spatio-Temporal Online Dictionary Learning for Promising Video Coding.

IEEE Trans Image Process. 2016 Oct;25(10):4580-4595. doi: 10.1109/TIP.2016.2594490. Epub 2016 Jul 27.

Progressive Dictionary Learning With Hierarchical Predictive Structure for Low Bit-Rate Scalable Video Coding.

IEEE Trans Image Process. 2017 Jun;26(6):2972-2987. doi: 10.1109/TIP.2017.2692882. Epub 2017 Apr 12.

Sparse coded image super-resolution using K-SVD trained dictionary based on regularized orthogonal matching pursuit.

Biomed Mater Eng. 2015;26 Suppl 1:S1399-407. doi: 10.3233/BME-151438.

A Fast Algorithm for Learning Overcomplete Dictionary for Sparse Representation Based on Proximal Operators.

Neural Comput. 2015 Sep;27(9):1951-82. doi: 10.1162/NECO_a_00763. Epub 2015 Jul 10.

Super-resolution CT Image Reconstruction Based on Dictionary Learning and Sparse Representation.

Sci Rep. 2018 Jun 11;8(1):8799. doi: 10.1038/s41598-018-27261-z.

Coupled dictionary training for image super-resolution.

IEEE Trans Image Process. 2012 Aug;21(8):3467-78. doi: 10.1109/TIP.2012.2192127. Epub 2012 Apr 3.

Orthogonal Procrustes Analysis for Dictionary Learning in Sparse Linear Representation.

PLoS One. 2017 Jan 19;12(1):e0169663. doi: 10.1371/journal.pone.0169663. eCollection 2017.

Joint and Direct Optimization for Dictionary Learning in Convolutional Sparse Representation.

IEEE Trans Neural Netw Learn Syst. 2020 Feb;31(2):559-573. doi: 10.1109/TNNLS.2019.2906074. Epub 2019 Apr 19.

Image fusion via nonlocal sparse K-SVD dictionary learning.

Appl Opt. 2016 Mar 1;55(7):1814-23. doi: 10.1364/AO.55.001814.

Alternatively Constrained Dictionary Learning For Image Superresolution.

IEEE Trans Cybern. 2014 Mar;44(3):366-77. doi: 10.1109/TCYB.2013.2256347. Epub 2013 May 2.

引用本文的文献

Progressive Dictionary Learning With Hierarchical Predictive Structure for Low Bit-Rate Scalable Video Coding.

IEEE Trans Image Process. 2017 Jun;26(6):2972-2987. doi: 10.1109/TIP.2017.2692882. Epub 2017 Apr 12.

Suppr 超能文献

核心技术专利：CN118964589B侵权必究

相似文献

Sparse Representation With Spatio-Temporal Online Dictionary Learning for Promising Video Coding.

IEEE Trans Image Process. 2016 Oct;25(10):4580-4595. doi: 10.1109/TIP.2016.2594490. Epub 2016 Jul 27.

Progressive Dictionary Learning With Hierarchical Predictive Structure for Low Bit-Rate Scalable Video Coding.

IEEE Trans Image Process. 2017 Jun;26(6):2972-2987. doi: 10.1109/TIP.2017.2692882. Epub 2017 Apr 12.

Sparse coded image super-resolution using K-SVD trained dictionary based on regularized orthogonal matching pursuit.

Biomed Mater Eng. 2015;26 Suppl 1:S1399-407. doi: 10.3233/BME-151438.

A Fast Algorithm for Learning Overcomplete Dictionary for Sparse Representation Based on Proximal Operators.

Neural Comput. 2015 Sep;27(9):1951-82. doi: 10.1162/NECO_a_00763. Epub 2015 Jul 10.

Super-resolution CT Image Reconstruction Based on Dictionary Learning and Sparse Representation.

Sci Rep. 2018 Jun 11;8(1):8799. doi: 10.1038/s41598-018-27261-z.

Coupled dictionary training for image super-resolution.

IEEE Trans Image Process. 2012 Aug;21(8):3467-78. doi: 10.1109/TIP.2012.2192127. Epub 2012 Apr 3.

Orthogonal Procrustes Analysis for Dictionary Learning in Sparse Linear Representation.

PLoS One. 2017 Jan 19;12(1):e0169663. doi: 10.1371/journal.pone.0169663. eCollection 2017.

Joint and Direct Optimization for Dictionary Learning in Convolutional Sparse Representation.

IEEE Trans Neural Netw Learn Syst. 2020 Feb;31(2):559-573. doi: 10.1109/TNNLS.2019.2906074. Epub 2019 Apr 19.

Image fusion via nonlocal sparse K-SVD dictionary learning.

Appl Opt. 2016 Mar 1;55(7):1814-23. doi: 10.1364/AO.55.001814.

Alternatively Constrained Dictionary Learning For Image Superresolution.

IEEE Trans Cybern. 2014 Mar;44(3):366-77. doi: 10.1109/TCYB.2013.2256347. Epub 2013 May 2.

引用本文的文献

Progressive Dictionary Learning With Hierarchical Predictive Structure for Low Bit-Rate Scalable Video Coding.

IEEE Trans Image Process. 2017 Jun;26(6):2972-2987. doi: 10.1109/TIP.2017.2692882. Epub 2017 Apr 12.

Sparse Representation With Spatio-Temporal Online Dictionary Learning for Promising Video Coding.

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献