• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

渐进式字典学习与分层预测结构相结合的低比特率可扩展视频编码。

Progressive Dictionary Learning With Hierarchical Predictive Structure for Low Bit-Rate Scalable Video Coding.

出版信息

IEEE Trans Image Process. 2017 Jun;26(6):2972-2987. doi: 10.1109/TIP.2017.2692882. Epub 2017 Apr 12.

DOI:10.1109/TIP.2017.2692882
PMID:28422683
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5638692/
Abstract

Dictionary learning has emerged as a promising alternative to the conventional hybrid coding framework. However, the rigid structure of sequential training and prediction degrades its performance in scalable video coding. This paper proposes a progressive dictionary learning framework with hierarchical predictive structure for scalable video coding, especially in low bitrate region. For pyramidal layers, sparse representation based on spatio-temporal dictionary is adopted to improve the coding efficiency of enhancement layers with a guarantee of reconstruction performance. The overcomplete dictionary is trained to adaptively capture local structures along motion trajectories as well as exploit the correlations between the neighboring layers of resolutions. Furthermore, progressive dictionary learning is developed to enable the scalability in temporal domain and restrict the error propagation in a closed-loop predictor. Under the hierarchical predictive structure, online learning is leveraged to guarantee the training and prediction performance with an improved convergence rate. To accommodate with the state-of-the-art scalable extension of H.264/AVC and latest High Efficiency Video Coding (HEVC), standardized codec cores are utilized to encode the base and enhancement layers. Experimental results show that the proposed method outperforms the latest scalable extension of HEVC and HEVC simulcast over extensive test sequences with various resolutions.

摘要

字典学习已成为传统混合编码框架的一种很有前途的替代方法。然而,顺序训练和预测的刚性结构降低了其在可扩展视频编码中的性能。本文提出了一种用于可扩展视频编码的渐进式字典学习框架,具有分层预测结构,特别是在低比特率区域。对于金字塔层,采用基于时空字典的稀疏表示来提高增强层的编码效率,同时保证重建性能。训练过完备字典以自适应地捕获运动轨迹上的局部结构,并利用分辨率的相邻层之间的相关性。此外,渐进式字典学习用于在时域中实现可扩展性,并限制在闭环预测器中的误差传播。在分层预测结构下,利用在线学习来保证训练和预测性能,同时提高收敛速度。为了适应 H.264/AVC 的最新可扩展扩展和最新的高效视频编码 (HEVC),标准化的编解码器内核用于编码基础层和增强层。实验结果表明,该方法在各种分辨率的广泛测试序列上优于最新的 HEVC 可扩展扩展和 HEVC 联播。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/57803c563fff/nihms874820f13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/79a7efc5be46/nihms874820f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/a156785fcefa/nihms874820f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/33e923d96238/nihms874820f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/a4ce2ca823ef/nihms874820f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/8bb78312765a/nihms874820f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/2dce5579511a/nihms874820f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/cf03f62f9818/nihms874820f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/3369ce507b84/nihms874820f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/58dd1e5b7ea4/nihms874820f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/ade89da6a5df/nihms874820f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/3715c56a9ca9/nihms874820f11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/f099e6ed304b/nihms874820f12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/57803c563fff/nihms874820f13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/79a7efc5be46/nihms874820f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/a156785fcefa/nihms874820f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/33e923d96238/nihms874820f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/a4ce2ca823ef/nihms874820f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/8bb78312765a/nihms874820f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/2dce5579511a/nihms874820f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/cf03f62f9818/nihms874820f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/3369ce507b84/nihms874820f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/58dd1e5b7ea4/nihms874820f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/ade89da6a5df/nihms874820f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/3715c56a9ca9/nihms874820f11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/f099e6ed304b/nihms874820f12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ff28/5638692/57803c563fff/nihms874820f13.jpg

相似文献

1
Progressive Dictionary Learning With Hierarchical Predictive Structure for Low Bit-Rate Scalable Video Coding.渐进式字典学习与分层预测结构相结合的低比特率可扩展视频编码。
IEEE Trans Image Process. 2017 Jun;26(6):2972-2987. doi: 10.1109/TIP.2017.2692882. Epub 2017 Apr 12.
2
Sparse Representation With Spatio-Temporal Online Dictionary Learning for Promising Video Coding.用于前景视频编码的时空在线字典学习的稀疏表示
IEEE Trans Image Process. 2016 Oct;25(10):4580-4595. doi: 10.1109/TIP.2016.2594490. Epub 2016 Jul 27.
3
Sparse/DCT (S/DCT) two-layered representation of prediction residuals for video coding.用于视频编码的预测残差的稀疏/DCT(S/DCT)双层表示。
IEEE Trans Image Process. 2013 Jul;22(7):2711-22. doi: 10.1109/TIP.2013.2256917. Epub 2013 Apr 4.
4
Interlayer simplified depth coding for quality scalability on 3D high efficiency video coding.用于3D高效视频编码中质量可伸缩性的层间简化深度编码
ScientificWorldJournal. 2014 Mar 16;2014:841608. doi: 10.1155/2014/841608. eCollection 2014.
5
Bidirectional scalable motion for scalable video coding.双向可扩展运动用于可扩展视频编码。
IEEE Trans Image Process. 2010 Nov;19(11):3059-64. doi: 10.1109/TIP.2010.2050933. Epub 2010 May 20.
6
Learning-Based Rate Control for High Efficiency Video Coding.基于学习的高效视频编码码率控制。
Sensors (Basel). 2023 Mar 30;23(7):3607. doi: 10.3390/s23073607.
7
An estimation-theoretic framework for spatially scalable video coding.一种用于空间可分级视频编码的估计理论框架。
IEEE Trans Image Process. 2014 Aug;23(8):3684-97. doi: 10.1109/TIP.2014.2331761. Epub 2014 Jun 18.
8
Temporally Dependent Rate-Distortion Optimization for Low-Delay Hierarchical Video Coding.基于时域的率失真优化分层视频编码低延迟技术。
IEEE Trans Image Process. 2017 Sep;26(9):4457-4470. doi: 10.1109/TIP.2017.2713598. Epub 2017 Jun 8.
9
Optimizing the hierarchical prediction and coding in HEVC for surveillance and conference videos with background modeling.基于背景建模的监控和会议视频的 HEVC 分层预测和编码优化。
IEEE Trans Image Process. 2014 Oct;23(10):4511-26. doi: 10.1109/TIP.2014.2352036. Epub 2014 Aug 26.
10
λ domain rate control algorithm for high efficiency video coding.用于高效率视频编码的 λ 域率控制算法。
IEEE Trans Image Process. 2014 Sep;23(9):3841-54. doi: 10.1109/TIP.2014.2336550. Epub 2014 Jul 8.

本文引用的文献

1
Sparse Representation With Spatio-Temporal Online Dictionary Learning for Promising Video Coding.用于前景视频编码的时空在线字典学习的稀疏表示
IEEE Trans Image Process. 2016 Oct;25(10):4580-4595. doi: 10.1109/TIP.2016.2594490. Epub 2016 Jul 27.
2
An estimation-theoretic framework for spatially scalable video coding.一种用于空间可分级视频编码的估计理论框架。
IEEE Trans Image Process. 2014 Aug;23(8):3684-97. doi: 10.1109/TIP.2014.2331761. Epub 2014 Jun 18.
3
Sparse/DCT (S/DCT) two-layered representation of prediction residuals for video coding.
用于视频编码的预测残差的稀疏/DCT(S/DCT)双层表示。
IEEE Trans Image Process. 2013 Jul;22(7):2711-22. doi: 10.1109/TIP.2013.2256917. Epub 2013 Apr 4.
4
Bidirectional scalable motion for scalable video coding.双向可扩展运动用于可扩展视频编码。
IEEE Trans Image Process. 2010 Nov;19(11):3059-64. doi: 10.1109/TIP.2010.2050933. Epub 2010 May 20.
5
Image super-resolution via sparse representation.基于稀疏表示的图像超分辨率重建。
IEEE Trans Image Process. 2010 Nov;19(11):2861-73. doi: 10.1109/TIP.2010.2050625. Epub 2010 May 18.
6
Overlapped block motion compensation: an estimation-theoretic approach.重叠块运动补偿:一种估计理论方法。
IEEE Trans Image Process. 1994;3(5):693-9. doi: 10.1109/83.334974.
7
Three-dimensional subband coding with motion compensation.带运动补偿的三维子带编码。
IEEE Trans Image Process. 1994;3(5):559-71. doi: 10.1109/83.334985.
8
Motion-compensated 3-D subband coding of video.视频的运动补偿三维子带编码
IEEE Trans Image Process. 1999;8(2):155-67. doi: 10.1109/83.743851.
9
High performance scalable image compression with EBCOT.采用嵌入式块编码变换(EBCOT)的高性能可扩展图像压缩
IEEE Trans Image Process. 2000;9(7):1158-70. doi: 10.1109/83.847830.
10
Motion estimation using low-band-shift method for wavelet-based moving-picture coding.用于基于小波的运动图像编码的低带移方法的运动估计。
IEEE Trans Image Process. 2000;9(4):577-87. doi: 10.1109/83.841935.