Machine learning-based coding unit depth decisions for flexible complexity allocation in high efficiency video coding.

Publication information

IEEE Trans Image Process. 2015 Jul;24(7):2225-38. doi: 10.1109/TIP.2015.2417498.

DOI:10.1109/TIP.2015.2417498
PMID:25826804
Abstract

In this paper, we propose a machine learning-based fast coding unit (CU) depth decision method for High Efficiency Video Coding (HEVC), which optimizes the complexity allocation at the CU level under given rate-distortion (RD) cost constraints. First, we analyze the quad-tree CU depth decision process in HEVC and model it as a three-level hierarchical binary decision problem. Second, a flexible CU depth decision structure is presented, which allows each CU depth decision to be smoothly traded off between coding complexity and RD performance. Then, a three-output joint classifier consisting of multiple binary classifiers with different parameters is designed to control the risk of false prediction. Finally, a sophisticated RD-complexity model is derived to determine the optimal parameters for the joint classifier, which is capable of minimizing the complexity at each CU depth under given RD degradation constraints. Comparative experiments over various sequences show that the proposed CU depth decision algorithm reduces the computational complexity by 28.82% to 70.93%, and by 51.45% on average, compared with the original HEVC test model. The Bjøntegaard delta peak signal-to-noise ratio and Bjøntegaard delta bit rate are -0.061 dB and 1.98% on average, respectively, which is negligible. The overall performance of the proposed algorithm exceeds that of the state-of-the-art schemes.
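
The hierarchical decision structure described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the abstract only states that a three-output joint classifier is built from multiple binary classifiers, so the interpretation of the three outputs as "split", "no split", and "uncertain" (fall back to the full RD check) is an assumption, as are the names `JointClassifier`, `count_rd_checks`, and `make_tree`, the toy scoring functions, and the thresholds. In the paper the classifier parameters come from an RD-complexity model, which is not reproduced here.

```python
from dataclasses import dataclass
from typing import Callable, Sequence

SPLIT, NO_SPLIT, UNCERTAIN = "split", "no_split", "uncertain"


@dataclass
class JointClassifier:
    """Hypothetical three-output classifier built from two binary scorers."""
    split_score: Callable[[Sequence[float]], float]
    no_split_score: Callable[[Sequence[float]], float]
    split_threshold: float
    no_split_threshold: float

    def decide(self, features: Sequence[float]) -> str:
        says_split = self.split_score(features) >= self.split_threshold
        says_no_split = self.no_split_score(features) >= self.no_split_threshold
        # Only trust a prediction when exactly one binary classifier is confident.
        if says_split and not says_no_split:
            return SPLIT
        if says_no_split and not says_split:
            return NO_SPLIT
        return UNCERTAIN


def count_rd_checks(cu: dict, depth: int, classifiers, max_depth: int = 3) -> int:
    """Count RD cost evaluations needed for `cu` and its quad-tree children."""
    if depth == max_depth:
        return 1  # smallest CU size: nothing left to decide
    decision = classifiers[depth].decide(cu["features"])
    if decision == NO_SPLIT:
        return 1  # early termination: evaluate this depth only
    below = sum(count_rd_checks(child, depth + 1, classifiers, max_depth)
                for child in cu["children"])
    # SPLIT skips the current depth; UNCERTAIN checks it as well as the children.
    return below if decision == SPLIT else 1 + below


def make_tree(value: float, depth: int = 0, max_depth: int = 3) -> dict:
    """Toy quad-tree in which every CU carries the same single feature."""
    children = [] if depth == max_depth else [
        make_tree(value, depth + 1, max_depth) for _ in range(4)]
    return {"features": [value], "children": children}


# One joint classifier per decision level (depths 0, 1, 2), with toy scorers.
classifiers = [
    JointClassifier(lambda f: f[0], lambda f: 1.0 - f[0], 0.7, 0.7)
    for _ in range(3)
]

full_search = count_rd_checks(make_tree(0.5), 0, classifiers)   # always uncertain
early_stop = count_rd_checks(make_tree(0.1), 0, classifiers)    # no split at depth 0
always_split = count_rd_checks(make_tree(0.9), 0, classifiers)  # split at every level
print(full_search, early_stop, always_split)
```

With these toy inputs, an always-uncertain classifier reproduces the exhaustive search (1 + 4 + 16 + 64 = 85 evaluations), while confident predictions skip evaluations. Raising the two thresholds makes more decisions fall into the uncertain case, which trades complexity savings for a lower risk of false prediction.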

Similar articles

1. Machine learning-based coding unit depth decisions for flexible complexity allocation in high efficiency video coding.
IEEE Trans Image Process. 2015 Jul;24(7):2225-38. doi: 10.1109/TIP.2015.2417498.
2. Effective CU size decision for HEVC intracoding.
IEEE Trans Image Process. 2014 Oct;23(10):4232-41. doi: 10.1109/TIP.2014.2341927. Epub 2014 Jul 23.
3. Efficient Intra Mode Selection for Depth-Map Coding Utilizing Spatiotemporal, Inter-Component and Inter-View Correlations in 3D-HEVC.
IEEE Trans Image Process. 2018 Sep;27(9):4195-4206. doi: 10.1109/TIP.2018.2837379.
4. Context-adaptive based CU processing for 3D-HEVC.
PLoS One. 2017 Feb 9;12(2):e0171018. doi: 10.1371/journal.pone.0171018. eCollection 2017.
5. Online Learning-Based Multi-Stage Complexity Control for Live Video Coding.
IEEE Trans Image Process. 2021;30:641-656. doi: 10.1109/TIP.2020.3036766. Epub 2020 Dec 4.
6. Low complexity mode decision for 3D-HEVC.
ScientificWorldJournal. 2014;2014:392505. doi: 10.1155/2014/392505. Epub 2014 Aug 28.
7. DeepQTMT: A Deep Learning Approach for Fast QTMT-Based CU Partition of Intra-Mode VVC.
IEEE Trans Image Process. 2021;30:5377-5390. doi: 10.1109/TIP.2021.3083447. Epub 2021 Jun 3.
8. λ domain rate control algorithm for high efficiency video coding.
IEEE Trans Image Process. 2014 Sep;23(9):3841-54. doi: 10.1109/TIP.2014.2336550. Epub 2014 Jul 8.
9. Low Complexity HEVC Encoder for Visual Sensor Networks.
Sensors (Basel). 2015 Dec 2;15(12):30115-25. doi: 10.3390/s151229788.
10. Edge-based intramode selection for depth-map coding in 3D-HEVC.
IEEE Trans Image Process. 2015 Jan;24(1):155-62. doi: 10.1109/TIP.2014.2375653. Epub 2014 Nov 25.

Cited by

1. Temporal Prediction Model-Based Fast Inter CU Partition for Versatile Video Coding.
Sensors (Basel). 2022 Oct 12;22(20):7741. doi: 10.3390/s22207741.
2. A Fast Decision Algorithm for VVC Intra-Coding Based on Texture Feature and Machine Learning.
Comput Intell Neurosci. 2022 Sep 13;2022:7675749. doi: 10.1155/2022/7675749. eCollection 2022.
3. Decision tree accelerated CTU partition algorithm for intra prediction in versatile video coding.
PLoS One. 2021 Nov 8;16(11):e0258890. doi: 10.1371/journal.pone.0258890. eCollection 2021.
4. Efficient intra mode decision for low complexity HEVC screen content compression.
PLoS One. 2019 Dec 31;14(12):e0226900. doi: 10.1371/journal.pone.0226900. eCollection 2019.
5. Low Complexity HEVC Encoder for Visual Sensor Networks.
Sensors (Basel). 2015 Dec 2;15(12):30115-25. doi: 10.3390/s151229788.