• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于轻量级机器学习的可调式可变视频编码(VVC)帧分区

Tunable VVC Frame Partitioning based on Lightweight Machine Learning.

作者信息

Amestoy Thomas, Mercat Alexandre, Hamidouche Wassim, Menard Daniel, Bergeron Cyril

出版信息

IEEE Trans Image Process. 2019 Sep 6. doi: 10.1109/TIP.2019.2938670.

DOI:10.1109/TIP.2019.2938670
PMID:31502973
Abstract

Block partition structure is a critical module in video coding scheme to achieve significant gap of compression performance. Under the exploration of the future video coding standard, named Versatile Video Coding (VVC), a new Quad Tree Binary Tree (QTBT) block partition structure has been introduced. In addition to the QT block partitioning defined in High Efficiency Video Coding (HEVC) standard, new horizontal and vertical BT partitions are enabled, which drastically increases the encoding time compared to HEVC. In this paper, we propose a lightweight and tunable QTBT partitioning scheme based on a Machine Learning (ML) approach. The proposed solution uses Random Forest classifiers to determine for each coding block the most probable partition modes. To minimize the encoding loss induced by misclassification, risk intervals for classifier decisions are introduced in the proposed solution. By varying the size of risk intervals, tunable trade-off between encoding complexity reduction and coding loss is achieved. The proposed solution implemented in the JEM-7.0 software offers encoding complexity reductions ranging from 30average for only 0.7% to 3.0% Bjxntegaard Delta Rate (BDBR) increase in Random Access (RA) coding configuration, with very slight overhead induced by Random Forest. The proposed solution based on Random Forest classifiers is also efficient to reduce the complexity of the Multi-Type Tree (MTT) partitioning scheme under the VTM-5.0 software, with complexity reductions ranging from 25% to 61% in average for only 0.4% to 2.2% BD-BR increase.

摘要

块划分结构是视频编码方案中的一个关键模块,用于实现显著的压缩性能差距。在对名为通用视频编码(VVC)的未来视频编码标准的探索中,引入了一种新的四叉树二叉树(QTBT)块划分结构。除了高效视频编码(HEVC)标准中定义的QT块划分之外,还启用了新的水平和垂直BT划分,这与HEVC相比大幅增加了编码时间。在本文中,我们提出了一种基于机器学习(ML)方法的轻量级且可调节的QTBT划分方案。所提出的解决方案使用随机森林分类器为每个编码块确定最可能的划分模式。为了最小化误分类引起的编码损失,在所提出的解决方案中引入了分类器决策的风险区间。通过改变风险区间的大小,实现了编码复杂度降低与编码损失之间的可调权衡。在所提出的解决方案在JEM-7.0软件中实现,在随机访问(RA)编码配置下,编码复杂度降低范围从30平均仅0.7%到3.0%的Bjøntegaard Delta比特率(BDBR)增加,随机森林引入的开销非常小。基于随机森林分类器的所提出的解决方案在VTM-5.0软件下对于降低多类型树(MTT)划分方案的复杂度也很有效,平均复杂度降低范围从25%到61%,仅BD-BR增加0.4%到2.2%。

相似文献

1
Tunable VVC Frame Partitioning based on Lightweight Machine Learning.基于轻量级机器学习的可调式可变视频编码(VVC)帧分区
IEEE Trans Image Process. 2019 Sep 6. doi: 10.1109/TIP.2019.2938670.
2
Partition Map Prediction for Fast Block Partitioning in VVC Intra-Frame Coding.VVC 帧内编码中快速分块的分区图预测。
IEEE Trans Image Process. 2023;32:2237-2251. doi: 10.1109/TIP.2023.3266165. Epub 2023 Apr 21.
3
Probabilistic Decision Based Block Partitioning for Future Video Coding.基于概率决策的未来视频编码分块。
IEEE Trans Image Process. 2018 Mar;27(3):1475-1486. doi: 10.1109/TIP.2017.2778564. Epub 2017 Nov 29.
4
A Fast Decision Algorithm for VVC Intra-Coding Based on Texture Feature and Machine Learning.基于纹理特征和机器学习的 VVC 帧内编码快速决策算法。
Comput Intell Neurosci. 2022 Sep 13;2022:7675749. doi: 10.1155/2022/7675749. eCollection 2022.
5
DeepQTMT: A Deep Learning Approach for Fast QTMT-Based CU Partition of Intra-Mode VVC.深度QTMT:一种基于深度学习的用于帧内模式VVC的快速基于QTMT的CU划分方法。
IEEE Trans Image Process. 2021;30:5377-5390. doi: 10.1109/TIP.2021.3083447. Epub 2021 Jun 3.
6
Extended Coding Unit Partitioning for Future Video Coding.面向未来视频编码的扩展编码单元划分
IEEE Trans Image Process. 2019 Nov 28. doi: 10.1109/TIP.2019.2955238.
7
A Fast Algorithm for Intra-Frame Versatile Video Coding Based on Edge Features.一种基于边缘特征的帧内通用视频编码快速算法。
Sensors (Basel). 2023 Jul 7;23(13):6244. doi: 10.3390/s23136244.
8
Decision tree accelerated CTU partition algorithm for intra prediction in versatile video coding.决策树加速 CTU 分区算法在通用视频编码中的帧内预测。
PLoS One. 2021 Nov 8;16(11):e0258890. doi: 10.1371/journal.pone.0258890. eCollection 2021.
9
Temporal Prediction Model-Based Fast Inter CU Partition for Versatile Video Coding.基于时域预测模型的灵活视频编码快速交叉 CU 分区。
Sensors (Basel). 2022 Oct 12;22(20):7741. doi: 10.3390/s22207741.
10
Object-Cooperated Ternary Tree Partitioning Decision Method for Versatile Video Coding.面向多功能视频编码的目标协作三元树分割决策方法。
Sensors (Basel). 2022 Aug 23;22(17):6328. doi: 10.3390/s22176328.

引用本文的文献

1
Temporal Prediction Model-Based Fast Inter CU Partition for Versatile Video Coding.基于时域预测模型的灵活视频编码快速交叉 CU 分区。
Sensors (Basel). 2022 Oct 12;22(20):7741. doi: 10.3390/s22207741.
2
Machine Learning for Multimedia Communications.多媒体通信中的机器学习。
Sensors (Basel). 2022 Jan 21;22(3):819. doi: 10.3390/s22030819.
3
Complexity Analysis of a Versatile Video Coding Decoder over Embedded Systems and General Purpose Processors.复杂视频编码解码器在嵌入式系统和通用处理器上的分析。
Sensors (Basel). 2021 May 11;21(10):3320. doi: 10.3390/s21103320.
4
Fast Sample Adaptive Offset Jointly Based on HOG Features and Depth Information for VVC in Visual Sensor Networks.基于 HOG 特征和深度信息的快速样本自适应偏移联合算法用于视觉传感器网络中的 VVC
Sensors (Basel). 2020 Nov 26;20(23):6754. doi: 10.3390/s20236754.