• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于分层深度学习的分区预测加速VP9帧内编码器

Speeding up VP9 Intra Encoder with Hierarchical Deep Learning Based Partition Prediction.

作者信息

Paul Somdyuti, Norkin Andrey, Bovik Alan C

出版信息

IEEE Trans Image Process. 2020 Jul 28;PP. doi: 10.1109/TIP.2020.3011270.

DOI:10.1109/TIP.2020.3011270
PMID:32746243
Abstract

In VP9 video codec, the sizes of blocks are decided during encoding by recursively partitioning 64×64 superblocks using rate-distortion optimization (RDO). This process is computationally intensive because of the combinatorial search space of possible partitions of a superblock. Here, we propose a deep learning based alternative framework to predict the intra-mode superblock partitions in the form of a four-level partition tree, using a hierarchical fully convolutional network (H-FCN). We created a large database of VP9 superblocks and the corresponding partitions to train an H-FCN model, which was subsequently integrated with the VP9 encoder to reduce the intra-mode encoding time. The experimental results establish that our approach speeds up intra-mode encoding by 69.7% on average, at the expense of a 1.71% increase in the Bjøntegaard-Delta bitrate (BD-rate). While VP9 provides several built-in speed levels which are designed to provide faster encoding at the expense of decreased rate-distortion performance, we find that our model is able to outperform the fastest recommended speed level of the reference VP9 encoder for the good quality intra encoding configuration, in terms of both speedup and BD-rate.

摘要

在VP9视频编解码器中,块的大小在编码期间通过使用率失真优化(RDO)对64×64的超块进行递归划分来确定。由于超块可能划分的组合搜索空间,这个过程计算量很大。在此,我们提出一种基于深度学习的替代框架,使用分层全卷积网络(H-FCN)以四级划分树的形式预测帧内模式超块划分。我们创建了一个包含VP9超块及其相应划分的大型数据库来训练H-FCN模型,随后将其与VP9编码器集成以减少帧内模式编码时间。实验结果表明,我们的方法平均将帧内模式编码速度提高了69.7%,代价是Bjøntegaard-Delta比特率(BD-rate)增加了1.71%。虽然VP9提供了几个内置速度级别,旨在以降低率失真性能为代价提供更快的编码,但我们发现,在加速和BD-rate方面,对于高质量帧内编码配置,我们的模型能够优于参考VP9编码器推荐的最快速度级别。

相似文献

1
Speeding up VP9 Intra Encoder with Hierarchical Deep Learning Based Partition Prediction.基于分层深度学习的分区预测加速VP9帧内编码器
IEEE Trans Image Process. 2020 Jul 28;PP. doi: 10.1109/TIP.2020.3011270.
2
Reducing Complexity of HEVC: A Deep Learning Approach.降低高效视频编码(HEVC)的复杂度:一种深度学习方法。
IEEE Trans Image Process. 2018 Jun 13. doi: 10.1109/TIP.2018.2847035.
3
Low-Complexity Error Resilient HEVC Video Coding: A Deep Learning Approach.低复杂度抗误码高效视频编码:一种深度学习方法。
IEEE Trans Image Process. 2021;30:1245-1260. doi: 10.1109/TIP.2020.3043124. Epub 2020 Dec 21.
4
DeepQTMT: A Deep Learning Approach for Fast QTMT-Based CU Partition of Intra-Mode VVC.深度QTMT:一种基于深度学习的用于帧内模式VVC的快速基于QTMT的CU划分方法。
IEEE Trans Image Process. 2021;30:5377-5390. doi: 10.1109/TIP.2021.3083447. Epub 2021 Jun 3.
5
Partition Map Prediction for Fast Block Partitioning in VVC Intra-Frame Coding.VVC 帧内编码中快速分块的分区图预测。
IEEE Trans Image Process. 2023;32:2237-2251. doi: 10.1109/TIP.2023.3266165. Epub 2023 Apr 21.
6
CU Partition Mode Decision for HEVC Hardwired Intra Encoder Using Convolution Neural Network.基于卷积神经网络的 HEVC 硬件内编码 CU 划分模式决策
IEEE Trans Image Process. 2016 Nov;25(11):5088-5103. doi: 10.1109/TIP.2016.2601264. Epub 2016 Aug 18.
7
Convex Hull Prediction for Adaptive Video Streaming by Recurrent Learning.基于循环学习的自适应视频流凸包预测
IEEE Trans Image Process. 2024;33:5114-5128. doi: 10.1109/TIP.2024.3455989. Epub 2024 Sep 19.
8
Fast Depth and Mode Decision in Intra Prediction for Quality SHVC.高质量SHVC帧内预测中的快速深度和模式决策
IEEE Trans Image Process. 2020 Apr 21. doi: 10.1109/TIP.2020.2988167.
9
Decision tree accelerated CTU partition algorithm for intra prediction in versatile video coding.决策树加速 CTU 分区算法在通用视频编码中的帧内预测。
PLoS One. 2021 Nov 8;16(11):e0258890. doi: 10.1371/journal.pone.0258890. eCollection 2021.
10
Probabilistic Decision Based Block Partitioning for Future Video Coding.基于概率决策的未来视频编码分块。
IEEE Trans Image Process. 2018 Mar;27(3):1475-1486. doi: 10.1109/TIP.2017.2778564. Epub 2017 Nov 29.