• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

降低高效视频编码(HEVC)的复杂度:一种深度学习方法。

Reducing Complexity of HEVC: A Deep Learning Approach.

作者信息

Xu Mai, Li Tianyi, Wang Zulin, Deng Xin, Yang Ren, Guan Zhenyu

出版信息

IEEE Trans Image Process. 2018 Jun 13. doi: 10.1109/TIP.2018.2847035.

DOI:10.1109/TIP.2018.2847035
PMID:29994256
Abstract

High Efficiency Video Coding (HEVC) significantly reduces bit-rates over the preceding H.264 standard but at the expense of extremely high encoding complexity. In HEVC, the quad-tree partition of coding unit (CU) consumes a large proportion of the HEVC encoding complexity, due to the brute-force search for rate-distortion optimization (RDO). Therefore, this paper proposes a deep learning approach to predict the CU partition for reducing the HEVC complexity at both intra-and inter-modes, which is based on convolutional neural network (CNN) and long-and short-term memory (LSTM) network. First, we establish a large-scale database including substantial CU partition data for HEVC intra-and inter-modes. This enables deep learning on the CU partition. Second, we represent the CU partition of an entire coding tree unit (CTU) in the form of a hierarchical CU partition map (HCPM). Then, we propose an early-terminated hierarchical CNN (ETH-CNN) for learning to predict the HCPM. Consequently, the encoding complexity of intra-mode HEVC can be drastically reduced by replacing the brute-force search with ETH-CNN to decide the CU partition. Third, an early-terminated hierarchical LSTM (ETH-LSTM) is proposed to learn the temporal correlation of the CU partition. Then, we combine ETH-LSTM and ETH-CNN to predict the CU partition for reducing the HEVC complexity at inter-mode. Finally, experimental results show that our approach outperforms other state-of-the-art approaches in reducing the HEVC complexity at both intra-and inter-modes.

摘要

高效视频编码(HEVC)相较于之前的H.264标准显著降低了比特率,但代价是编码复杂度极高。在HEVC中,由于对率失真优化(RDO)进行强力搜索,编码单元(CU)的四叉树划分消耗了HEVC编码复杂度的很大一部分。因此,本文提出一种深度学习方法,用于预测CU划分,以降低HEVC在帧内和帧间模式下的复杂度,该方法基于卷积神经网络(CNN)和长短时记忆(LSTM)网络。首先,我们建立一个大规模数据库,其中包含用于HEVC帧内和帧间模式的大量CU划分数据。这使得能够对CU划分进行深度学习。其次,我们以分层CU划分图(HCPM)的形式表示整个编码树单元(CTU)的CU划分。然后,我们提出一种提前终止的分层CNN(ETH-CNN)用于学习预测HCPM。因此,通过用ETH-CNN替代强力搜索来决定CU划分,帧内模式HEVC的编码复杂度可大幅降低。第三,提出一种提前终止的分层LSTM(ETH-LSTM)来学习CU划分的时间相关性。然后,我们将ETH-LSTM和ETH-CNN相结合来预测CU划分,以降低帧间模式下的HEVC复杂度。最后,实验结果表明,我们的方法在降低HEVC帧内和帧间模式的复杂度方面优于其他现有最先进方法。

相似文献

1
Reducing Complexity of HEVC: A Deep Learning Approach.降低高效视频编码(HEVC)的复杂度:一种深度学习方法。
IEEE Trans Image Process. 2018 Jun 13. doi: 10.1109/TIP.2018.2847035.
2
DeepQTMT: A Deep Learning Approach for Fast QTMT-Based CU Partition of Intra-Mode VVC.深度QTMT:一种基于深度学习的用于帧内模式VVC的快速基于QTMT的CU划分方法。
IEEE Trans Image Process. 2021;30:5377-5390. doi: 10.1109/TIP.2021.3083447. Epub 2021 Jun 3.
3
Fast CU Partition Algorithm for Intra Frame Coding Based on Joint Texture Classification and CNN.基于联合纹理分类与卷积神经网络的帧内编码快速CU划分算法
Sensors (Basel). 2023 Sep 15;23(18):7923. doi: 10.3390/s23187923.
4
Low-Complexity Error Resilient HEVC Video Coding: A Deep Learning Approach.低复杂度抗误码高效视频编码:一种深度学习方法。
IEEE Trans Image Process. 2021;30:1245-1260. doi: 10.1109/TIP.2020.3043124. Epub 2020 Dec 21.
5
Learned Fast HEVC Intra Coding.学习型快速高效视频编码(HEVC)帧内编码
IEEE Trans Image Process. 2020 Mar 30. doi: 10.1109/TIP.2020.2982832.
6
CU Partition Mode Decision for HEVC Hardwired Intra Encoder Using Convolution Neural Network.基于卷积神经网络的 HEVC 硬件内编码 CU 划分模式决策
IEEE Trans Image Process. 2016 Nov;25(11):5088-5103. doi: 10.1109/TIP.2016.2601264. Epub 2016 Aug 18.
7
A CU-Level Rate and Distortion Estimation Scheme for RDO of Hardware-Friendly HEVC Encoders Using Low-Complexity Integer DCTs.一种使用低复杂度整数 DCT 的硬件友好型 HEVC 编码器 RDO 的 CU 级率失真估计方案。
IEEE Trans Image Process. 2016 Aug;25(8):3787-800. doi: 10.1109/TIP.2016.2579559. Epub 2016 Jun 9.
8
Decision tree accelerated CTU partition algorithm for intra prediction in versatile video coding.决策树加速 CTU 分区算法在通用视频编码中的帧内预测。
PLoS One. 2021 Nov 8;16(11):e0258890. doi: 10.1371/journal.pone.0258890. eCollection 2021.
9
Efficient Intra Mode Selection for Depth-Map Coding Utilizing Spatiotemporal, Inter-Component and Inter-View Correlations in 3D-HEVC.利用 3D-HEVC 中的时空、分量间和视图间相关性进行深度图编码的高效帧内模式选择。
IEEE Trans Image Process. 2018 Sep;27(9):4195-4206. doi: 10.1109/TIP.2018.2837379.
10
Online Learning-Based Multi-Stage Complexity Control for Live Video Coding.基于在线学习的实时视频编码多阶段复杂度控制
IEEE Trans Image Process. 2021;30:641-656. doi: 10.1109/TIP.2020.3036766. Epub 2020 Dec 4.

引用本文的文献

1
Fast CU Partition Algorithm for Intra Frame Coding Based on Joint Texture Classification and CNN.基于联合纹理分类与卷积神经网络的帧内编码快速CU划分算法
Sensors (Basel). 2023 Sep 15;23(18):7923. doi: 10.3390/s23187923.
2
Temporal Prediction Model-Based Fast Inter CU Partition for Versatile Video Coding.基于时域预测模型的灵活视频编码快速交叉 CU 分区。
Sensors (Basel). 2022 Oct 12;22(20):7741. doi: 10.3390/s22207741.
3
A Fast Decision Algorithm for VVC Intra-Coding Based on Texture Feature and Machine Learning.基于纹理特征和机器学习的 VVC 帧内编码快速决策算法。
Comput Intell Neurosci. 2022 Sep 13;2022:7675749. doi: 10.1155/2022/7675749. eCollection 2022.
4
Gray wolf optimization-extreme learning machine approach for diabetic retinopathy detection.基于灰狼优化-极限学习机的糖尿病视网膜病变检测方法。
Front Public Health. 2022 Aug 1;10:925901. doi: 10.3389/fpubh.2022.925901. eCollection 2022.
5
ξ: An AI-Based Data Analytics Scheme for COVID-19 Prediction and Economy Boosting.ξ:一种基于人工智能的用于新冠疫情预测和经济提振的数据分析方案。
IEEE Internet Things J. 2020 Dec 25;8(21):15977-15989. doi: 10.1109/JIOT.2020.3047539. eCollection 2021 Nov 1.
6
Industry 4.0 and Digitalisation in Healthcare.医疗保健领域的工业4.0与数字化
Materials (Basel). 2022 Mar 14;15(6):2140. doi: 10.3390/ma15062140.
7
Decision tree accelerated CTU partition algorithm for intra prediction in versatile video coding.决策树加速 CTU 分区算法在通用视频编码中的帧内预测。
PLoS One. 2021 Nov 8;16(11):e0258890. doi: 10.1371/journal.pone.0258890. eCollection 2021.
8
A new model for learning-based forecasting procedure by combining k-means clustering and time series forecasting algorithms.一种通过结合k均值聚类和时间序列预测算法的基于学习的预测程序的新模型。
PeerJ Comput Sci. 2021 Jun 2;7:e534. doi: 10.7717/peerj-cs.534. eCollection 2021.
9
Flash-Flood Potential Mapping Using Deep Learning, Alternating Decision Trees and Data Provided by Remote Sensing Sensors.利用深度学习、交替决策树和遥感传感器提供的数据进行洪水暴发潜力图绘制。
Sensors (Basel). 2021 Jan 4;21(1):280. doi: 10.3390/s21010280.
10
Optimised genetic algorithm-extreme learning machine approach for automatic COVID-19 detection.优化遗传算法-极限学习机方法自动检测 COVID-19。
PLoS One. 2020 Dec 15;15(12):e0242899. doi: 10.1371/journal.pone.0242899. eCollection 2020.