Content-Aware Convolutional Neural Network for In-Loop Filtering in High Efficiency Video Coding.

Authors

Jia Chuanmin, Wang Shiqi, Zhang Xinfeng, Wang Shanshe, Liu Jiaying, Pu Shiliang, Ma Siwei

Publication information

IEEE Trans Image Process. 2019 Jul;28(7):3343-3356. doi: 10.1109/TIP.2019.2896489. Epub 2019 Jan 31.

DOI:10.1109/TIP.2019.2896489

Abstract

Recently, convolutional neural network (CNN) has attracted tremendous attention and has achieved great success in many image processing tasks. In this paper, we focus on CNN technology combined with image restoration to facilitate video coding performance and propose the content-aware CNN based in-loop filtering for high-efficiency video coding (HEVC). In particular, we quantitatively analyze the structure of the proposed CNN model from multiple dimensions to make the model interpretable and optimal for CNN-based loop filtering. More specifically, each coding tree unit (CTU) is treated as an independent region for processing, such that the proposed content-aware multimodel filtering mechanism is realized by the restoration of different regions with different CNN models under the guidance of the discriminative network. To adapt the image content, the discriminative neural network is learned to analyze the content characteristics of each region for the adaptive selection of the deep learning model. The CTU level control is also enabled in the sense of rate-distortion optimization. To learn the CNN model, an iterative training method is proposed by simultaneously labeling filter categories at the CTU level and fine-tuning the CNN model parameters. The CNN based in-loop filter is implemented after sample adaptive offset in HEVC, and extensive experiments show that the proposed approach significantly improves the coding performance and achieves up to 10.0% bit-rate reduction. On average, 4.1%, 6.0%, 4.7%, and 6.0% bit-rate reduction can be obtained under all intra, low delay, low delay P, and random access configurations, respectively.
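
The CTU-level mechanism described in the abstract (a discriminative network picks one of several restoration models per CTU, and an encoder-side check decides whether the filtered CTU is kept) can be illustrated with a minimal sketch. This is not the authors' implementation. The mean filters stand in for the trained CNN models, the variance threshold stands in for the discriminative network, and the on/off decision compares distortion only, ignoring the rate term of a full rate-distortion cost. The names `MODELS`, `select_model`, `filter_frame`, the 64x64 CTU size, and the threshold value are all assumptions made for illustration.

```python
import numpy as np

CTU = 64  # assumed CTU size (HEVC allows up to 64x64)

def box_blur(block, k):
    """Stand-in 'restoration model': k x k mean filter with edge padding."""
    pad = k // 2
    padded = np.pad(block, pad, mode="edge")
    out = np.zeros_like(block, dtype=np.float64)
    for dy in range(k):
        for dx in range(k):
            out += padded[dy:dy + block.shape[0], dx:dx + block.shape[1]]
    return out / (k * k)

# Hypothetical model bank; the paper uses several trained CNNs here.
MODELS = [lambda b: box_blur(b, 3), lambda b: box_blur(b, 5)]

def select_model(block):
    """Stand-in for the discriminative network: choose a model index
    from a simple content statistic (variance) of the CTU."""
    return 0 if block.var() > 100.0 else 1

def filter_frame(recon, orig):
    """Filter a reconstructed frame CTU by CTU.

    Returns the filtered frame and, per CTU, an (enabled, model_index)
    pair that an encoder would have to signal to the decoder.
    """
    h, w = recon.shape
    out = recon.astype(np.float64).copy()
    flags = {}
    for y in range(0, h, CTU):
        for x in range(0, w, CTU):
            r = recon[y:y + CTU, x:x + CTU].astype(np.float64)
            o = orig[y:y + CTU, x:x + CTU].astype(np.float64)
            idx = select_model(r)
            cand = MODELS[idx](r)
            # Distortion-only on/off decision (rate term omitted).
            enabled = bool(np.sum((cand - o) ** 2) < np.sum((r - o) ** 2))
            if enabled:
                out[y:y + CTU, x:x + CTU] = cand
            flags[(y, x)] = (enabled, idx)
    return out, flags

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    original = np.tile(np.linspace(0.0, 255.0, 128), (128, 1))  # smooth ramp
    reconstructed = original + rng.normal(0.0, 4.0, size=original.shape)
    filtered, ctu_flags = filter_frame(reconstructed, original)
    print(ctu_flags)
```

Because a CTU is replaced only when the candidate lowers its squared error, the filtered frame in this sketch can never have higher total distortion than the unfiltered one, which is the purpose of the CTU-level control.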

Similar articles

Content-Aware Convolutional Neural Network for In-Loop Filtering in High Efficiency Video Coding.

IEEE Trans Image Process. 2019 Jul;28(7):3343-3356. doi: 10.1109/TIP.2019.2896489. Epub 2019 Jan 31.

Adaptive Deep Reinforcement Learning-Based In-Loop Filter for VVC.

IEEE Trans Image Process. 2021;30:5439-5451. doi: 10.1109/TIP.2021.3084345. Epub 2021 Jun 8.

A Deep Learning Approach for Multi-Frame In-Loop Filter of HEVC.

IEEE Trans Image Process. 2019 Nov;28(11):5663-5678. doi: 10.1109/TIP.2019.2921877. Epub 2019 Jun 14.

High Efficiency Video Coding (HEVC)-Based Surgical Telementoring System Using Shallow Convolutional Neural Network.

J Digit Imaging. 2019 Dec;32(6):1027-1043. doi: 10.1007/s10278-019-00206-2.

Reducing Complexity of HEVC: A Deep Learning Approach.

IEEE Trans Image Process. 2018 Jun 13. doi: 10.1109/TIP.2018.2847035.

Learning a Convolutional Neural Network for Image Compact-Resolution.

IEEE Trans Image Process. 2019 Mar;28(3):1092-1107. doi: 10.1109/TIP.2018.2872876. Epub 2018 Sep 28.

Invertibility-Driven Interpolation Filter for Video Coding.

IEEE Trans Image Process. 2019 Oct;28(10):4912-4925. doi: 10.1109/TIP.2019.2913092. Epub 2019 May 7.

Residual Highway Convolutional Neural Networks for in-loop Filtering in HEVC.

IEEE Trans Image Process. 2018 Aug;27(8):3827-3841. doi: 10.1109/TIP.2018.2815841.

S-CNN: Subcategory-Aware Convolutional Networks for Object Detection.

IEEE Trans Pattern Anal Mach Intell. 2018 Oct;40(10):2522-2528. doi: 10.1109/TPAMI.2017.2756936. Epub 2017 Sep 26.

Efficient In-loop Filtering Based on Enhanced Deep Convolutional Neural Networks for HEVC.

IEEE Trans Image Process. 2020 Mar 27. doi: 10.1109/TIP.2020.2982534.

Cited by

A novel multi-user collaborative cognitive radio spectrum sensing model: Based on a CNN-LSTM model.

PLoS One. 2025 Jan 15;20(1):e0316291. doi: 10.1371/journal.pone.0316291. eCollection 2025.

Attention-Based Bi-Prediction Network for Versatile Video Coding (VVC) over 5G Network.

Sensors (Basel). 2023 Feb 27;23(5):2631. doi: 10.3390/s23052631.

Deep Learning Post-Filtering Using Multi-Head Attention and Multiresolution Feature Fusion for Image and Intra-Video Quality Enhancement.

Sensors (Basel). 2022 Feb 10;22(4):1353. doi: 10.3390/s22041353.

Attention Networks for the Quality Enhancement of Light Field Images.

Sensors (Basel). 2021 May 7;21(9):3246. doi: 10.3390/s21093246.
