
Iterative Training of Neural Networks for Intra Prediction.

Publication Information

IEEE Trans Image Process. 2021;30:697-711. doi: 10.1109/TIP.2020.3038348. Epub 2020 Dec 4.

Abstract

This paper presents an iterative training scheme for neural networks performing intra prediction in a block-based image and video codec. First, the neural networks are trained on blocks arising from the codec's partitioning of images, each block paired with its context. Then, iteratively, blocks are collected from the partitioning of images by the codec that includes the neural networks trained at the previous iteration, each paired with its context, and the neural networks are retrained on the new pairs. Thanks to this training, the neural networks learn intra prediction functions that both differ from those already in the initial codec and improve the codec's rate-distortion performance. Moreover, the iterative process allows the design of training data cleansings essential for the neural network training. When the iteratively trained neural networks are put into H.265 (HM-16.15), a mean BD-rate reduction of -4.2% is obtained, i.e. -1.8% beyond the state of the art. When they are moved into H.266 (VTM-5.0), the mean BD-rate reduction reaches -1.9%.
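The iterative scheme described in the abstract can be sketched in a few lines. The sketch below is a hypothetical toy, not the paper's HM/VTM integration: `partition_blocks` stands in for the codec's rate-distortion-driven partitioning (here just fixed-size splitting), and `train` stands in for gradient-based neural network training (here a trivial mean predictor). Only the loop structure, collecting (context, block) pairs with the current codec and retraining on them, mirrors the paper.

```python
def partition_blocks(images, predictor):
    """Toy stand-in for the codec's partitioning pass.

    Splits each "image" (a flat list of samples) into fixed-size
    blocks of 4 and pairs each block with its context (the 4
    preceding samples). In the paper, the pairs come from the codec
    itself, which at iteration k >= 1 already contains the predictor
    trained at iteration k - 1, hence the (unused here) argument.
    """
    pairs = []
    for img in images:
        for i in range(0, len(img), 4):
            context = img[max(0, i - 4):i]
            block = img[i:i + 4]
            pairs.append((context, block))
    return pairs


def train(pairs):
    """Toy "neural network" training: returns a mean predictor.

    Stands in for gradient-based training on (context, block) pairs;
    it predicts every block sample as the mean over all training
    block samples, ignoring the context.
    """
    samples = [s for _, block in pairs for s in block]
    mean = sum(samples) / len(samples)
    return lambda context: [mean] * 4


def iterative_training(images, n_iters=3):
    """The iterative loop from the abstract:

    1. Collect (context, block) pairs from the codec partitioning.
    2. Train the predictor on those pairs.
    3. Re-collect pairs with the codec that now includes the newly
       trained predictor, retrain, and repeat.
    """
    predictor = None
    for _ in range(n_iters):
        pairs = partition_blocks(images, predictor)
        predictor = train(pairs)
    return predictor
```

In the paper, step 3 is what lets the networks see blocks produced by a partitioning that already accounts for their own predictions, which is also where the training-data cleansing is applied.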

