

Learning-Based Just-Noticeable-Quantization-Distortion Modeling for Perceptual Video Coding.

Publication

IEEE Trans Image Process. 2018 Jul;27(7):3178-3193. doi: 10.1109/TIP.2018.2818439.

Abstract

Conventional predictive video coding-based approaches are reaching the limit of their potential coding efficiency improvements, because of severely increasing computational complexity. As an alternative approach, perceptual video coding (PVC) has attempted to achieve high coding efficiency by eliminating perceptual redundancy, using just-noticeable-distortion (JND) directed PVC. The previous JNDs were modeled by adding white Gaussian noise or specific signal patterns into the original images, which were not appropriate for finding JND thresholds due to distortion with energy reduction. In this paper, we present a novel discrete cosine transform-based energy-reduced JND model, called ERJND, that is more suitable for JND-based PVC schemes. Then, the proposed ERJND model is extended to two learning-based just-noticeable-quantization-distortion (JNQD) models as preprocessing that can be applied to perceptual video coding. The two JNQD models can automatically adjust JND levels based on given quantization step sizes. One of the two JNQD models, called LR-JNQD, is based on linear regression and determines the model parameters for JNQD from extracted handcrafted features. The other JNQD model, called CNN-JNQD, is based on a convolutional neural network (CNN). To the best of our knowledge, our paper is the first approach to automatically adjust JND levels according to quantization step sizes for preprocessing the input to video encoders. In experiments, both the LR-JNQD and CNN-JNQD models were applied to high efficiency video coding (HEVC) and yielded maximum (average) bitrate reductions of 38.51% (10.38%) and 67.88% (24.91%), respectively, with little subjective video quality degradation, compared with the input without preprocessing applied.
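The LR-JNQD idea described above can be sketched in a few lines: a linear-regression model maps per-block handcrafted features plus the quantization step size to a JNQD threshold, and residual detail below that threshold is suppressed before the frame reaches the encoder. The sketch below is purely illustrative under assumed toy features (block mean, variance, and Q step) and synthetic training data; it is not the authors' actual feature set or model.

```python
import numpy as np

# Illustrative sketch of the LR-JNQD concept (hypothetical features and data,
# not the paper's actual design): linear regression predicts a per-block
# JNQD threshold from handcrafted features plus the quantization step size.

rng = np.random.default_rng(0)

def block_features(block, q_step):
    """Toy handcrafted features: mean luminance, local variance, Q step."""
    return np.array([block.mean(), block.var(), q_step])

# Synthetic training data: feature vectors -> target JNQD thresholds.
X = rng.uniform(0, 255, size=(200, 3))
true_w = np.array([0.01, 0.002, 0.05])       # assumed ground-truth weights
y = X @ true_w + rng.normal(0, 0.1, size=200)

# Fit the linear-regression parameters by least squares.
w, *_ = np.linalg.lstsq(X, y, rcond=None)

# Apply: predict a JNQD threshold for one 8x8 block at a given Q step,
# then zero out residual detail below that threshold (the preprocessing
# step that removes perceptually redundant signal before encoding).
block = rng.uniform(0, 255, size=(8, 8))
q_step = 20.0
jnqd = float(block_features(block, q_step) @ w)

mean = block.mean()
residual = block - mean
suppressed = np.where(np.abs(residual) < jnqd, 0.0, residual)
preprocessed = mean + suppressed
```

Because only residuals smaller than the predicted threshold are removed, no pixel changes by more than the JNQD level, which is the sense in which the distortion is intended to stay just below visibility.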

