IEEE Trans Image Process. 2021;30:4198-4211. doi: 10.1109/TIP.2021.3068638. Epub 2021 Apr 12.
In this paper, we aim to address issues of (1) joint spatial-temporal modeling and (2) side information injection for deep-learning based in-loop filter. For (1), we design a deep network with both progressive rethinking and collaborative learning mechanisms to improve quality of the reconstructed intra-frames and inter-frames, respectively. For intra coding, a Progressive Rethinking Network (PRN) is designed to simulate the human decision mechanism for effective spatial modeling. Our designed block introduces an additional inter-block connection to bypass a high-dimensional informative feature before the bottleneck module across blocks to review the complete past memorized experiences and rethinks progressively. For inter coding, the current reconstructed frame interacts with reference frames (peak quality frame and the nearest adjacent frame) collaboratively at the feature level. For (2), we extract both intra-frame and inter-frame side information for better context modeling. A coarse-to-fine partition map based on HEVC partition trees is built as the intra-frame side information. Furthermore, the warped features of the reference frames are offered as the inter-frame side information. Our PRN with intra-frame side information provides 9.0% BD-rate reduction on average compared to HEVC baseline under All-intra (AI) configuration. While under Low-Delay B (LDB), Low-Delay P (LDP) and Random Access (RA) configuration, our PRN with inter-frame side information provides 9.0%, 10.6% and 8.0% BD-rate reduction on average respectively. Our project webpage is https://dezhao-wang.github.io/PRN-v2/.
在本文中,我们旨在解决基于深度学习的环路滤波器中的(1)联合时空建模和(2)侧信息注入问题。对于(1),我们设计了一个具有渐进式再思考和协作学习机制的深度网络,分别提高了重建的 Intra 帧和 Inter 帧的质量。对于 Intra 编码,设计了一个 Progressive Rethinking Network (PRN) 来模拟人类决策机制,进行有效的空间建模。我们设计的模块在块之间引入了额外的块间连接,在瓶颈模块之前绕过高维信息特征,以回顾完整的过去记忆经验,并逐步重新思考。对于 Inter 编码,当前重建的帧在特征级别上与参考帧(峰值质量帧和最近邻帧)协作交互。对于(2),我们提取 Intra 帧和 Inter 帧侧信息以进行更好的上下文建模。基于 HEVC 分区树构建了一个从粗到细的分区图作为 Intra 帧侧信息。此外,还提供了参考帧的扭曲特征作为 Inter 帧侧信息。在 All-intra (AI) 配置下,具有 Intra 帧侧信息的 PRN 与 HEVC 基线相比平均减少了 9.0%的 BD 率。而在 Low-Delay B (LDB)、Low-Delay P (LDP) 和 Random Access (RA) 配置下,具有 Inter 帧侧信息的 PRN 平均分别减少了 9.0%、10.6%和 8.0%的 BD 率。我们的项目网页是 https://dezhao-wang.github.io/PRN-v2/。