• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

结合渐进式再思考和协作学习:一种用于环内滤波的深度框架。

Combining Progressive Rethinking and Collaborative Learning: A Deep Framework for In-Loop Filtering.

出版信息

IEEE Trans Image Process. 2021;30:4198-4211. doi: 10.1109/TIP.2021.3068638. Epub 2021 Apr 12.

DOI:10.1109/TIP.2021.3068638
PMID:33798081
Abstract

In this paper, we aim to address issues of (1) joint spatial-temporal modeling and (2) side information injection for deep-learning based in-loop filter. For (1), we design a deep network with both progressive rethinking and collaborative learning mechanisms to improve quality of the reconstructed intra-frames and inter-frames, respectively. For intra coding, a Progressive Rethinking Network (PRN) is designed to simulate the human decision mechanism for effective spatial modeling. Our designed block introduces an additional inter-block connection to bypass a high-dimensional informative feature before the bottleneck module across blocks to review the complete past memorized experiences and rethinks progressively. For inter coding, the current reconstructed frame interacts with reference frames (peak quality frame and the nearest adjacent frame) collaboratively at the feature level. For (2), we extract both intra-frame and inter-frame side information for better context modeling. A coarse-to-fine partition map based on HEVC partition trees is built as the intra-frame side information. Furthermore, the warped features of the reference frames are offered as the inter-frame side information. Our PRN with intra-frame side information provides 9.0% BD-rate reduction on average compared to HEVC baseline under All-intra (AI) configuration. While under Low-Delay B (LDB), Low-Delay P (LDP) and Random Access (RA) configuration, our PRN with inter-frame side information provides 9.0%, 10.6% and 8.0% BD-rate reduction on average respectively. Our project webpage is https://dezhao-wang.github.io/PRN-v2/.

摘要

在本文中,我们旨在解决基于深度学习的环路滤波器中的(1)联合时空建模和(2)侧信息注入问题。对于(1),我们设计了一个具有渐进式再思考和协作学习机制的深度网络,分别提高了重建的 Intra 帧和 Inter 帧的质量。对于 Intra 编码,设计了一个 Progressive Rethinking Network (PRN) 来模拟人类决策机制,进行有效的空间建模。我们设计的模块在块之间引入了额外的块间连接,在瓶颈模块之前绕过高维信息特征,以回顾完整的过去记忆经验,并逐步重新思考。对于 Inter 编码,当前重建的帧在特征级别上与参考帧(峰值质量帧和最近邻帧)协作交互。对于(2),我们提取 Intra 帧和 Inter 帧侧信息以进行更好的上下文建模。基于 HEVC 分区树构建了一个从粗到细的分区图作为 Intra 帧侧信息。此外,还提供了参考帧的扭曲特征作为 Inter 帧侧信息。在 All-intra (AI) 配置下,具有 Intra 帧侧信息的 PRN 与 HEVC 基线相比平均减少了 9.0%的 BD 率。而在 Low-Delay B (LDB)、Low-Delay P (LDP) 和 Random Access (RA) 配置下,具有 Inter 帧侧信息的 PRN 平均分别减少了 9.0%、10.6%和 8.0%的 BD 率。我们的项目网页是 https://dezhao-wang.github.io/PRN-v2/。

相似文献

1
Combining Progressive Rethinking and Collaborative Learning: A Deep Framework for In-Loop Filtering.结合渐进式再思考和协作学习:一种用于环内滤波的深度框架。
IEEE Trans Image Process. 2021;30:4198-4211. doi: 10.1109/TIP.2021.3068638. Epub 2021 Apr 12.
2
A Deep Learning Approach for Multi-Frame In-Loop Filter of HEVC.一种用于高效视频编码(HEVC)多帧帧内循环滤波器的深度学习方法。
IEEE Trans Image Process. 2019 Nov;28(11):5663-5678. doi: 10.1109/TIP.2019.2921877. Epub 2019 Jun 14.
3
Deep Learning Post-Filtering Using Multi-Head Attention and Multiresolution Feature Fusion for Image and Intra-Video Quality Enhancement.基于多头注意力和多分辨率特征融合的深度学习后滤波技术在图像和视频内质量增强中的应用。
Sensors (Basel). 2022 Feb 10;22(4):1353. doi: 10.3390/s22041353.
4
Enhanced Motion-Compensated Video Coding With Deep Virtual Reference Frame Generation.基于深度虚拟参考帧生成的增强运动补偿视频编码
IEEE Trans Image Process. 2019 Oct;28(10):4832-4844. doi: 10.1109/TIP.2019.2913545. Epub 2019 May 2.
5
VNVC: A Versatile Neural Video Coding Framework for Efficient Human-Machine Vision.VNVC:一种用于高效人机视觉的通用神经视频编码框架。
IEEE Trans Pattern Anal Mach Intell. 2024 Jul;46(7):4579-4596. doi: 10.1109/TPAMI.2024.3356548. Epub 2024 Jun 5.
6
Context-adaptive based CU processing for 3D-HEVC.用于3D-HEVC的基于上下文自适应的CU处理
PLoS One. 2017 Feb 9;12(2):e0171018. doi: 10.1371/journal.pone.0171018. eCollection 2017.
7
Rate distortion optimization for H.264 interframe coding: a general framework and algorithms.用于H.264帧间编码的率失真优化:通用框架与算法
IEEE Trans Image Process. 2007 Jul;16(7):1774-84. doi: 10.1109/tip.2007.896685.
8
Neural Reference Synthesis for Inter Frame Coding.用于帧间编码的神经参考合成
IEEE Trans Image Process. 2022;31:773-787. doi: 10.1109/TIP.2021.3134465. Epub 2021 Dec 28.
9
Video compression for lossy packet networks with mode switching and a dual-frame buffer.用于具有模式切换和双帧缓冲区的有损分组网络的视频压缩
IEEE Trans Image Process. 2004 Jul;13(7):885-97. doi: 10.1109/tip.2004.828429.
10
MPCNet: Compressed multi-view video restoration via motion-parallax complementation network.MPCNet:通过运动视差互补网络进行压缩多视图视频恢复
Neural Netw. 2023 Oct;167:601-614. doi: 10.1016/j.neunet.2023.08.037. Epub 2023 Sep 9.

引用本文的文献

1
Local Adaptive Image Filtering Based on Recursive Dilation Segmentation.基于递归膨胀分割的局部自适应图像滤波。
Sensors (Basel). 2023 Jun 21;23(13):5776. doi: 10.3390/s23135776.
2
VVC In-Loop Filtering Based on Deep Convolutional Neural Network.基于深度卷积神经网络的 VVC 环内滤波。
Comput Intell Neurosci. 2021 Jul 7;2021:9912839. doi: 10.1155/2021/9912839. eCollection 2021.