• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MDEformer:用于压缩视频质量增强的混合差分方程启发式Transformer

MDEformer: Mixed Difference Equation Inspired Transformer for Compressed Video Quality Enhancement.

作者信息

Zhang Mingjin, Bai Haichen, Shang Wenteng, Guo Jie, Li Yunsong, Gao Xinbo

出版信息

IEEE Trans Neural Netw Learn Syst. 2025 Feb;36(2):2410-2422. doi: 10.1109/TNNLS.2024.3354982. Epub 2025 Feb 6.

DOI:10.1109/TNNLS.2024.3354982
PMID:38285580
Abstract

Deep learning methods have achieved impressive performance in compressed video quality enhancement tasks. However, these methods rely excessively on practical experience by manually designing the network structure and do not fully exploit the potential of the feature information contained in the video sequences, i.e., not taking full advantage of the multiscale similarity of the compressed artifact information and not seriously considering the impact of the partition boundaries in the compressed video on the overall video quality. In this article, we propose a novel Mixed Difference Equation inspired Transformer (MDEformer) for compressed video quality enhancement, which provides a relatively reliable principle to guide the network design and yields a new insight into the interpretable transformer. Specifically, drawing on the graphical concept of the mixed difference equation (MDE), we utilize multiple cross-layer cross-attention aggregation (CCA) modules to establish long-range dependencies between encoders and decoders of the transformer, where partition boundary smoothing (PBS) modules are inserted as feedforward networks. The CCA module can make full use of the multiscale similarity of compression artifacts to effectively remove compression artifacts, and recover the texture and detail information of the frame. The PBS module leverages the sensitivity of smoothing convolution to partition boundaries to eliminate the impact of partition boundaries on the quality of compressed video and improve its overall quality, while not having too much impacts on non-boundary pixels. Extensive experiments on the MFQE 2.0 dataset demonstrate that the proposed MDEformer can eliminate compression artifacts for improving the quality of the compressed video, and surpasses the state-of-the-arts (SOTAs) in terms of both objective metrics and visual quality.

摘要

深度学习方法在压缩视频质量增强任务中取得了令人瞩目的性能。然而,这些方法过度依赖通过手动设计网络结构的实践经验,没有充分挖掘视频序列中包含的特征信息的潜力,即没有充分利用压缩伪像信息的多尺度相似性,也没有认真考虑压缩视频中的分区边界对整体视频质量的影响。在本文中,我们提出了一种用于压缩视频质量增强的新型混合差分方程启发式Transformer(MDEformer),它为指导网络设计提供了一个相对可靠的原则,并为可解释的Transformer带来了新的见解。具体来说,借鉴混合差分方程(MDE)的图形概念,我们利用多个跨层交叉注意力聚合(CCA)模块在Transformer的编码器和解码器之间建立长程依赖关系,其中插入分区边界平滑(PBS)模块作为前馈网络。CCA模块可以充分利用压缩伪像的多尺度相似性来有效去除压缩伪像,并恢复帧的纹理和细节信息。PBS模块利用平滑卷积对分区边界的敏感性来消除分区边界对压缩视频质量的影响并提高其整体质量,同时对非边界像素影响不大。在MFQE 2.0数据集上进行的大量实验表明,所提出的MDEformer可以消除压缩伪像以提高压缩视频的质量,并且在客观指标和视觉质量方面都超过了现有技术(SOTAs)。

相似文献

1
MDEformer: Mixed Difference Equation Inspired Transformer for Compressed Video Quality Enhancement.MDEformer:用于压缩视频质量增强的混合差分方程启发式Transformer
IEEE Trans Neural Netw Learn Syst. 2025 Feb;36(2):2410-2422. doi: 10.1109/TNNLS.2024.3354982. Epub 2025 Feb 6.
2
MFQE 2.0: A New Approach for Multi-Frame Quality Enhancement on Compressed Video.MFQE 2.0:一种用于压缩视频多帧质量增强的新方法。
IEEE Trans Pattern Anal Mach Intell. 2021 Mar;43(3):949-963. doi: 10.1109/TPAMI.2019.2944806. Epub 2021 Feb 4.
3
Fast-MFQE: A Fast Approach for Multi-Frame Quality Enhancement on Compressed Video.快速多帧量化增强(Fast-MFQE):一种用于压缩视频多帧质量增强的快速方法。
Sensors (Basel). 2023 Aug 17;23(16):7227. doi: 10.3390/s23167227.
4
PixRevive: Latent Feature Diffusion Model for Compressed Video Quality Enhancement.PixRevive:用于压缩视频质量增强的潜在特征扩散模型
Sensors (Basel). 2024 Mar 16;24(6):1907. doi: 10.3390/s24061907.
5
Quality evaluation of motion-compensated edge artifacts in compressed video.压缩视频中运动补偿边缘伪像的质量评估
IEEE Trans Image Process. 2007 Apr;16(4):943-56. doi: 10.1109/tip.2007.891778.
6
Edge-Oriented Compressed Video Super-Resolution.面向边缘的压缩视频超分辨率
Sensors (Basel). 2023 Dec 28;24(1):170. doi: 10.3390/s24010170.
7
Efficient Transformer-Based Compressed Video Modeling via Informative Patch Selection.通过信息性补丁选择实现基于高效Transformer的压缩视频建模
Sensors (Basel). 2022 Dec 26;23(1):244. doi: 10.3390/s23010244.
8
ETU-Net: edge enhancement-guided U-Net with transformer for skin lesion segmentation.ETU-Net:基于边缘增强引导的 U-Net 与 Transformer 的皮肤病变分割。
Phys Med Biol. 2023 Dec 22;69(1). doi: 10.1088/1361-6560/ad13d2.
9
DSTAN: A Deformable Spatial-temporal Attention Network with Bidirectional Sequence Feature Refinement for Speckle Noise Removal in Thyroid Ultrasound Video.DSTAN:一种具有双向序列特征细化的可变形时空注意力网络,用于去除甲状腺超声视频中的斑点噪声。
J Imaging Inform Med. 2024 Dec;37(6):3264-3281. doi: 10.1007/s10278-023-00935-5. Epub 2024 Jun 5.
10
Compressed Domain Deep Video Super-Resolution.压缩域深度视频超分辨率
IEEE Trans Image Process. 2021;30:7156-7169. doi: 10.1109/TIP.2021.3101826. Epub 2021 Aug 12.