• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

压缩域深度视频超分辨率

Compressed Domain Deep Video Super-Resolution.

作者信息

Chen Peilin, Yang Wenhan, Wang Meng, Sun Long, Hu Kangkang, Wang Shiqi

出版信息

IEEE Trans Image Process. 2021;30:7156-7169. doi: 10.1109/TIP.2021.3101826. Epub 2021 Aug 12.

DOI:10.1109/TIP.2021.3101826
PMID:34370665
Abstract

Real-world video processing algorithms are often faced with the great challenges of processing the compressed videos instead of pristine videos. Despite the tremendous successes achieved in deep-learning based video super-resolution (SR), much less work has been dedicated to the SR of compressed videos. Herein, we propose a novel approach for compressed domain deep video SR by jointly leveraging the coding priors and deep priors. By exploiting the diverse and ready-made spatial and temporal coding priors (e.g., partition maps and motion vectors) extracted directly from the video bitstream in an effortless way, the video SR in the compressed domain allows us to accurately reconstruct the high resolution video with high flexibility and substantially economized computational complexity. More specifically, to incorporate the spatial coding prior, the Guided Spatial Feature Transform (GSFT) layer is proposed to modulate features of the prior with the guidance of the video information, making the prior features more fine-grained and content-adaptive. To incorporate the temporal coding prior, a guided soft alignment scheme is designed to generate local attention off-sets to compensate for decoded motion vectors. Our soft alignment scheme combines the merits of explicit and implicit motion modeling methods, rendering the alignment of features more effective for SR in terms of the computational complexity and robustness to inaccurate motion fields. Furthermore, to fully make use of the deep priors, the multi-scale fused features are generated from a scale-wise convolution reconstruction network for final SR video reconstruction. To promote the compressed domain video SR research, we build an extensive Compressed Videos with Coding Prior (CVCP) dataset, including compressed videos of diverse content and various coding priors extracted from the bitstream. Extensive experimental results show the effectiveness of coding priors in compressed domain video SR.

摘要

现实世界中的视频处理算法常常面临处理压缩视频而非原始视频的巨大挑战。尽管基于深度学习的视频超分辨率(SR)取得了巨大成功,但针对压缩视频超分辨率的研究却少得多。在此,我们提出了一种通过联合利用编码先验和深度先验来进行压缩域深度视频超分辨率的新方法。通过轻松利用直接从视频比特流中提取的多样且现成的空间和时间编码先验(例如,分区图和运动矢量),压缩域中的视频超分辨率使我们能够以高灵活性准确重建高分辨率视频,并大幅降低计算复杂度。更具体地说,为了纳入空间编码先验,我们提出了引导空间特征变换(GSFT)层,以在视频信息的引导下调制先验的特征,使先验特征更精细且适应内容。为了纳入时间编码先验,设计了一种引导软对齐方案来生成局部注意力偏移,以补偿解码后的运动矢量。我们的软对齐方案结合了显式和隐式运动建模方法的优点,在计算复杂度和对不准确运动场的鲁棒性方面,使特征对齐对于超分辨率更有效。此外,为了充分利用深度先验,从一个逐尺度卷积重建网络生成多尺度融合特征,用于最终的超分辨率视频重建。为了推动压缩域视频超分辨率研究,我们构建了一个广泛的带有编码先验的压缩视频(CVCP)数据集,包括具有不同内容和从比特流中提取的各种编码先验的压缩视频。大量实验结果表明了编码先验在压缩域视频超分辨率中的有效性。

相似文献

1
Compressed Domain Deep Video Super-Resolution.压缩域深度视频超分辨率
IEEE Trans Image Process. 2021;30:7156-7169. doi: 10.1109/TIP.2021.3101826. Epub 2021 Aug 12.
2
Edge-Oriented Compressed Video Super-Resolution.面向边缘的压缩视频超分辨率
Sensors (Basel). 2023 Dec 28;24(1):170. doi: 10.3390/s24010170.
3
Learning Temporal Dynamics for Video Super-Resolution: A Deep Learning Approach.学习视频超分辨率的时间动态:一种深度学习方法。
IEEE Trans Image Process. 2018 Mar 30. doi: 10.1109/TIP.2018.2820807.
4
Video Super-Resolution via a Spatio-Temporal Alignment Network.通过时空对齐网络实现视频超分辨率
IEEE Trans Image Process. 2022;31:1761-1773. doi: 10.1109/TIP.2022.3146625. Epub 2022 Feb 8.
5
FVC: An End-to-End Framework Towards Deep Video Compression in Feature Space.FVC:一种面向特征空间中深度视频压缩的端到端框架。
IEEE Trans Pattern Anal Mach Intell. 2023 Apr;45(4):4569-4585. doi: 10.1109/TPAMI.2022.3210652. Epub 2023 Mar 7.
6
Video Salient Object Detection via Fully Convolutional Networks.基于全卷积网络的视频显著目标检测
IEEE Trans Image Process. 2018;27(1):38-49. doi: 10.1109/TIP.2017.2754941.
7
Patch-Wise Spatial-Temporal Quality Enhancement for HEVC Compressed Video.用于HEVC压缩视频的逐块时空质量增强
IEEE Trans Image Process. 2021;30:6459-6472. doi: 10.1109/TIP.2021.3092949. Epub 2021 Jul 15.
8
Video Super-Resolution via Bidirectional Recurrent Convolutional Networks.基于双向循环卷积网络的视频超分辨率重建
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):1015-1028. doi: 10.1109/TPAMI.2017.2701380. Epub 2017 May 4.
9
Real-Time Video Super-Resolution with Spatio-Temporal Modeling and Redundancy-Aware Inference.基于时空建模和冗余感知推理的实时视频超分辨率
Sensors (Basel). 2023 Sep 14;23(18):7880. doi: 10.3390/s23187880.
10
MRI super-resolution reconstruction for MRI-guided adaptive radiotherapy using cascaded deep learning: In the presence of limited training data and unknown translation model.基于级联深度学习的 MRI 引导自适应放疗中 MRI 超分辨率重建:在有限的训练数据和未知的平移模型的情况下。
Med Phys. 2019 Sep;46(9):4148-4164. doi: 10.1002/mp.13717. Epub 2019 Aug 7.