• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PixRevive:用于压缩视频质量增强的潜在特征扩散模型

PixRevive: Latent Feature Diffusion Model for Compressed Video Quality Enhancement.

作者信息

Wang Weiran, Jing Minge, Fan Yibo, Weng Wei

机构信息

School of Microelectronics, Fudan University, Shanghai 200433, China.

Department of Liberal Arts and Science, Kanazawa University, Ishikawa 920-1192, Japan.

出版信息

Sensors (Basel). 2024 Mar 16;24(6):1907. doi: 10.3390/s24061907.

DOI:10.3390/s24061907
PMID:38544170
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10974212/
Abstract

In recent years, the rapid prevalence of high-definition video in Internet of Things (IoT) systems has been directly facilitated by advances in imaging sensor technology. To adapt to limited uplink bandwidth, most media platforms opt to compress videos to bitrate streams for transmission. However, this compression often leads to significant texture loss and artifacts, which severely degrade the Quality of Experience (QoE). We propose a latent feature diffusion model (LFDM) for compressed video quality enhancement, which comprises a compact edge latent feature prior network (ELPN) and a conditional noise prediction network (CNPN). Specifically, we first pre-train ELPNet to construct a latent feature space that captures rich detail information for representing sharpness latent variables. Second, we incorporate these latent variables into the prediction network to iteratively guide the generation direction, thus resolving the problem that the direct application of diffusion models to temporal prediction disrupts inter-frame dependencies, thereby completing the modeling of temporal correlations. Lastly, we innovatively develop a Grouped Domain Fusion module that effectively addresses the challenges of diffusion distortion caused by naive cross-domain information fusion. Comparative experiments on the MFQEv2 benchmark validate our algorithm's superior performance in terms of both objective and subjective metrics. By integrating with codecs and image sensors, our method can provide higher video quality.

摘要

近年来,成像传感器技术的进步直接推动了高清视频在物联网(IoT)系统中的迅速普及。为了适应有限的上行链路带宽,大多数媒体平台选择将视频压缩为比特率流进行传输。然而,这种压缩通常会导致显著的纹理损失和伪像,严重降低体验质量(QoE)。我们提出了一种用于压缩视频质量增强的潜在特征扩散模型(LFDM),它由一个紧凑的边缘潜在特征先验网络(ELPN)和一个条件噪声预测网络(CNPN)组成。具体来说,我们首先对ELPNet进行预训练,以构建一个潜在特征空间,该空间捕获丰富的细节信息来表示清晰度潜在变量。其次,我们将这些潜在变量纳入预测网络,以迭代地引导生成方向,从而解决直接将扩散模型应用于时间预测会破坏帧间依赖性的问题,进而完成时间相关性的建模。最后,我们创新性地开发了一个分组域融合模块,有效解决了简单跨域信息融合导致的扩散失真挑战。在MFQEv2基准上的对比实验验证了我们算法在客观和主观指标方面的卓越性能。通过与编解码器和图像传感器集成,我们的方法可以提供更高的视频质量。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831e/10974212/8df9608afad6/sensors-24-01907-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831e/10974212/33a827d5cb60/sensors-24-01907-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831e/10974212/849364b82634/sensors-24-01907-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831e/10974212/30490ecdad72/sensors-24-01907-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831e/10974212/24386358530c/sensors-24-01907-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831e/10974212/4ed71f30b4cf/sensors-24-01907-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831e/10974212/6b0e6d391c43/sensors-24-01907-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831e/10974212/425cb2b5f0b0/sensors-24-01907-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831e/10974212/8df9608afad6/sensors-24-01907-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831e/10974212/33a827d5cb60/sensors-24-01907-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831e/10974212/849364b82634/sensors-24-01907-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831e/10974212/30490ecdad72/sensors-24-01907-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831e/10974212/24386358530c/sensors-24-01907-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831e/10974212/4ed71f30b4cf/sensors-24-01907-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831e/10974212/6b0e6d391c43/sensors-24-01907-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831e/10974212/425cb2b5f0b0/sensors-24-01907-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/831e/10974212/8df9608afad6/sensors-24-01907-g008.jpg

相似文献

1
PixRevive: Latent Feature Diffusion Model for Compressed Video Quality Enhancement.PixRevive:用于压缩视频质量增强的潜在特征扩散模型
Sensors (Basel). 2024 Mar 16;24(6):1907. doi: 10.3390/s24061907.
2
Edge-Oriented Compressed Video Super-Resolution.面向边缘的压缩视频超分辨率
Sensors (Basel). 2023 Dec 28;24(1):170. doi: 10.3390/s24010170.
3
MDEformer: Mixed Difference Equation Inspired Transformer for Compressed Video Quality Enhancement.MDEformer:用于压缩视频质量增强的混合差分方程启发式Transformer
IEEE Trans Neural Netw Learn Syst. 2025 Feb;36(2):2410-2422. doi: 10.1109/TNNLS.2024.3354982. Epub 2025 Feb 6.
4
Recurrent and Dynamic Models for Predicting Streaming Video Quality of Experience.用于预测流媒体视频体验质量的反复和动态模型。
IEEE Trans Image Process. 2018 Jul;27(7):3316-3331. doi: 10.1109/TIP.2018.2815842.
5
Satisfied-User-Ratio Modeling for Compressed Video.压缩视频的满意用户比例建模
IEEE Trans Image Process. 2020 Jan 17. doi: 10.1109/TIP.2020.2965994.
6
Study of Temporal Effects on Subjective Video Quality of Experience.时间效应对主观视频体验质量影响的研究。
IEEE Trans Image Process. 2017 Nov;26(11):5217-5231. doi: 10.1109/TIP.2017.2729891. Epub 2017 Jul 20.
7
Fast-MFQE: A Fast Approach for Multi-Frame Quality Enhancement on Compressed Video.快速多帧量化增强(Fast-MFQE):一种用于压缩视频多帧质量增强的快速方法。
Sensors (Basel). 2023 Aug 17;23(16):7227. doi: 10.3390/s23167227.
8
Compressed Video Quality Index Based on Saliency-Aware Artifact Detection.基于显著度感知伪影检测的压缩视频质量指数
Sensors (Basel). 2021 Sep 26;21(19):6429. doi: 10.3390/s21196429.
9
Patch-Wise Spatial-Temporal Quality Enhancement for HEVC Compressed Video.用于HEVC压缩视频的逐块时空质量增强
IEEE Trans Image Process. 2021;30:6459-6472. doi: 10.1109/TIP.2021.3092949. Epub 2021 Jul 15.
10
Unsupervised Conditional Diffusion Models in Video Anomaly Detection for Monitoring Dust Pollution.用于监测粉尘污染的视频异常检测中的无监督条件扩散模型
Sensors (Basel). 2024 Feb 23;24(5):1464. doi: 10.3390/s24051464.

本文引用的文献

1
Image Super-Resolution via Iterative Refinement.通过迭代细化实现图像超分辨率
IEEE Trans Pattern Anal Mach Intell. 2023 Apr;45(4):4713-4726. doi: 10.1109/TPAMI.2022.3204461. Epub 2023 Mar 7.
2
MFQE 2.0: A New Approach for Multi-Frame Quality Enhancement on Compressed Video.MFQE 2.0:一种用于压缩视频多帧质量增强的新方法。
IEEE Trans Pattern Anal Mach Intell. 2021 Mar;43(3):949-963. doi: 10.1109/TPAMI.2019.2944806. Epub 2021 Feb 4.
3
Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising.超越高斯去噪器:用于图像去噪的深度 CNN 的残差学习。
IEEE Trans Image Process. 2017 Jul;26(7):3142-3155. doi: 10.1109/TIP.2017.2662206. Epub 2017 Feb 1.
4
Compression artifact reduction by overlapped-block transform coefficient estimation with block similarity.基于块相似性的重叠块变换系数估计减少压缩伪影。
IEEE Trans Image Process. 2013 Dec;22(12):4613-26. doi: 10.1109/TIP.2013.2274386. Epub 2013 Jul 23.
5
Pointwise shape-adaptive DCT for high-quality denoising and deblocking of grayscale and color images.用于高质量去噪和去块的逐点形状自适应离散余弦变换,适用于灰度图像和彩色图像。
IEEE Trans Image Process. 2007 May;16(5):1395-411. doi: 10.1109/tip.2007.891788.