• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于卷积网络拉普拉斯金字塔的语义感知图像压缩

Semantic Perceptual Image Compression With a Laplacian Pyramid of Convolutional Networks.

作者信息

Wang Juan, Duan Yiping, Tao Xiaoming, Xu Mai, Lu Jianhua

出版信息

IEEE Trans Image Process. 2021;30:4225-4237. doi: 10.1109/TIP.2021.3065244. Epub 2021 Apr 12.

DOI:10.1109/TIP.2021.3065244
PMID:33735078
Abstract

The existing image compression methods usually choose or optimize low-level representation manually. Actually, these methods struggle for the texture restoration at low bit rates. Recently, deep neural network (DNN)-based image compression methods have achieved impressive results. To achieve better perceptual quality, generative models are widely used, especially generative adversarial networks (GAN). However, training GAN is intractable, especially for high-resolution images, with the challenges of unconvincing reconstructions and unstable training. To overcome these problems, we propose a novel DNN-based image compression framework in this paper. The key point is decomposing an image into multi-scale sub-images using the proposed Laplacian pyramid based multi-scale networks. For each pyramid scale, we train a specific DNN to exploit the compressive representation. Meanwhile, each scale is optimized with different aspects, including pixel, semantics, distribution and entropy, for a good "rate-distortion-perception" trade-off. By independently optimizing each pyramid scale, we make each stage manageable and make each sub-image plausible. Experimental results demonstrate that our method achieves state-of-the-art performance, with advantages over existing methods in providing improved visual quality. Additionally, a better performance in the down-stream visual analysis tasks which are conducted on the reconstructed images, validates the excellent semantics-preserving ability of the proposed method.

摘要

现有的图像压缩方法通常手动选择或优化低级表示。实际上,这些方法在低比特率下难以进行纹理恢复。最近,基于深度神经网络(DNN)的图像压缩方法取得了令人瞩目的成果。为了获得更好的感知质量,生成模型被广泛使用,特别是生成对抗网络(GAN)。然而,训练GAN很棘手,尤其是对于高分辨率图像,存在重建效果不佳和训练不稳定的挑战。为了克服这些问题,我们在本文中提出了一种新颖的基于DNN的图像压缩框架。关键在于使用所提出的基于拉普拉斯金字塔的多尺度网络将图像分解为多尺度子图像。对于每个金字塔尺度,我们训练一个特定的DNN来利用压缩表示。同时,每个尺度在像素、语义、分布和熵等不同方面进行优化,以实现良好的“率失真感知”权衡。通过独立优化每个金字塔尺度,我们使每个阶段易于管理,并使每个子图像合理。实验结果表明,我们的方法实现了最优性能,在提供改进的视觉质量方面优于现有方法。此外,在对重建图像进行的下游视觉分析任务中表现更好,验证了所提出方法出色的语义保留能力。

相似文献

1
Semantic Perceptual Image Compression With a Laplacian Pyramid of Convolutional Networks.基于卷积网络拉普拉斯金字塔的语义感知图像压缩
IEEE Trans Image Process. 2021;30:4225-4237. doi: 10.1109/TIP.2021.3065244. Epub 2021 Apr 12.
2
Hierarchical Recurrent Neural Hashing for Image Retrieval With Hierarchical Convolutional Features.基于层次卷积特征的层次递归神经网络哈希图像检索
IEEE Trans Image Process. 2018;27(1):106-120. doi: 10.1109/TIP.2017.2755766.
3
Lightweight Deep Exemplar Colorization via Semantic Attention-Guided Laplacian Pyramid.基于语义注意力引导拉普拉斯金字塔的轻量级深度示例图像上色
IEEE Trans Vis Comput Graph. 2025 Aug;31(8):4257-4269. doi: 10.1109/TVCG.2024.3398791.
4
Perceptual Adversarial Networks With a Feature Pyramid for Image Translation.用于图像翻译的具有特征金字塔的感知对抗网络
IEEE Comput Graph Appl. 2019 Jul-Aug;39(4):68-77. doi: 10.1109/MCG.2019.2914426. Epub 2019 May 8.
5
Deep Inception-Residual Laplacian Pyramid Networks for Accurate Single-Image Super-Resolution.用于精确单图像超分辨率的深度 inception-残差拉普拉斯金字塔网络
IEEE Trans Neural Netw Learn Syst. 2020 May;31(5):1514-1528. doi: 10.1109/TNNLS.2019.2920852. Epub 2019 Jun 28.
6
Synthetic CT reconstruction using a deep spatial pyramid convolutional framework for MR-only breast radiotherapy.基于深度空间金字塔卷积框架的合成 CT 重建技术在仅 MRI 乳腺癌放疗中的应用。
Med Phys. 2019 Sep;46(9):4135-4147. doi: 10.1002/mp.13716. Epub 2019 Aug 7.
7
Super-resolution of cardiac magnetic resonance images using Laplacian Pyramid based on Generative Adversarial Networks.基于生成对抗网络的拉普拉斯金字塔的心脏磁共振图像超分辨率。
Comput Med Imaging Graph. 2020 Mar;80:101698. doi: 10.1016/j.compmedimag.2020.101698. Epub 2020 Jan 3.
8
Fast and Accurate Image Super-Resolution with Deep Laplacian Pyramid Networks.基于深度拉普拉斯金字塔网络的快速准确图像超分辨率
IEEE Trans Pattern Anal Mach Intell. 2019 Nov;41(11):2599-2613. doi: 10.1109/TPAMI.2018.2865304. Epub 2018 Aug 13.
9
SAM-GAN: Self-Attention supporting Multi-stage Generative Adversarial Networks for text-to-image synthesis.SAM-GAN:用于文本到图像合成的支持多阶段生成对抗网络的自注意力模型。
Neural Netw. 2021 Jun;138:57-67. doi: 10.1016/j.neunet.2021.01.023. Epub 2021 Feb 10.
10
UENet: A Novel Generative Adversarial Network for Angiography Image Segmentation.UENet:一种用于血管造影图像分割的新型生成对抗网络。
Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:1612-1615. doi: 10.1109/EMBC44109.2020.9175334.

引用本文的文献

1
A novel image semantic communication method via dynamic decision generation network and generative adversarial network.一种基于动态决策生成网络和生成对抗网络的新型图像语义通信方法。
Sci Rep. 2024 Aug 23;14(1):19636. doi: 10.1038/s41598-024-70619-9.