• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GLIBS: 用于图像超分辨率的生成潜在银行及其他。

GLEAN: Generative Latent Bank for Image Super-Resolution and Beyond.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3154-3168. doi: 10.1109/TPAMI.2022.3186715. Epub 2023 Feb 3.

DOI:10.1109/TPAMI.2022.3186715
PMID:35763473
Abstract

We show that pre-trained Generative Adversarial Networks (GANs) such as StyleGAN and BigGAN can be used as a latent bank to improve the performance of image super-resolution. While most existing perceptual-oriented approaches attempt to generate realistic outputs through learning with adversarial loss, our method, Generative LatEnt bANk (GLEAN), goes beyond existing practices by directly leveraging rich and diverse priors encapsulated in a pre-trained GAN. But unlike prevalent GAN inversion methods that require expensive image-specific optimization at runtime, our approach only needs a single forward pass for restoration. GLEAN can be easily incorporated in a simple encoder-bank-decoder architecture with multi-resolution skip connections. Employing priors from different generative models allows GLEAN to be applied to diverse categories (e.g., human faces, cats, buildings, and cars). We further present a lightweight version of GLEAN, named LightGLEAN, which retains only the critical components in GLEAN. Notably, LightGLEAN consists of only 21% of parameters and 35% of FLOPs while achieving comparable image quality. We extend our method to different tasks including image colorization and blind image restoration, and extensive experiments show that our proposed models perform favorably in comparison to existing methods. Codes and models are available at https://github.com/open-mmlab/mmediting.

摘要

我们证明,像 StyleGAN 和 BigGAN 这样的预训练生成对抗网络 (GAN) 可以用作潜在库来提高图像超分辨率的性能。虽然大多数现有的基于感知的方法试图通过对抗性损失学习生成逼真的输出,但我们的方法,Generative LatEnt bANk (GLEAN),超越了现有的实践,直接利用预训练 GAN 中封装的丰富多样的先验知识。但与需要在运行时进行昂贵的特定图像优化的流行 GAN 反转方法不同,我们的方法只需要一次正向传递即可进行恢复。 GLEAN 可以轻松地与具有多分辨率跳过连接的简单编码器-库-解码器架构结合使用。从不同生成模型中利用先验知识,允许 GLEAN 应用于不同的类别(例如,人脸、猫、建筑物和汽车)。我们进一步提出了 GLEAN 的轻量级版本,称为 LightGLEAN,它仅保留 GLEAN 中的关键组件。值得注意的是,LightGLEAN 的参数仅为 21%,FLOPs 仅为 35%,而图像质量相当。我们将我们的方法扩展到不同的任务,包括图像着色和盲图像恢复,广泛的实验表明,与现有方法相比,我们提出的模型表现良好。代码和模型可在 https://github.com/open-mmlab/mmediting 上获得。

相似文献

1
GLEAN: Generative Latent Bank for Image Super-Resolution and Beyond.GLIBS: 用于图像超分辨率的生成潜在银行及其他。
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3154-3168. doi: 10.1109/TPAMI.2022.3186715. Epub 2023 Feb 3.
2
Generative adversarial networks with decoder-encoder output noises.生成对抗网络与解码器编码器输出噪声。
Neural Netw. 2020 Jul;127:19-28. doi: 10.1016/j.neunet.2020.04.005. Epub 2020 Apr 9.
3
StudioGAN: A Taxonomy and Benchmark of GANs for Image Synthesis.StudioGAN:用于图像合成的生成对抗网络分类法与基准测试
IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):15725-15742. doi: 10.1109/TPAMI.2023.3306436. Epub 2023 Nov 3.
4
RankSRGAN: Super Resolution Generative Adversarial Networks With Learning to Rank.RankSRGAN:带有学习排序的超分辨率生成对抗网络。
IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):7149-7166. doi: 10.1109/TPAMI.2021.3096327. Epub 2022 Sep 14.
5
GH-Feat: Learning Versatile Generative Hierarchical Features From GANs.GH-Feat:从 GAN 中学习通用生成层次特征。
IEEE Trans Pattern Anal Mach Intell. 2023 Jun;45(6):7395-7411. doi: 10.1109/TPAMI.2022.3225788. Epub 2023 May 5.
6
Inverting the Generator of a Generative Adversarial Network.反转生成对抗网络的生成器
IEEE Trans Neural Netw Learn Syst. 2018 Nov 2. doi: 10.1109/TNNLS.2018.2875194.
7
Physics-Based Generative Adversarial Models for Image Restoration and Beyond.用于图像恢复及其他领域的基于物理的生成对抗模型
IEEE Trans Pattern Anal Mach Intell. 2021 Jul;43(7):2449-2462. doi: 10.1109/TPAMI.2020.2969348. Epub 2021 Jun 8.
8
2D facial landmark localization method for multi-view face synthesis image using a two-pathway generative adversarial network approach.基于双通路生成对抗网络方法的多视角人脸合成图像的二维面部地标定位方法
PeerJ Comput Sci. 2022 Feb 16;8:e897. doi: 10.7717/peerj-cs.897. eCollection 2022.
9
Rectified Wasserstein Generative Adversarial Networks for Perceptual Image Restoration.用于感知图像恢复的校正瓦瑟斯坦生成对抗网络
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3648-3663. doi: 10.1109/TPAMI.2022.3185316. Epub 2023 Feb 3.
10
StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks.StackGAN++:基于堆叠生成对抗网络的逼真图像合成
IEEE Trans Pattern Anal Mach Intell. 2019 Aug;41(8):1947-1962. doi: 10.1109/TPAMI.2018.2856256. Epub 2018 Jul 16.

引用本文的文献

1
Multi-scale adversarial diffusion network for image super-resolution.用于图像超分辨率的多尺度对抗扩散网络。
Sci Rep. 2025 Apr 5;15(1):11690. doi: 10.1038/s41598-025-96185-2.
2
Dual contrastive learning for synthesizing unpaired fundus fluorescein angiography from retinal fundus images.用于从眼底图像合成未配对眼底荧光血管造影的双对比学习。
Quant Imaging Med Surg. 2024 Mar 15;14(3):2193-2212. doi: 10.21037/qims-23-1304. Epub 2024 Feb 23.