• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Pro-PULSE:在用于照片超分辨率的生成对抗网络中学习潜在语义的渐进编码器

Pro-PULSE: Learning Progressive Encoders of Latent Semantics in GANs for Photo Upsampling.

作者信息

Zhou Yang, Xu Yangyang, Du Yong, Wen Qiang, He Shengfeng

出版信息

IEEE Trans Image Process. 2022;31:1230-1242. doi: 10.1109/TIP.2022.3140603. Epub 2022 Jan 19.

DOI:10.1109/TIP.2022.3140603
PMID:35015636
Abstract

The state-of-the-art photo upsampling method, PULSE, demonstrates that a sharp, high-resolution (HR) version of a given low-resolution (LR) input can be obtained by exploring the latent space of generative models. However, mapping an extreme LR input (16) directly to an HR image (1024) is too ambiguous to preserve faithful local facial semantics. In this paper, we propose an enhanced upsampling approach, Pro-PULSE, that addresses the issues of semantic inconsistency and optimization complexity. Our idea is to learn an encoder that progressively constructs the HR latent codes in the extended W+ latent space of StyleGAN. This design divides the complex 64× upsampling problem into several steps, and therefore small-scale facial semantics can be inherited from one end to the other. In particular, we train two encoders, the base encoder maps latent vectors in W space and serves as a foundation of the HR latent vector, while the second scale-specific encoder performed in W+ space gradually replaces the previous vector produced by the base encoder at each scale. This process produces intermediate side-outputs, which injects deep supervision into the training of encoder. Extensive experiments demonstrate superiorities over the latest latent space exploration methods, in terms of efficiency, quantitative quality metrics, and qualitative visual results.

摘要

最先进的照片超分辨率方法PULSE表明,通过探索生成模型的潜在空间,可以获得给定低分辨率(LR)输入的清晰、高分辨率(HR)版本。然而,将极低分辨率输入(16×16)直接映射到高分辨率图像(1024×1024)过于模糊,无法保留忠实的局部面部语义。在本文中,我们提出了一种增强的超分辨率方法Pro-PULSE,该方法解决了语义不一致和优化复杂性的问题。我们的想法是学习一个编码器,该编码器在StyleGAN的扩展W+潜在空间中逐步构建高分辨率潜在代码。这种设计将复杂的64倍超分辨率问题分解为几个步骤,因此小规模的面部语义可以从一端继承到另一端。具体来说,我们训练了两个编码器,基础编码器映射W空间中的潜在向量,并作为高分辨率潜在向量的基础,而在W+空间中执行的第二个特定尺度编码器在每个尺度上逐渐替换由基础编码器产生的先前向量。这个过程产生中间侧输出,将深度监督注入到编码器的训练中。大量实验表明,在效率、定量质量指标和定性视觉结果方面,Pro-PULSE优于最新的潜在空间探索方法。

相似文献

1
Pro-PULSE: Learning Progressive Encoders of Latent Semantics in GANs for Photo Upsampling.Pro-PULSE:在用于照片超分辨率的生成对抗网络中学习潜在语义的渐进编码器
IEEE Trans Image Process. 2022;31:1230-1242. doi: 10.1109/TIP.2022.3140603. Epub 2022 Jan 19.
2
Identity preserving multi-pose facial expression recognition using fine tuned VGG on the latent space vector of generative adversarial network.基于生成对抗网络潜在空间向量的微调 VGG 进行身份保留多姿态面部表情识别。
Math Biosci Eng. 2021 Apr 28;18(4):3699-3717. doi: 10.3934/mbe.2021186.
3
A Semantic Segment Encoder (SSE): Improving human face inversion quality through minimized learning space.语义分割编码器 (SSE):通过最小化学习空间来提高人脸反演质量。
PLoS One. 2023 Dec 5;18(12):e0295316. doi: 10.1371/journal.pone.0295316. eCollection 2023.
4
Semantic Face Hallucination: Super-Resolving Very Low-Resolution Face Images with Supplementary Attributes.语义人脸幻觉:利用补充属性超分辨超低分辨率人脸图像。
IEEE Trans Pattern Anal Mach Intell. 2020 Nov;42(11):2926-2943. doi: 10.1109/TPAMI.2019.2916881. Epub 2019 May 14.
5
ShapeEditor: A StyleGAN Encoder for Stable and High Fidelity Face Swapping.ShapeEditor:一种用于稳定且高保真面部交换的StyleGAN编码器。
Front Neurorobot. 2022 Jan 21;15:785808. doi: 10.3389/fnbot.2021.785808. eCollection 2021.
6
Local and Global GANs With Semantic-Aware Upsampling for Image Generation.用于图像生成的具有语义感知上采样的局部和全局生成对抗网络
IEEE Trans Pattern Anal Mach Intell. 2023 Jan;45(1):768-784. doi: 10.1109/TPAMI.2022.3155989. Epub 2022 Dec 5.
7
Domain-Scalable Unpaired Image Translation via Latent Space Anchoring.通过潜在空间锚定实现域可扩展的无配对图像翻译
IEEE Trans Pattern Anal Mach Intell. 2023 Oct;45(10):11707-11719. doi: 10.1109/TPAMI.2023.3287774. Epub 2023 Sep 5.
8
A Novel Upsampling and Context Convolution for Image Semantic Segmentation.一种用于图像语义分割的新型上采样与上下文卷积
Sensors (Basel). 2021 Mar 20;21(6):2170. doi: 10.3390/s21062170.
9
InterFaceGAN: Interpreting the Disentangled Face Representation Learned by GANs.InterFaceGAN:解释 GAN 学习到的解缠面部表示。
IEEE Trans Pattern Anal Mach Intell. 2022 Apr;44(4):2004-2018. doi: 10.1109/TPAMI.2020.3034267. Epub 2022 Mar 4.
10
Deep Unsupervised Fusion Learning for Hyperspectral Image Super Resolution.用于高光谱图像超分辨率的深度无监督融合学习
Sensors (Basel). 2021 Mar 28;21(7):2348. doi: 10.3390/s21072348.

引用本文的文献

1
Enhanced Rail Surface Defect Segmentation Using Polarization Imaging and Dual-Stream Feature Fusion.基于偏振成像和双流特征融合的增强型铁轨表面缺陷分割
Sensors (Basel). 2025 Jun 4;25(11):3546. doi: 10.3390/s25113546.