• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

BSD-GAN:用于尺度解缠表示学习和图像合成的分支生成对抗网络。

BSD-GAN: Branched Generative Adversarial Network for Scale-Disentangled Representation Learning and Image Synthesis.

作者信息

Yi Zili, Chen Zhiqin, Cai Hao, Mao Wendong, Gong Minglun, Zhang Hao

出版信息

IEEE Trans Image Process. 2020 Aug 12;PP. doi: 10.1109/TIP.2020.3014608.

DOI:10.1109/TIP.2020.3014608
PMID:32784136
Abstract

We introduce BSD-GAN, a novel multi-branch and scale-disentangled training method which enables unconditional Generative Adversarial Networks (GANs) to learn image representations at multiple scales, benefiting a wide range of generation and editing tasks. The key feature of BSD-GAN is that it is trained in multiple branches, progressively covering both the breadth and depth of the network, as resolutions of the training images increase to reveal finer-scale features. Specifically, each noise vector, as input to the generator network of BSD-GAN, is deliberately split into several sub-vectors, each corresponding to, and is trained to learn, image representations at a particular scale. During training, we progressively "de-freeze" the sub-vectors, one at a time, as a new set of higher-resolution images is employed for training and more network layers are added. A consequence of such an explicit sub-vector designation is that we can directly manipulate and even combine latent (sub-vector) codes which model different feature scales. Extensive experiments demonstrate the effectiveness of our training method in scale-disentangled learning of image representations and synthesis of novel image contents, without any extra labels and without compromising quality of the synthesized high-resolution images. We further demonstrate several image generation and manipulation applications enabled or improved by BSD-GAN.

摘要

我们提出了BSD-GAN,这是一种新颖的多分支和尺度解缠训练方法,它使无条件生成对抗网络(GAN)能够在多个尺度上学习图像表示,从而有利于广泛的生成和编辑任务。BSD-GAN的关键特性在于它在多个分支中进行训练,随着训练图像分辨率的提高以揭示更精细尺度的特征,逐步覆盖网络的广度和深度。具体而言,作为BSD-GAN生成器网络输入的每个噪声向量被有意地拆分为几个子向量,每个子向量对应于特定尺度的图像表示并被训练以学习该表示。在训练期间,随着一组新的更高分辨率图像用于训练并添加更多网络层,我们一次逐步“解冻”一个子向量。这种明确的子向量指定的一个结果是,我们可以直接操纵甚至组合对不同特征尺度进行建模的潜在(子向量)代码。大量实验证明了我们的训练方法在图像表示的尺度解缠学习和新图像内容合成方面的有效性,无需任何额外标签且不影响合成高分辨率图像的质量。我们进一步展示了由BSD-GAN实现或改进的几个图像生成和操纵应用。

相似文献

1
BSD-GAN: Branched Generative Adversarial Network for Scale-Disentangled Representation Learning and Image Synthesis.BSD-GAN:用于尺度解缠表示学习和图像合成的分支生成对抗网络。
IEEE Trans Image Process. 2020 Aug 12;PP. doi: 10.1109/TIP.2020.3014608.
2
Generative adversarial networks with decoder-encoder output noises.生成对抗网络与解码器编码器输出噪声。
Neural Netw. 2020 Jul;127:19-28. doi: 10.1016/j.neunet.2020.04.005. Epub 2020 Apr 9.
3
Inverting the Generator of a Generative Adversarial Network.反转生成对抗网络的生成器
IEEE Trans Neural Netw Learn Syst. 2018 Nov 2. doi: 10.1109/TNNLS.2018.2875194.
4
StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks.StackGAN++:基于堆叠生成对抗网络的逼真图像合成
IEEE Trans Pattern Anal Mach Intell. 2019 Aug;41(8):1947-1962. doi: 10.1109/TPAMI.2018.2856256. Epub 2018 Jul 16.
5
HRGAN: A Generative Adversarial Network Producing Higher-Resolution Images than Training Sets.HRGAN:一种生成对抗网络,可生成比训练集分辨率更高的图像。
Sensors (Basel). 2022 Feb 13;22(4):1435. doi: 10.3390/s22041435.
6
ROP-GAN: an image synthesis method for retinopathy of prematurity based on generative adversarial network.ROP-GAN:一种基于生成对抗网络的早产儿视网膜病变图像合成方法。
Phys Med Biol. 2023 Oct 6;68(20). doi: 10.1088/1361-6560/acf3c9.
7
Robust Data Augmentation Generative Adversarial Network for Object Detection.用于目标检测的鲁棒数据增强生成对抗网络。
Sensors (Basel). 2022 Dec 23;23(1):157. doi: 10.3390/s23010157.
8
InterFaceGAN: Interpreting the Disentangled Face Representation Learned by GANs.InterFaceGAN:解释 GAN 学习到的解缠面部表示。
IEEE Trans Pattern Anal Mach Intell. 2022 Apr;44(4):2004-2018. doi: 10.1109/TPAMI.2020.3034267. Epub 2022 Mar 4.
9
CiwGAN and fiwGAN: Encoding information in acoustic data to model lexical learning with Generative Adversarial Networks.CiwGAN 和 fiwGAN:利用生成对抗网络将声学数据中的信息编码,以建模词汇学习。
Neural Netw. 2021 Jul;139:305-325. doi: 10.1016/j.neunet.2021.03.017. Epub 2021 Mar 19.
10
A Method based on Evolutionary Algorithms and Channel Attention Mechanism to Enhance Cycle Generative Adversarial Network Performance for Image Translation.基于进化算法和通道注意力机制的方法来提高用于图像翻译的循环生成对抗网络性能。
Int J Neural Syst. 2023 May;33(5):2350026. doi: 10.1142/S0129065723500260. Epub 2023 Apr 5.