基于信息的边界平衡生成对抗网络与可解释的表示学习。

Information-Based Boundary Equilibrium Generative Adversarial Networks with Interpretable Representation Learning.

机构信息

Industrial Engineering, Seoul National University, 1 Gwanakro, Gwanak-gu, Seoul 08826, Republic of Korea.

出版信息

Comput Intell Neurosci. 2018 Oct 17;2018:6465949. doi: 10.1155/2018/6465949. eCollection 2018.

DOI:10.1155/2018/6465949

PMID:30416519

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6207896/

Abstract

This paper describes a new image generation algorithm based on generative adversarial network. With an information-theoretic extension to the autoencoder-based discriminator, this new algorithm is able to learn interpretable representations from the input images. Our model not only adversarially minimizes the Wasserstein distance-based losses of the discriminator and generator but also maximizes the mutual information between small subset of the latent variables and the observation. We also train our model with proportional control theory to keep the equilibrium between the discriminator and the generator balanced, and as a result, our generative adversarial network can mitigate the convergence problem. Through the experiments on real images, we validate our proposed method, which can manipulate the generated images as desired by controlling the latent codes of input variables. In addition, the visual qualities of produced images are effectively maintained, and the model can stably converge to the equilibrium. However, our model has a difficulty in learning disentangling factors because our model does not regularize the independence between the interpretable factors. Therefore, in the future, we will develop a generative model that can learn disentangling factors.

摘要

本文描述了一种基于生成对抗网络的新图像生成算法。通过对基于自动编码器的判别器进行信息论扩展，该新算法能够从输入图像中学习可解释的表示。我们的模型不仅通过对抗最小化判别器和生成器的基于 Wasserstein 距离的损失，还通过最大化小部分潜在变量和观测之间的互信息来最大化。我们还使用比例控制理论来训练我们的模型，以保持判别器和生成器之间的平衡，因此，我们的生成对抗网络可以缓解收敛问题。通过对真实图像的实验，我们验证了我们提出的方法，该方法可以通过控制输入变量的潜在代码来操纵所需的生成图像。此外，产生的图像的视觉质量得到有效保持，并且模型可以稳定地收敛到平衡。但是，我们的模型在学习解耦因素方面存在困难，因为我们的模型没有对可解释因素之间的独立性进行正则化。因此，在未来，我们将开发一种能够学习解耦因素的生成模型。

相似文献

Information-Based Boundary Equilibrium Generative Adversarial Networks with Interpretable Representation Learning.

Comput Intell Neurosci. 2018 Oct 17;2018:6465949. doi: 10.1155/2018/6465949. eCollection 2018.

Optimizing Latent Distributions for Non-Adversarial Generative Networks.

IEEE Trans Pattern Anal Mach Intell. 2022 May;44(5):2657-2672. doi: 10.1109/TPAMI.2020.3043745. Epub 2022 Apr 1.

Learning brain representation using recurrent Wasserstein generative adversarial net.

Comput Methods Programs Biomed. 2022 Aug;223:106979. doi: 10.1016/j.cmpb.2022.106979. Epub 2022 Jun 27.

Generative adversarial networks with mixture of t-distributions noise for diverse image generation.

Neural Netw. 2020 Feb;122:374-381. doi: 10.1016/j.neunet.2019.11.003. Epub 2019 Nov 18.

CiwGAN and fiwGAN: Encoding information in acoustic data to model lexical learning with Generative Adversarial Networks.

Neural Netw. 2021 Jul;139:305-325. doi: 10.1016/j.neunet.2021.03.017. Epub 2021 Mar 19.

Generative adversarial networks with decoder-encoder output noises.

Neural Netw. 2020 Jul;127:19-28. doi: 10.1016/j.neunet.2020.04.005. Epub 2020 Apr 9.

Generative adversarial network based telecom fraud detection at the receiving bank.

Neural Netw. 2018 Jun;102:78-86. doi: 10.1016/j.neunet.2018.02.015. Epub 2018 Mar 5.

Representation Learning by Rotating Your Faces.

IEEE Trans Pattern Anal Mach Intell. 2019 Dec;41(12):3007-3021. doi: 10.1109/TPAMI.2018.2868350. Epub 2018 Sep 3.

f-AnoGAN: Fast unsupervised anomaly detection with generative adversarial networks.

Med Image Anal. 2019 May;54:30-44. doi: 10.1016/j.media.2019.01.010. Epub 2019 Jan 31.

The Deep Learning Generative Adversarial Random Neural Network in data marketplaces: The digital creative.

Neural Netw. 2023 Aug;165:420-434. doi: 10.1016/j.neunet.2023.05.028. Epub 2023 May 30.

引用本文的文献

Intelligent Generation Method of Innovative Structures Based on Topology Optimization and Deep Learning.

Materials (Basel). 2021 Dec 13;14(24):7680. doi: 10.3390/ma14247680.

本文引用的文献

Representation learning: a review and new perspectives.

IEEE Trans Pattern Anal Mach Intell. 2013 Aug;35(8):1798-828. doi: 10.1109/TPAMI.2013.50.

Face recognition: a convolutional neural-network approach.

IEEE Trans Neural Netw. 1997;8(1):98-113. doi: 10.1109/72.554195.

A fast learning algorithm for deep belief nets.

Neural Comput. 2006 Jul;18(7):1527-54. doi: 10.1162/neco.2006.18.7.1527.

Separating style and content with bilinear models.

Neural Comput. 2000 Jun;12(6):1247-83. doi: 10.1162/089976600300015349.

The "wake-sleep" algorithm for unsupervised neural networks.

Science. 1995 May 26;268(5214):1158-61. doi: 10.1126/science.7761831.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于信息的边界平衡生成对抗网络与可解释的表示学习。

Information-Based Boundary Equilibrium Generative Adversarial Networks with Interpretable Representation Learning.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献