Department of Computer Science and Technology, Ocean University of China, Qingdao 266100, China.
Department of Computer Science and Technology, Fudan University, Shanghai 200433, China.
Neural Netw. 2020 Jul;127:19-28. doi: 10.1016/j.neunet.2020.04.005. Epub 2020 Apr 9.
In recent years, research on image generation has been developing very fast. The generative adversarial network (GAN) emerges as a promising framework, which uses adversarial training to improve the generative ability of its generator. However, since GAN and most of its variants use randomly sampled noises as the input of their generators, they have to learn a mapping function from a whole random distribution to the image manifold. As the structures of the random distribution and the image manifold are generally different, this results in GAN and its variants difficult to train and converge. In this paper, we propose a novel deep model called generative adversarial networks with decoder-encoder output noises (DE-GANs), which take advantage of both the adversarial training and the variational Bayesian inference to improve GAN and its variants on image generation performances. DE-GANs use a pre-trained decoder-encoder architecture to map the random noise vectors to informative ones and feed them to the generator of the adversarial networks. Since the decoder-encoder architecture is trained with the same data set as the generator, its output vectors, as the inputs of the generator, could carry the intrinsic distribution information of the training images, which greatly improves the learnability of the generator and the quality of the generated images. Extensive experiments demonstrate the effectiveness of the proposed model, DE-GANs.
近年来,图像生成的研究发展非常迅速。生成对抗网络(GAN)作为一种很有前途的框架出现了,它使用对抗训练来提高其生成器的生成能力。然而,由于 GAN 及其大多数变体使用随机采样的噪声作为其生成器的输入,它们必须从整个随机分布学习到图像流形的映射函数。由于随机分布和图像流形的结构通常不同,这导致 GAN 及其变体难以训练和收敛。在本文中,我们提出了一种称为带解码器-编码器输出噪声的生成对抗网络(DE-GANs)的新型深度模型,它利用对抗训练和变分贝叶斯推断来提高 GAN 及其变体在图像生成性能方面的能力。DE-GANs 使用预先训练的解码器-编码器架构将随机噪声向量映射到信息丰富的向量,并将其馈送到对抗网络的生成器中。由于解码器-编码器架构是使用与生成器相同的数据集进行训练的,因此它的输出向量作为生成器的输入,可以携带训练图像的内在分布信息,这极大地提高了生成器的可学习性和生成图像的质量。大量的实验证明了所提出的模型 DE-GANs 的有效性。