You Haoran, Cheng Yu, Cheng Tianheng, Li Chunliang, Zhou Pan
IEEE Trans Neural Netw Learn Syst. 2021 Oct;32(10):4389-4403. doi: 10.1109/TNNLS.2020.3017669. Epub 2021 Oct 5.
Recent techniques built on generative adversarial networks (GANs), such as cycle-consistent GANs, are able to learn mappings among different domains built from unpaired data sets, through min-max optimization games between generators and discriminators. However, it remains challenging to stabilize the training process and thus cyclic models fall into mode collapse accompanied by the success of discriminator. To address this problem, we propose an novel Bayesian cyclic model and an integrated cyclic framework for interdomain mappings. The proposed method motivated by Bayesian GAN explores the full posteriors of cyclic model via sampling latent variables and optimizes the model with maximum a posteriori (MAP) estimation. Hence, we name it Bayesian CycleGAN. In addition, original CycleGAN cannot generate diversified results. But it is feasible for Bayesian framework to diversify generated images by replacing restricted latent variables in inference process. We evaluate the proposed Bayesian CycleGAN on multiple benchmark data sets, including Cityscapes, Maps, and Monet2photo. The proposed method improve the per-pixel accuracy by 15% for the Cityscapes semantic segmentation task within origin framework and improve 20% within the proposed integrated framework, showing better resilience to imbalance confrontation. The diversified results of Monet2Photo style transfer also demonstrate its superiority over original cyclic model. We provide codes for all of our experiments in https://github.com/ranery/Bayesian-CycleGAN.
基于生成对抗网络(GAN)构建的最新技术,如循环一致GAN,能够通过生成器和判别器之间的最小-最大优化博弈,从未配对数据集中学习不同域之间的映射。然而,稳定训练过程仍然具有挑战性,因此循环模型会随着判别器的成功而陷入模式崩溃。为了解决这个问题,我们提出了一种新颖的贝叶斯循环模型和一个用于域间映射的集成循环框架。所提出的方法受贝叶斯GAN的启发,通过对潜在变量进行采样来探索循环模型的完整后验,并使用最大后验(MAP)估计对模型进行优化。因此,我们将其命名为贝叶斯循环一致生成对抗网络(Bayesian CycleGAN)。此外,原始的循环一致生成对抗网络无法生成多样化的结果。但是,对于贝叶斯框架来说,通过在推理过程中替换受限的潜在变量来使生成的图像多样化是可行的。我们在多个基准数据集上评估了所提出的贝叶斯循环一致生成对抗网络,包括城市景观数据集(Cityscapes)、地图数据集(Maps)和莫奈到照片数据集(Monet2photo)。所提出的方法在原始框架内,将城市景观语义分割任务的每像素准确率提高了15%,在提出的集成框架内提高了20%,显示出对不平衡对抗更好的弹性。莫奈到照片风格迁移的多样化结果也证明了它优于原始的循环模型。我们在https://github.com/ranery/Bayesian-CycleGAN中提供了所有实验的代码。