Kang Minguk, Shin Joonghyuk, Park Jaesik
IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):15725-15742. doi: 10.1109/TPAMI.2023.3306436. Epub 2023 Nov 3.
Generative Adversarial Network (GAN) is one of the state-of-the-art generative models for realistic image synthesis. While training and evaluating GAN becomes increasingly important, the current GAN research ecosystem does not provide reliable benchmarks for which the evaluation is conducted consistently and fairly. Furthermore, because there are few validated GAN implementations, researchers devote considerable time to reproducing baselines. We study the taxonomy of GAN approaches and present a new open-source library named StudioGAN. StudioGAN supports 7 GAN architectures, 9 conditioning methods, 4 adversarial losses, 12 regularization modules, 3 differentiable augmentations, 7 evaluation metrics, and 5 evaluation backbones. With our training and evaluation protocol, we present a large-scale benchmark using various datasets (CIFAR10, ImageNet, AFHQv2, FFHQ, and Baby/Papa/Granpa-ImageNet) and 3 different evaluation backbones (InceptionV3, SwAV, and Swin Transformer). Unlike other benchmarks used in the GAN community, we train representative GANs, including BigGAN and StyleGAN series in a unified training pipeline and quantify generation performance with 7 evaluation metrics. The benchmark evaluates other cutting-edge generative models (e.g., StyleGAN-XL, ADM, MaskGIT, and RQ-Transformer). StudioGAN provides GAN implementations, training, and evaluation scripts with the pre-trained weights. StudioGAN is available at https://github.com/POSTECH-CVLab/PyTorch-StudioGAN.
生成对抗网络(GAN)是用于逼真图像合成的最先进生成模型之一。虽然训练和评估GAN变得越来越重要,但当前的GAN研究生态系统并未提供可靠的基准,无法据此进行一致且公平的评估。此外,由于经过验证的GAN实现很少,研究人员需要花费大量时间来重现基准。我们研究了GAN方法的分类,并提出了一个名为StudioGAN的新开源库。StudioGAN支持7种GAN架构、9种条件方法、4种对抗损失、12个正则化模块、3种可微增强、7种评估指标和5种评估主干。通过我们的训练和评估协议,我们使用各种数据集(CIFAR10、ImageNet、AFHQv2、FFHQ以及婴儿/爸爸/爷爷-ImageNet)和3种不同的评估主干(InceptionV3、SwAV和Swin Transformer)提出了一个大规模基准。与GAN社区中使用的其他基准不同,我们在统一的训练管道中训练代表性的GAN,包括BigGAN和StyleGAN系列,并使用7种评估指标量化生成性能。该基准评估其他前沿生成模型(例如,StyleGAN-XL、ADM、MaskGIT和RQ-Transformer)。StudioGAN提供带有预训练权重的GAN实现、训练和评估脚本。StudioGAN可在https://github.com/POSTECH-CVLab/PyTorch-StudioGAN上获取。