• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

StudioGAN:用于图像合成的生成对抗网络分类法与基准测试

StudioGAN: A Taxonomy and Benchmark of GANs for Image Synthesis.

作者信息

Kang Minguk, Shin Joonghyuk, Park Jaesik

出版信息

IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):15725-15742. doi: 10.1109/TPAMI.2023.3306436. Epub 2023 Nov 3.

DOI:10.1109/TPAMI.2023.3306436
PMID:37594871
Abstract

Generative Adversarial Network (GAN) is one of the state-of-the-art generative models for realistic image synthesis. While training and evaluating GAN becomes increasingly important, the current GAN research ecosystem does not provide reliable benchmarks for which the evaluation is conducted consistently and fairly. Furthermore, because there are few validated GAN implementations, researchers devote considerable time to reproducing baselines. We study the taxonomy of GAN approaches and present a new open-source library named StudioGAN. StudioGAN supports 7 GAN architectures, 9 conditioning methods, 4 adversarial losses, 12 regularization modules, 3 differentiable augmentations, 7 evaluation metrics, and 5 evaluation backbones. With our training and evaluation protocol, we present a large-scale benchmark using various datasets (CIFAR10, ImageNet, AFHQv2, FFHQ, and Baby/Papa/Granpa-ImageNet) and 3 different evaluation backbones (InceptionV3, SwAV, and Swin Transformer). Unlike other benchmarks used in the GAN community, we train representative GANs, including BigGAN and StyleGAN series in a unified training pipeline and quantify generation performance with 7 evaluation metrics. The benchmark evaluates other cutting-edge generative models (e.g., StyleGAN-XL, ADM, MaskGIT, and RQ-Transformer). StudioGAN provides GAN implementations, training, and evaluation scripts with the pre-trained weights. StudioGAN is available at https://github.com/POSTECH-CVLab/PyTorch-StudioGAN.

摘要

生成对抗网络(GAN)是用于逼真图像合成的最先进生成模型之一。虽然训练和评估GAN变得越来越重要,但当前的GAN研究生态系统并未提供可靠的基准,无法据此进行一致且公平的评估。此外,由于经过验证的GAN实现很少,研究人员需要花费大量时间来重现基准。我们研究了GAN方法的分类,并提出了一个名为StudioGAN的新开源库。StudioGAN支持7种GAN架构、9种条件方法、4种对抗损失、12个正则化模块、3种可微增强、7种评估指标和5种评估主干。通过我们的训练和评估协议,我们使用各种数据集(CIFAR10、ImageNet、AFHQv2、FFHQ以及婴儿/爸爸/爷爷-ImageNet)和3种不同的评估主干(InceptionV3、SwAV和Swin Transformer)提出了一个大规模基准。与GAN社区中使用的其他基准不同,我们在统一的训练管道中训练代表性的GAN,包括BigGAN和StyleGAN系列,并使用7种评估指标量化生成性能。该基准评估其他前沿生成模型(例如,StyleGAN-XL、ADM、MaskGIT和RQ-Transformer)。StudioGAN提供带有预训练权重的GAN实现、训练和评估脚本。StudioGAN可在https://github.com/POSTECH-CVLab/PyTorch-StudioGAN上获取。

相似文献

1
StudioGAN: A Taxonomy and Benchmark of GANs for Image Synthesis.StudioGAN:用于图像合成的生成对抗网络分类法与基准测试
IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):15725-15742. doi: 10.1109/TPAMI.2023.3306436. Epub 2023 Nov 3.
2
GLEAN: Generative Latent Bank for Image Super-Resolution and Beyond.GLIBS: 用于图像超分辨率的生成潜在银行及其他。
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3154-3168. doi: 10.1109/TPAMI.2022.3186715. Epub 2023 Feb 3.
3
Insights and Considerations in Development and Performance Evaluation of Generative Adversarial Networks (GANs): What Radiologists Need to Know.生成对抗网络(GANs)开发与性能评估中的见解与思考:放射科医生需要了解的内容。
Diagnostics (Basel). 2024 Aug 13;14(16):1756. doi: 10.3390/diagnostics14161756.
4
Ea-GANs: Edge-Aware Generative Adversarial Networks for Cross-Modality MR Image Synthesis.Ea-GANs:用于跨模态磁共振图像合成的边缘感知生成对抗网络。
IEEE Trans Med Imaging. 2019 Jul;38(7):1750-1762. doi: 10.1109/TMI.2019.2895894. Epub 2019 Jan 29.
5
A multimodal comparison of latent denoising diffusion probabilistic models and generative adversarial networks for medical image synthesis.基于潜在去噪扩散概率模型和生成对抗网络的医学图像合成的多模态比较。
Sci Rep. 2023 Jul 26;13(1):12098. doi: 10.1038/s41598-023-39278-0.
6
Fast MRI Reconstruction: How Powerful Transformers Are?快速磁共振成像重建:Transformer 有多强大?
Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:2066-2070. doi: 10.1109/EMBC48229.2022.9871475.
7
Generative Adversarial Networks and Other Generative Models生成对抗网络及其他生成模型
8
Generative adversarial network based synthetic data training model for lightweight convolutional neural networks.用于轻量级卷积神经网络的基于生成对抗网络的合成数据训练模型。
Multimed Tools Appl. 2023 May 20:1-23. doi: 10.1007/s11042-023-15747-6.
9
CosSIF: Cosine similarity-based image filtering to overcome low inter-class variation in synthetic medical image datasets.基于余弦相似度的图像滤波,以克服合成医学图像数据集类内变化低的问题。
Comput Biol Med. 2024 Apr;172:108317. doi: 10.1016/j.compbiomed.2024.108317. Epub 2024 Mar 16.
10
GANs for Medical Image Synthesis: An Empirical Study.用于医学图像合成的生成对抗网络:一项实证研究。
J Imaging. 2023 Mar 16;9(3):69. doi: 10.3390/jimaging9030069.

引用本文的文献

1
Feature Extraction for Generative Medical Imaging Evaluation: New Evidence Against an Evolving Trend.用于生成式医学成像评估的特征提取:反对一种不断演变趋势的新证据。
Med Image Comput Comput Assist Interv. 2024 Oct;15012:87-97. doi: 10.1007/978-3-031-72390-2_9. Epub 2024 Oct 23.
2
Expert-guided StyleGAN2 image generation elevates AI diagnostic accuracy for maxillary sinus lesions.专家指导的StyleGAN2图像生成提高了上颌窦病变的人工智能诊断准确性。
Commun Med (Lond). 2025 May 20;5(1):185. doi: 10.1038/s43856-025-00907-6.
3
Multimodal histopathologic models stratify hormone receptor-positive early breast cancer.
多模态组织病理学模型对激素受体阳性早期乳腺癌进行分层。
Nat Commun. 2025 Mar 2;16(1):2106. doi: 10.1038/s41467-025-57283-x.
4
FQ-UWF: Unpaired Generative Image Enhancement for Fundus Quality Ultra-Widefield Retinal Images.FQ-UWF:用于眼底高质量超广角视网膜图像的非配对生成式图像增强
Bioengineering (Basel). 2024 Jun 4;11(6):568. doi: 10.3390/bioengineering11060568.
5
: a Python library of pretrained generative models for medical image synthesis.用于医学图像合成的预训练生成模型的Python库。
J Med Imaging (Bellingham). 2023 Nov;10(6):061403. doi: 10.1117/1.JMI.10.6.061403. Epub 2023 Feb 20.