• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

《SuperstarGAN:大规模域图像到图像转换的生成对抗网络》。

SuperstarGAN: Generative adversarial networks for image-to-image translation in large-scale domains.

机构信息

School of Electrical and Electronics Engineering, Chung-Ang University, 84 Heukseok-ro, Dongjak-gu, Seoul 06974, South Korea.

School of Mechanical Engineering, Chung-Ang University, 84 Heukseok-ro, Dongjak-gu, Seoul 06974, South Korea.

出版信息

Neural Netw. 2023 May;162:330-339. doi: 10.1016/j.neunet.2023.02.042. Epub 2023 Mar 7.

DOI:10.1016/j.neunet.2023.02.042
PMID:36940493
Abstract

Image-to-image translation with generative adversarial networks (GANs) has been extensively studied in recent years. Among the models, StarGAN has achieved image-to-image translation for multiple domains with a single generator, whereas conventional models require multiple generators. However, StarGAN has several limitations, including the lack of capacity to learn mappings among large-scale domains; furthermore, StarGAN can barely express small feature changes. To address the limitations, we propose an improved StarGAN, namely SuperstarGAN. We adopted the idea, first proposed in controllable GAN (ControlGAN), of training an independent classifier with the data augmentation techniques to handle the overfitting problem in the classification of StarGAN structures. Since the generator with a well-trained classifier can express small features belonging to the target domain, SuperstarGAN achieves image-to-image translation in large-scale domains. Evaluated with a face image dataset, SuperstarGAN demonstrated improved performance in terms of Fréchet Inception distance (FID) and learned perceptual image patch similarity (LPIPS). Specifically, compared to StarGAN, SuperstarGAN exhibited decreased FID and LPIPS by 18.1% and 42.5%, respectively. Furthermore, we conducted an additional experiment with interpolated and extrapolated label values, indicating the ability of SuperstarGAN to control the degree of expression of the target domain features in generated images. Additionally, SuperstarGAN was successfully adapted to an animal face dataset and a painting dataset, where it can translate styles of animal faces (i.e., a cat to a tiger) and styles of painters (i.e., Hassam to Picasso), respectively, which explains the generality of SuperstarGAN regardless of datasets.

摘要

近年来,基于生成对抗网络(GAN)的图像到图像翻译得到了广泛研究。在这些模型中,StarGAN 仅用一个生成器就实现了多个领域的图像到图像翻译,而传统模型则需要多个生成器。然而,StarGAN 存在一些局限性,包括缺乏学习大规模领域之间映射的能力;此外,StarGAN 几乎无法表达小的特征变化。为了解决这些局限性,我们提出了一种改进的 StarGAN,即 SuperstarGAN。我们采用了在可控 GAN(ControlGAN)中首次提出的思想,即使用数据增强技术训练独立的分类器,以解决 StarGAN 结构分类中的过拟合问题。由于具有经过良好训练的分类器的生成器可以表达属于目标域的小特征,因此 SuperstarGAN 可以在大规模领域中实现图像到图像的翻译。在人脸图像数据集上进行评估时,SuperstarGAN 在 Fréchet Inception 距离(FID)和学习感知图像补丁相似度(LPIPS)方面表现出了改进的性能。具体而言,与 StarGAN 相比,SuperstarGAN 的 FID 和 LPIPS 分别降低了 18.1%和 42.5%。此外,我们还进行了带有插值和外推标签值的额外实验,表明了 SuperstarGAN 控制生成图像中目标域特征表达程度的能力。此外,SuperstarGAN 成功地应用于动物脸数据集和绘画数据集,它可以分别翻译动物脸的风格(即猫到老虎)和画家的风格(即哈萨姆到毕加索),这说明了 SuperstarGAN 的通用性,无论数据集如何。

相似文献

1
SuperstarGAN: Generative adversarial networks for image-to-image translation in large-scale domains.《SuperstarGAN:大规模域图像到图像转换的生成对抗网络》。
Neural Netw. 2023 May;162:330-339. doi: 10.1016/j.neunet.2023.02.042. Epub 2023 Mar 7.
2
The role of unpaired image-to-image translation for stain color normalization in colorectal cancer histology classification.非配对图像到图像翻译在结直肠癌组织学分类中用于染色颜色归一化的作用。
Comput Methods Programs Biomed. 2023 Jun;234:107511. doi: 10.1016/j.cmpb.2023.107511. Epub 2023 Mar 26.
3
GD-StarGAN: Multi-domain image-to-image translation in garment design.GD-StarGAN:服装设计中的多领域图像到图像转换。
PLoS One. 2020 Apr 21;15(4):e0231719. doi: 10.1371/journal.pone.0231719. eCollection 2020.
4
A Method based on Evolutionary Algorithms and Channel Attention Mechanism to Enhance Cycle Generative Adversarial Network Performance for Image Translation.基于进化算法和通道注意力机制的方法来提高用于图像翻译的循环生成对抗网络性能。
Int J Neural Syst. 2023 May;33(5):2350026. doi: 10.1142/S0129065723500260. Epub 2023 Apr 5.
5
Multi-domain medical image translation generation for lung image classification based on generative adversarial networks.基于生成对抗网络的肺部图像分类的多领域医学图像翻译生成。
Comput Methods Programs Biomed. 2023 Feb;229:107200. doi: 10.1016/j.cmpb.2022.107200. Epub 2022 Nov 2.
6
Small facial image dataset augmentation using conditional GANs based on incomplete edge feature input.基于不完整边缘特征输入的条件生成对抗网络用于小面部图像数据集增强
PeerJ Comput Sci. 2021 Nov 17;7:e760. doi: 10.7717/peerj-cs.760. eCollection 2021.
7
Exploring Explicit Domain Supervision for Latent Space Disentanglement in Unpaired Image-to-Image Translation.探索用于无配对图像到图像翻译中潜在空间解缠的显式域监督
IEEE Trans Pattern Anal Mach Intell. 2021 Apr;43(4):1254-1266. doi: 10.1109/TPAMI.2019.2950198. Epub 2021 Mar 5.
8
Robust Data Augmentation Generative Adversarial Network for Object Detection.用于目标检测的鲁棒数据增强生成对抗网络。
Sensors (Basel). 2022 Dec 23;23(1):157. doi: 10.3390/s23010157.
9
A GAN-based image synthesis method for skin lesion classification.一种基于生成对抗网络的用于皮肤病变分类的图像合成方法。
Comput Methods Programs Biomed. 2020 Oct;195:105568. doi: 10.1016/j.cmpb.2020.105568. Epub 2020 May 29.
10
Generative adversarial networks with decoder-encoder output noises.生成对抗网络与解码器编码器输出噪声。
Neural Netw. 2020 Jul;127:19-28. doi: 10.1016/j.neunet.2020.04.005. Epub 2020 Apr 9.

引用本文的文献

1
Synthetic CT generation from CBCT and MRI using StarGAN in the Pelvic Region.利用StarGAN在盆腔区域从CBCT和MRI生成合成CT
Radiat Oncol. 2025 Feb 4;20(1):18. doi: 10.1186/s13014-025-02590-2.
2
Deep learning for the harmonization of structural MRI scans: a survey.深度学习在结构磁共振成像扫描配准中的应用:综述。
Biomed Eng Online. 2024 Aug 31;23(1):90. doi: 10.1186/s12938-024-01280-6.
3
ctGAN: combined transformation of gene expression and survival data with generative adversarial network.ctGAN:利用生成对抗网络对基因表达和生存数据进行联合变换。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae325.
4
Deep Learning Techniques with Genomic Data in Cancer Prognosis: A Comprehensive Review of the 2021-2023 Literature.癌症预后中基因组数据的深度学习技术:2021 - 2023年文献综述
Biology (Basel). 2023 Jun 21;12(7):893. doi: 10.3390/biology12070893.
5
Deep Learning Approaches for lncRNA-Mediated Mechanisms: A Comprehensive Review of Recent Developments.深度学习方法在 lncRNA 介导的机制研究中的应用:最新进展的综合评述。
Int J Mol Sci. 2023 Jun 18;24(12):10299. doi: 10.3390/ijms241210299.