On Data Augmentation for GAN Training

Authors

Tran Ngoc-Trung, Tran Viet-Hung, Nguyen Ngoc-Bao, Nguyen Trung-Kien, Cheung Ngai-Man

Publication information

IEEE Trans Image Process. 2021;30:1882-1897. doi: 10.1109/TIP.2021.3049346. Epub 2021 Jan 20.

Abstract

Recent successes in Generative Adversarial Networks (GAN) have affirmed the importance of using more data in GAN training. Yet it is expensive to collect data in many domains such as medical applications. Data Augmentation (DA) has been applied in these applications. In this work, we first argue that the classical DA approach could mislead the generator to learn the distribution of the augmented data, which could be different from that of the original data. We then propose a principled framework, termed Data Augmentation Optimized for GAN (DAG), to enable the use of augmented data in GAN training to improve the learning of the original distribution. We provide theoretical analysis to show that using our proposed DAG aligns with the original GAN in minimizing the Jensen-Shannon (JS) divergence between the original distribution and model distribution. Importantly, the proposed DAG effectively leverages the augmented data to improve the learning of discriminator and generator. We conduct experiments to apply DAG to different GAN models: unconditional GAN, conditional GAN, self-supervised GAN and CycleGAN using datasets of natural images and medical images. The results show that DAG achieves consistent and considerable improvements across these models. Furthermore, when DAG is used in some GAN models, the system establishes state-of-the-art Fréchet Inception Distance (FID) scores. Our code is available (https://github.com/tntrung/dag-gans).
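The two claims in the abstract (that pooling augmented data can shift the learned distribution, and that augmented data can still be used in a way that targets the original JS divergence) can be illustrated on a toy discrete example. The sketch below is not the authors' DAG implementation. It assumes a four-outcome distribution and uses a permutation of outcomes as a stand-in for an invertible image transformation such as a rotation; the names `js_divergence`, `transform`, `p_data`, and `p_model` are hypothetical. It relies on the standard fact that JS divergence is unchanged when both distributions are pushed through the same invertible map; treating this as the relevant property here is an assumption of the sketch, not something the abstract states.

```python
import math


def js_divergence(p, q):
    """Jensen-Shannon divergence (natural log) between two discrete distributions."""
    def kl(a, b):
        # Terms with a[i] == 0 contribute nothing to the KL sum.
        return sum(x * math.log(x / y) for x, y in zip(a, b) if x > 0)

    m = [(x + y) / 2 for x, y in zip(p, q)]
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def transform(p, perm):
    """Push a distribution through an invertible map on outcomes (a permutation)."""
    out = [0.0] * len(p)
    for i, j in enumerate(perm):
        out[j] = p[i]
    return out


# Toy "original data" and "model" distributions (illustrative values only).
p_data = [0.7, 0.2, 0.1, 0.0]
p_model = [0.4, 0.3, 0.2, 0.1]
perm = [1, 2, 3, 0]  # hypothetical invertible augmentation

# Apply the same augmentation to both the data and the model samples.
p_data_t = transform(p_data, perm)
p_model_t = transform(p_model, perm)

js_original = js_divergence(p_data, p_model)
js_transformed = js_divergence(p_data_t, p_model_t)

# Classical DA: pool original and augmented data into a single training set.
p_pooled = [(a + b) / 2 for a, b in zip(p_data, p_data_t)]
js_pooled_vs_data = js_divergence(p_pooled, p_data)

print(f"JS(data, model)           = {js_original:.6f}")
print(f"JS(T(data), T(model))     = {js_transformed:.6f}")
print(f"JS(pooled data, original) = {js_pooled_vs_data:.6f}")
```

In this toy setting the first two values coincide, while the third is strictly positive: a generator fitted to the pooled data would be matching a distribution that differs from the original one, which is the failure mode the abstract attributes to classical DA.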
