关于计算机视觉任务中不平衡问题的生成对抗网络调查。

A survey on generative adversarial networks for imbalance problems in computer vision tasks.

作者信息

Sampath Vignesh, Maurtua Iñaki, Aguilar Martín Juan José, Gutierrez Aitor

机构信息

Autonomous and Intelligent Systems Unit, Tekniker, Member of Basque Research and Technology Alliance, Eibar, Spain.

Design and Manufacturing Engineering Department, Universidad de Zaragoza, 3 María de Luna Street, Torres Quevedo Bld, 50018 Zaragoza, Spain.

出版信息

J Big Data. 2021;8(1):27. doi: 10.1186/s40537-021-00414-0. Epub 2021 Jan 29.

DOI:10.1186/s40537-021-00414-0

PMID:33552840

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7845583/

Abstract

Any computer vision application development starts off by acquiring images and data, then preprocessing and pattern recognition steps to perform a task. When the acquired images are highly imbalanced and not adequate, the desired task may not be achievable. Unfortunately, the occurrence of imbalance problems in acquired image datasets in certain complex real-world problems such as anomaly detection, emotion recognition, medical image analysis, fraud detection, metallic surface defect detection, disaster prediction, etc., are inevitable. The performance of computer vision algorithms can significantly deteriorate when the training dataset is imbalanced. In recent years, Generative Adversarial Neural Networks (GANs) have gained immense attention by researchers across a variety of application domains due to their capability to model complex real-world image data. It is particularly important that GANs can not only be used to generate synthetic images, but also its fascinating adversarial learning idea showed good potential in restoring balance in imbalanced datasets. In this paper, we examine the most recent developments of GANs based techniques for addressing imbalance problems in image data. The real-world challenges and implementations of synthetic image generation based on GANs are extensively covered in this survey. Our survey first introduces various imbalance problems in computer vision tasks and its existing solutions, and then examines key concepts such as deep generative image models and GANs. After that, we propose a taxonomy to summarize GANs based techniques for addressing imbalance problems in computer vision tasks into three major categories: 1. Image level imbalances in classification, 2. object level imbalances in object detection and 3. pixel level imbalances in segmentation tasks. We elaborate the imbalance problems of each group, and provide GANs based solutions in each group. Readers will understand how GANs based techniques can handle the problem of imbalances and boost performance of the computer vision algorithms.

摘要

任何计算机视觉应用程序开发都是从获取图像和数据开始，然后进行预处理和模式识别步骤以执行任务。当获取的图像高度不平衡且不充分时，可能无法实现所需的任务。不幸的是，在某些复杂的现实世界问题（如异常检测、情感识别、医学图像分析、欺诈检测、金属表面缺陷检测、灾难预测等）中，获取的图像数据集中不可避免地会出现不平衡问题。当训练数据集不平衡时，计算机视觉算法的性能可能会显著下降。近年来，生成对抗神经网络（GAN）因其能够对复杂的现实世界图像数据进行建模而受到各个应用领域研究人员的广泛关注。特别重要的是，GAN不仅可以用于生成合成图像，而且其引人入胜的对抗学习思想在恢复不平衡数据集中的平衡方面显示出良好的潜力。在本文中，我们研究了基于GAN的技术在解决图像数据不平衡问题方面的最新进展。本综述广泛涵盖了基于GAN的合成图像生成的现实世界挑战和实现。我们的综述首先介绍了计算机视觉任务中的各种不平衡问题及其现有解决方案，然后研究了诸如深度生成图像模型和GAN等关键概念。之后，我们提出了一种分类法，将基于GAN的解决计算机视觉任务中不平衡问题的技术总结为三大类：1. 分类中的图像级不平衡，2. 目标检测中的目标级不平衡，3. 分割任务中的像素级不平衡。我们详细阐述了每组的不平衡问题，并在每组中提供了基于GAN的解决方案。读者将了解基于GAN的技术如何处理不平衡问题并提高计算机视觉算法的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dec8/7845583/22121a3267c5/40537_2021_414_Fig1_HTML.jpg

相似文献

A survey on generative adversarial networks for imbalance problems in computer vision tasks.关于计算机视觉任务中不平衡问题的生成对抗网络调查。

J Big Data. 2021;8(1):27. doi: 10.1186/s40537-021-00414-0. Epub 2021 Jan 29.

Systematic Review of Generative Adversarial Networks (GANs) for Medical Image Classification and Segmentation.生成对抗网络（GANs）在医学图像分类和分割中的系统评价。

J Digit Imaging. 2022 Apr;35(2):137-152. doi: 10.1007/s10278-021-00556-w. Epub 2022 Jan 12.

Generative Adversarial Networks in Digital Histopathology: Current Applications, Limitations, Ethical Considerations, and Future Directions.生成对抗网络在数字病理中的应用：当前应用、局限性、伦理考虑和未来方向。

Mod Pathol. 2024 Jan;37(1):100369. doi: 10.1016/j.modpat.2023.100369. Epub 2023 Oct 27.

Generative adversarial networks and its applications in the biomedical image segmentation: a comprehensive survey.生成对抗网络及其在生物医学图像分割中的应用：全面综述。

Int J Multimed Inf Retr. 2022;11(3):333-368. doi: 10.1007/s13735-022-00240-x. Epub 2022 Jul 8.

Generative Adversarial Network Technologies and Applications in Computer Vision.生成对抗网络技术及其在计算机视觉中的应用。

Comput Intell Neurosci. 2020 Aug 1;2020:1459107. doi: 10.1155/2020/1459107. eCollection 2020.

Generative adversarial network based adaptive data augmentation for handwritten Arabic text recognition.基于生成对抗网络的自适应数据增强用于手写阿拉伯文本识别。

PeerJ Comput Sci. 2022 Jan 25;8:e861. doi: 10.7717/peerj-cs.861. eCollection 2022.

Generative Adversarial Networks and Other Generative Models生成对抗网络及其他生成模型

Generative Adversarial Networks in Brain Imaging: A Narrative Review.脑成像中的生成对抗网络：一篇综述

J Imaging. 2022 Mar 23;8(4):83. doi: 10.3390/jimaging8040083.

A Generative Neighborhood-Based Deep Autoencoder for Robust Imbalanced Classification.一种基于生成邻域的深度自动编码器用于稳健的不平衡分类。

IEEE Trans Artif Intell. 2024 Jan;5(1):80-91. doi: 10.1109/TAI.2023.3249685. Epub 2023 Feb 27.

Insights and Considerations in Development and Performance Evaluation of Generative Adversarial Networks (GANs): What Radiologists Need to Know.生成对抗网络（GANs）开发与性能评估中的见解与思考：放射科医生需要了解的内容。

Diagnostics (Basel). 2024 Aug 13;14(16):1756. doi: 10.3390/diagnostics14161756.

引用本文的文献

Improving CNN predictive accuracy in COVID-19 health analytics.提高新冠疫情健康分析中卷积神经网络的预测准确性。

Sci Rep. 2025 Aug 14;15(1):29864. doi: 10.1038/s41598-025-15218-y.

A novel facial expression recognition framework using deep learning based dynamic cross-domain dual attention network.一种基于深度学习的动态跨域双注意力网络的新型面部表情识别框架。

PeerJ Comput Sci. 2025 May 9;11:e2866. doi: 10.7717/peerj-cs.2866. eCollection 2025.

Artificial Intelligence and Internet of Things Integration in Pharmaceutical Manufacturing: A Smart Synergy.制药制造中的人工智能与物联网集成：一种智能协同效应。

Pharmaceutics. 2025 Feb 22;17(3):290. doi: 10.3390/pharmaceutics17030290.

Two-stage augmentation for detecting malignancy of BI-RADS 3 lesions in early breast cancer.两阶段增强法用于检测早期乳腺癌中BI-RADS 3类病变的恶性情况。

BMC Cancer. 2025 Mar 24;25(1):537. doi: 10.1186/s12885-025-13960-0.

Embedding-based pair generation for contrastive representation learning in audio-visual surveillance data.用于视听监控数据中对比表示学习的基于嵌入的对生成

Front Robot AI. 2025 Jan 13;11:1490718. doi: 10.3389/frobt.2024.1490718. eCollection 2024.

Challenges in data-driven geospatial modeling for environmental research and practice.环境研究与实践中数据驱动的地理空间建模面临的挑战。

Nat Commun. 2024 Dec 19;15(1):10700. doi: 10.1038/s41467-024-55240-8.

Ensemble feature selection and tabular data augmentation with generative adversarial networks to enhance cutaneous melanoma identification and interpretability.利用生成对抗网络进行集成特征选择和表格数据增强，以提高皮肤黑色素瘤的识别和可解释性。

BioData Min. 2024 Oct 30;17(1):46. doi: 10.1186/s13040-024-00397-7.

MC-ViViT: Multi-branch Classifier-ViViT to Detect Mild Cognitive Impairment in Older Adults Using Facial Videos.MC-ViViT：用于通过面部视频检测老年人轻度认知障碍的多分支分类器-ViViT

Expert Syst Appl. 2024 Mar 15;238(Pt B). doi: 10.1016/j.eswa.2023.121929. Epub 2023 Oct 4.

Innovative entrepreneurial market trend prediction model based on deep learning: Case study and performance evaluation.基于深度学习的创新型创业市场趋势预测模型：案例研究与绩效评估。

Sci Prog. 2024 Jul-Sep;107(3):368504241272722. doi: 10.1177/00368504241272722.

Optimizing compressive strength prediction using adversarial learning and hybrid regularization.使用对抗学习和混合正则化优化抗压强度预测。

Sci Rep. 2024 Aug 7;14(1):18338. doi: 10.1038/s41598-024-69434-z.

本文引用的文献

Text Data Augmentation for Deep Learning.用于深度学习的文本数据增强

J Big Data. 2021;8(1):101. doi: 10.1186/s40537-021-00492-0. Epub 2021 Jul 19.

CovidGAN: Data Augmentation Using Auxiliary Classifier GAN for Improved Covid-19 Detection.CovidGAN：使用辅助分类器生成对抗网络进行数据增强以改进新冠病毒检测

IEEE Access. 2020 May 14;8:91916-91923. doi: 10.1109/ACCESS.2020.2994762. eCollection 2020.

Improving breast mass classification by shared data with domain transformation using a generative adversarial network.利用生成对抗网络通过域变换共享数据来改进乳腺肿块分类

Comput Biol Med. 2020 Apr;119:103698. doi: 10.1016/j.compbiomed.2020.103698. Epub 2020 Mar 10.

Generative adversarial networks with decoder-encoder output noises.生成对抗网络与解码器编码器输出噪声。

Neural Netw. 2020 Jul;127:19-28. doi: 10.1016/j.neunet.2020.04.005. Epub 2020 Apr 9.

On the localness modeling for the self-attention based end-to-end speech synthesis.基于自注意力的端到端语音合成的局部建模。

Neural Netw. 2020 May;125:121-130. doi: 10.1016/j.neunet.2020.01.034. Epub 2020 Feb 11.

Skin Lesion Classification Using GAN based Data Augmentation.基于生成对抗网络（GAN）的数据增强的皮肤病变分类

Annu Int Conf IEEE Eng Med Biol Soc. 2019 Jul;2019:916-919. doi: 10.1109/EMBC.2019.8857905.

AttGAN: Facial Attribute Editing by Only Changing What You Want.AttGAN：仅通过改变你想要改变的内容来进行面部属性编辑。

IEEE Trans Image Process. 2019 Nov;28(11):5464-5478. doi: 10.1109/TIP.2019.2916751. Epub 2019 May 20.

Breast cancer detection using synthetic mammograms from generative adversarial networks in convolutional neural networks.利用卷积神经网络中生成对抗网络的合成乳房X光片进行乳腺癌检测。

J Med Imaging (Bellingham). 2019 Jul;6(3):031411. doi: 10.1117/1.JMI.6.3.031411. Epub 2019 Mar 23.

On the Effectiveness of Least Squares Generative Adversarial Networks.最小二乘生成对抗网络的有效性。

IEEE Trans Pattern Anal Mach Intell. 2019 Dec;41(12):2947-2960. doi: 10.1109/TPAMI.2018.2872043. Epub 2018 Sep 24.

The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions.HAM10000 数据集，一个大型的常见色素性皮肤病变多源皮肤镜图像集合。

Sci Data. 2018 Aug 14;5:180161. doi: 10.1038/sdata.2018.161.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

关于计算机视觉任务中不平衡问题的生成对抗网络调查。

A survey on generative adversarial networks for imbalance problems in computer vision tasks.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献