College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China.
Comput Intell Neurosci. 2020 Aug 1;2020:1459107. doi: 10.1155/2020/1459107. eCollection 2020.
Computer vision is one of the hottest research fields in deep learning. The emergence of generative adversarial networks (GANs) provides a new method and model for computer vision. The idea of GANs using the game training method is superior to traditional machine learning algorithms in terms of feature learning and image generation. GANs are widely used not only in image generation and style transfer but also in the text, voice, video processing, and other fields. However, there are still some problems with GANs, such as model collapse and uncontrollable training. This paper deeply reviews the theoretical basis of GANs and surveys some recently developed GAN models, in comparison with traditional GAN models. The applications of GANs in computer vision include data enhancement, domain transfer, high-quality sample generation, and image restoration. The latest research progress of GANs in artificial intelligence (AI) based security attack and defense is introduced. The future development of GANs in computer vision is also discussed at the end of the paper with possible applications of AI in computer vision.
计算机视觉是深度学习中最热门的研究领域之一。生成对抗网络(GAN)的出现为计算机视觉提供了一种新的方法和模型。GAN 采用博弈训练方法的思想在特征学习和图像生成方面优于传统的机器学习算法。GAN 不仅在图像生成和风格转换方面得到了广泛应用,而且在文本、语音、视频处理等领域也得到了广泛应用。然而,GAN 仍然存在一些问题,例如模型崩溃和不可控训练。本文深入回顾了 GAN 的理论基础,并对一些最近开发的 GAN 模型进行了调查,与传统的 GAN 模型进行了比较。GAN 在计算机视觉中的应用包括数据增强、域转移、高质量样本生成和图像恢复。介绍了 GAN 在人工智能(AI)基于安全攻击和防御方面的最新研究进展。本文最后还讨论了 GAN 在计算机视觉中的未来发展,并探讨了 AI 在计算机视觉中的可能应用。