School of Optoelectronics, Beijing Institute of Technology, Beijing, China; School of Information Technology, Beijing Institute of Technology, Zhuhai, China.
School of Intelligent Systems Science and Engineering, Jinan University, Zhuhai, China.
Neural Netw. 2022 Aug;152:487-498. doi: 10.1016/j.neunet.2022.05.014. Epub 2022 May 21.
Recently, with the rapid development of artificial intelligence, image generation based on deep learning has advanced significantly. Image generation based on Generative Adversarial Networks (GANs) is a promising study. However, because convolutions are limited by spatial-agnostic and channel-specific, features extracted by conventional GANs based on convolution are constrained. Therefore, GANs cannot capture in-depth details per image. Moreover, straightforwardly stacking of convolutions causes too many parameters and layers in GANs, yielding a high overfitting risk. To overcome the abovementioned limitations, in this study, we propose a GANs called GIU-GANs (where Global Information Utilization: GIU). GIU-GANs leverages a new module called the GIU module, which integrates the squeeze-and-excitation module and involution to focus on global information via the channel attention mechanism, enhancing the generated image quality. Moreover, Batch Normalization (BN) inevitably ignores the representation differences among noise sampled by the generator and thus degrades the generated image quality. Thus, we introduce the representative BN to the GANs' architecture. The CIFAR-10 and CelebA datasets are employed to demonstrate the effectiveness of the proposed model. Numerous experiments indicate that the proposed model achieves state-of-the-art performance.
最近,随着人工智能的飞速发展,基于深度学习的图像生成技术取得了显著进展。基于生成对抗网络(GANs)的图像生成是一项很有前途的研究。然而,由于卷积受到空间不可知和通道特定的限制,基于卷积的传统 GANs 提取的特征受到限制。因此,GANs 无法捕获图像的深度细节。此外,卷积的直接堆叠会导致 GANs 中的参数和层数过多,从而带来较高的过拟合风险。为了克服上述限制,在本研究中,我们提出了一种称为 GIU-GANs(其中 Global Information Utilization:GIU)的 GANs。GIU-GANs 利用了一个名为 GIU 模块的新模块,该模块通过通道注意力机制整合了 squeeze-and-excitation 模块和 involution,以关注全局信息,从而提高生成图像的质量。此外,批量归一化(BN)不可避免地忽略了生成器采样的噪声之间的表示差异,从而降低了生成图像的质量。因此,我们将代表性的 BN 引入 GANs 架构中。我们使用 CIFAR-10 和 CelebA 数据集来验证所提出模型的有效性。大量实验表明,所提出的模型取得了最先进的性能。