Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China; Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing 100124, China; Engineering Research Center of Digital Community Ministry of Education, Beijing University of Technology, Beijing 100124, China; Beijing Artificial Intelligence Institute and Beijing Laboratory for Intelligent Environmental Protection, Beijing 100124, China.
Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China; Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing 100124, China.
Neural Netw. 2023 Oct;167:10-21. doi: 10.1016/j.neunet.2023.07.032. Epub 2023 Aug 10.
Convolutional neural networks (CNNs) have successfully driven many visual recognition tasks including image classification. However, when dealing with classification tasks with intra-class sample style diversity, the network tends to be disturbed by more diverse features, resulting in limited feature learning. In this article, a spatial oblivion channel attention (SOCA) for intra-class diversity feature learning is proposed. Specifically, SOCA performs spatial structure oblivion in a progressive regularization for each channel after convolution, so that the network is not restricted to a limited feature learning, and pays attention to more regionally detailed features. Further, SOCA reassigns channel weights in the progressively oblivious feature space from top to bottom along the channel direction, to ensure the network learns more image details in an orderly manner while not falling into feature redundancy. Experiments are conducted on the standard classification dataset CIFAR-10/100 and two garbage datasets with intra-class diverse styles. SOCA improves SqueezeNet, MobileNet, BN-VGG-19, Inception and ResNet-50 in classification accuracy by 1.31%, 1.18%, 1.57%, 2.09% and 2.27% on average, respectively. The feasibility and effectiveness of intra-class diversity feature learning in SOCA-enhanced networks are verified. Besides, the class activation map shows that more local detail feature regions are activated by adding the SOCA module, which also demonstrates the interpretability of the method for intra-class diversity feature learning.
卷积神经网络(CNNs)已成功推动了许多视觉识别任务,包括图像分类。然而,当处理具有类内样本样式多样性的分类任务时,网络往往会受到更多样化特征的干扰,从而导致特征学习受限。在本文中,提出了一种用于类内多样性特征学习的空间遗忘通道注意力(SOCA)。具体来说,SOCA 在卷积后对每个通道进行渐进正则化的空间结构遗忘,从而使网络不受限于有限的特征学习,并关注更多区域详细的特征。此外,SOCA 沿着通道方向从顶部到底部重新分配渐进遗忘特征空间中的通道权重,以确保网络在有序地学习更多图像细节的同时不会陷入特征冗余。在标准分类数据集 CIFAR-10/100 和两个具有类内多样风格的垃圾数据集上进行了实验。SOCA 分别将 SqueezeNet、MobileNet、BN-VGG-19、Inception 和 ResNet-50 的分类精度提高了 1.31%、1.18%、1.57%、2.09%和 2.27%。验证了 SOCA 增强网络中类内多样性特征学习的可行性和有效性。此外,类激活图显示,通过添加 SOCA 模块,更多的局部细节特征区域被激活,这也证明了该方法对于类内多样性特征学习的可解释性。