College of Computer Science and Software Engineering, Shenzhen University, Nanhai Ave 3688, Shenzhen, Guangdong 518060, P. R. China.
Int J Neural Syst. 2023 Mar;33(3):2350013. doi: 10.1142/S0129065723500132. Epub 2023 Feb 25.
How to obtain discriminative features has proved to be a core problem for image retrieval. Many recent works use convolutional neural networks to extract features. However, clutter and occlusion will interfere with the distinguishability of features when using convolutional neural network (CNN) for feature extraction. To address this problem, we intend to obtain high-response activations in the feature map based on the attention mechanism. We propose two attention modules, a spatial attention module and a channel attention module. For the spatial attention module, we first capture the global information and model the relation between channels as a region evaluator, which evaluates and assigns new weights to local features. For the channel attention module, we use a vector with trainable parameters to weight the importance of each feature map. The two attention modules are cascaded to adjust the weight distribution for the feature map, which makes the extracted features more discriminative. Furthermore, we present a scale and mask scheme to scale the major components and filter out the meaningless local features. This scheme can reduce the disadvantages of the various scales of the major components in images by applying multiple scale filters, and filter out the redundant features with the . Exhaustive experiments demonstrate that the two attention modules are complementary to improve performance, and our network with the three modules outperforms the state-of-the-art methods on four well-known image retrieval datasets.
如何获取判别特征已被证明是图像检索的核心问题。许多最近的工作使用卷积神经网络来提取特征。然而,在使用卷积神经网络 (CNN) 进行特征提取时,杂波和遮挡会干扰特征的可区分性。为了解决这个问题,我们意图基于注意力机制在特征图中获得高响应激活。我们提出了两个注意力模块,一个是空间注意力模块,另一个是通道注意力模块。对于空间注意力模块,我们首先捕获全局信息,并将通道之间的关系建模为区域评估器,该评估器评估并为局部特征分配新的权重。对于通道注意力模块,我们使用带有可训练参数的向量为每个特征图的重要性加权。这两个注意力模块级联在一起,以调整特征图的权重分布,从而使提取的特征更具判别性。此外,我们提出了一种尺度和掩模方案,以缩放主要成分并过滤掉无意义的局部特征。该方案通过应用多个尺度滤波器来减少图像中主要成分的各种尺度的缺点,并通过. 过滤掉冗余特征。详尽的实验表明,这两个注意力模块相互补充以提高性能,我们的带有三个模块的网络在四个著名的图像检索数据集上的表现优于最先进的方法。