Ma Tengda, Sun Ke, Pang Xiyu, Si Wei, Liu Tongxin, Wang Cheng
School of Information Science and Electrical Engineering, Shandong Jiaotong University, Jinan, 250357, China.
School of Computer and Artificial Intelligence, Beijing Technology and Business University, Beijing, 100048, China.
Sci Rep. 2025 Aug 20;15(1):30541. doi: 10.1038/s41598-025-15854-4.
Vehicle re-identification (Re-ID) has become a challenging retrieval task due to the high inter-class similarity and low intra-class similarity among vehicles. To address this challenge, the self-attention mechanism has been extensively studied and applied, demonstrating its effectiveness in capturing long-range dependencies in vehicle Re-ID. Traditional spatial self-attention and channel self-attention assign different weights to each node (position/channel) based on pairwise dependencies at a global scale to model long-term dependencies, but this approach is not only computationally complex but also unable to fully mine refined features. In this paper, we propose a vehicle Re-ID network design based on a multi-axis compression fusion (MCF) attention mechanism. The MCF attention mechanism preserves feature information on different axes through compression operations while maintaining high computational efficiency. It utilizes single-axis self-attention calculations to update the weights and strengthens the regions of common interest across multiple axes by fusing information from multiple axes, thereby enhancing the effect of attention learning. On the basis of this mechanism, we propose a multi-axis compression fusion network (MCF-Net), which combines the spatial multi-axis compression fusion (S-MCF) module and the channel multi-axis compression fusion (C-MCF) module, and uses a rigid partitioning strategy to capture both global and fine-grained features. Experiments show that MCF-Net achieves state-of-the-art performance on the vehicle Re-ID datasets VeRi-776 and VehicleID.
由于车辆之间的类间相似度高且类内相似度低,车辆重新识别(Re-ID)已成为一项具有挑战性的检索任务。为应对这一挑战,自注意力机制已得到广泛研究和应用,证明了其在捕获车辆Re-ID中的长程依赖关系方面的有效性。传统的空间自注意力和通道自注意力基于全局尺度上的成对依赖关系为每个节点(位置/通道)分配不同权重,以对长期依赖关系进行建模,但这种方法不仅计算复杂,而且无法充分挖掘精细特征。在本文中,我们提出了一种基于多轴压缩融合(MCF)注意力机制的车辆Re-ID网络设计。MCF注意力机制通过压缩操作在不同轴上保留特征信息,同时保持较高的计算效率。它利用单轴自注意力计算来更新权重,并通过融合来自多个轴的信息来增强跨多个轴的共同关注区域,从而增强注意力学习的效果。在此机制的基础上,我们提出了一种多轴压缩融合网络(MCF-Net),它结合了空间多轴压缩融合(S-MCF)模块和通道多轴压缩融合(C-MCF)模块,并采用刚性划分策略来捕获全局和细粒度特征。实验表明,MCF-Net在车辆Re-ID数据集VeRi-776和VehicleID上取得了领先的性能。