• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TE-TransReID:通过局部特征嵌入和轻量级Transformer实现高效行人重识别

TE-TransReID: Towards Efficient Person Re-Identification via Local Feature Embedding and Lightweight Transformer.

作者信息

Zhang Xiaoyu, Cai Rui, Jiang Ning, Xing Minwen, Xu Ke, Yang Huicheng, Zhu Wenbo, Hu Yaocong

机构信息

School of Electrical Engineering, Anhui Polytechnic University, Beijing Road No. 8, Wuhu 241000, China.

College of Computer and Control Engineering, Northeast Forestry University, Harbin 150040, China.

出版信息

Sensors (Basel). 2025 Sep 3;25(17):5461. doi: 10.3390/s25175461.

DOI:10.3390/s25175461
PMID:40942890
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12431027/
Abstract

Person re-identification aims to match images of the same individual across non-overlapping cameras by analyzing personal characteristics. Recently, Transformer-based models have demonstrated excellent capabilities and achieved breakthrough progress in this task. However, their high computational costs and inadequate capacity to capture fine-grained local features impose significant constraints on re-identification performance. To address these challenges, this paper proposes a novel Toward Efficient Transformer-based Person Re-identification (TE-TransReID) framework. Specifically, the proposed framework retains only the former L-th layer layers of a pretrained Vision Transformer (ViT) for global feature extraction while combining local features extracted from a pretrained CNN, thus achieving the trade-off between high accuracy and lightweight networks. Additionally, we propose a dual efficient feature-fusion strategy to integrate global and local features for accurate person re-identification. The Efficient Token-based Feature-Fusion Module (ETFFM) employs the gate-based network to learn fused token-wise features, while the Efficient Patch-based Feature-Fusion Module (EPFFM) utilizes a lightweight Transformer to aggregate patch-level features. Finally, TE-TransReID achieves a rank-1 of 94.8%, 88.3%, and 85.7% on Market1501, DukeMTMC, and MSMT17 with a parameter of 27.5 M, respectively. Compared to existing CNN-Transformer hybrid models, TE-TransReID maintains comparable recognition accuracy while drastically reducing model parameters, establishing an optimal equilibrium between recognition accuracy and computational efficiency.

摘要

行人重识别旨在通过分析个人特征,在不重叠的摄像头之间匹配同一个人的图像。最近,基于Transformer的模型在这项任务中展现出了卓越的能力,并取得了突破性进展。然而,它们高昂的计算成本以及捕捉细粒度局部特征的能力不足,对重识别性能造成了重大限制。为应对这些挑战,本文提出了一种新颖的基于高效Transformer的行人重识别(TE-TransReID)框架。具体而言,所提出的框架仅保留预训练视觉Transformer(ViT)的前L层用于全局特征提取,同时结合从预训练卷积神经网络提取的局部特征,从而在高精度和轻量级网络之间实现权衡。此外,我们提出了一种双重高效特征融合策略,以整合全局和局部特征,实现准确的行人重识别。基于高效令牌的特征融合模块(ETFFM)采用基于门的网络来学习融合的逐令牌特征,而基于高效补丁的特征融合模块(EPFFM)利用轻量级Transformer来聚合补丁级特征。最后,TE-TransReID在Market1501、DukeMTMC和MSMT17上分别以2750万个参数实现了94.8%、88.3%和85.7%的Rank-1准确率。与现有的卷积神经网络-Transformer混合模型相比,TE-TransReID在大幅减少模型参数的同时保持了相当的识别准确率,在识别准确率和计算效率之间建立了最佳平衡。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c40a/12431027/95c8ba84f012/sensors-25-05461-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c40a/12431027/f097e823a9d6/sensors-25-05461-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c40a/12431027/1c69f6a9935d/sensors-25-05461-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c40a/12431027/e3b36ff57658/sensors-25-05461-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c40a/12431027/f3c7309516c8/sensors-25-05461-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c40a/12431027/ee532c2c9491/sensors-25-05461-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c40a/12431027/c84c488a23cd/sensors-25-05461-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c40a/12431027/95c8ba84f012/sensors-25-05461-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c40a/12431027/f097e823a9d6/sensors-25-05461-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c40a/12431027/1c69f6a9935d/sensors-25-05461-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c40a/12431027/e3b36ff57658/sensors-25-05461-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c40a/12431027/f3c7309516c8/sensors-25-05461-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c40a/12431027/ee532c2c9491/sensors-25-05461-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c40a/12431027/c84c488a23cd/sensors-25-05461-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c40a/12431027/95c8ba84f012/sensors-25-05461-g007.jpg

相似文献

1
TE-TransReID: Towards Efficient Person Re-Identification via Local Feature Embedding and Lightweight Transformer.TE-TransReID:通过局部特征嵌入和轻量级Transformer实现高效行人重识别
Sensors (Basel). 2025 Sep 3;25(17):5461. doi: 10.3390/s25175461.
2
ConvTransNet-S: A CNN-Transformer Hybrid Disease Recognition Model for Complex Field Environments.ConvTransNet-S:一种用于复杂现场环境的卷积神经网络与Transformer混合疾病识别模型。
Plants (Basel). 2025 Jul 22;14(15):2252. doi: 10.3390/plants14152252.
3
Multi-level channel-spatial attention and light-weight scale-fusion network (MCSLF-Net): multi-level channel-spatial attention and light-weight scale-fusion transformer for 3D brain tumor segmentation.多级通道空间注意力与轻量级尺度融合网络(MCSLF-Net):用于3D脑肿瘤分割的多级通道空间注意力与轻量级尺度融合变换器
Quant Imaging Med Surg. 2025 Jul 1;15(7):6301-6325. doi: 10.21037/qims-2025-354. Epub 2025 Jun 30.
4
TLTNet: A novel transscale cascade layered transformer network for enhanced retinal blood vessel segmentation.TLTNet:一种新颖的跨尺度级联分层Transformer 网络,用于增强视网膜血管分割。
Comput Biol Med. 2024 Aug;178:108773. doi: 10.1016/j.compbiomed.2024.108773. Epub 2024 Jun 25.
5
Lightweight hybrid Mamba2 for unsupervised medical image registration.用于无监督医学图像配准的轻量级混合Mamba2
Med Phys. 2025 Sep;52(9):e18104. doi: 10.1002/mp.18104.
6
EMOST: A dual-branch hybrid network for medical image fusion via efficient model module and sparse transformer.EMOST:一种基于高效模型模块和稀疏 Transformer 的医学图像融合双分支混合网络。
Comput Biol Med. 2024 Sep;179:108771. doi: 10.1016/j.compbiomed.2024.108771. Epub 2024 Jul 5.
7
Enhancing breast cancer detection on screening mammogram using self-supervised learning and a hybrid deep model of Swin Transformer and convolutional neural networks.使用自监督学习以及Swin Transformer和卷积神经网络的混合深度模型提高筛查乳腺钼靶片中的乳腺癌检测率。
J Med Imaging (Bellingham). 2025 Nov;12(Suppl 2):S22007. doi: 10.1117/1.JMI.12.S2.S22007. Epub 2025 May 14.
8
Fine-grained Prototype Network for MRI Sequence Classification.用于MRI序列分类的细粒度原型网络。
Curr Med Imaging. 2025 Jul 30. doi: 10.2174/0115734056361649250717162910.
9
Multi-stream feature fusion of vision transformer and CNN for precise epileptic seizure detection from EEG signals.用于从脑电图信号中精确检测癫痫发作的视觉Transformer和卷积神经网络的多流特征融合
J Transl Med. 2025 Aug 6;23(1):871. doi: 10.1186/s12967-025-06862-z.
10
MAFA-Uformer: Multi-attention and dual-branch feature aggregation U-shaped transformer for sparse-view CT reconstruction.MAFA-Uformer:用于稀疏视图CT重建的多注意力与双分支特征聚合U型变换器
J Xray Sci Technol. 2025 Jan;33(1):157-166. doi: 10.1177/08953996241300016. Epub 2025 Jan 8.

本文引用的文献

1
Vehicle Re-Identification Method Based on Efficient Self-Attention CNN-Transformer and Multi-Task Learning Optimization.基于高效自注意力卷积神经网络-Transformer和多任务学习优化的车辆重识别方法
Sensors (Basel). 2025 May 8;25(10):2977. doi: 10.3390/s25102977.
2
Person Re-Identification with Attribute-Guided, Robust-to-Low-Resolution Drone Footage Considering Fog/Edge Computing.基于属性引导、对低分辨率无人机影像具有鲁棒性且考虑雾/边缘计算的人员重新识别
Sensors (Basel). 2025 Mar 14;25(6):1819. doi: 10.3390/s25061819.
3
Enhancing Person Re-Identification through Attention-Driven Global Features and Angular Loss Optimization.
通过注意力驱动的全局特征和角度损失优化增强行人重识别
Entropy (Basel). 2024 May 21;26(6):436. doi: 10.3390/e26060436.
4
AAformer: Auto-Aligned Transformer for Person Re-Identification.AAformer:用于行人重识别的自动对齐Transformer。
IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):17307-17317. doi: 10.1109/TNNLS.2023.3301856. Epub 2024 Dec 2.
5
Deeply Coupled Convolution-Transformer With Spatial-Temporal Complementary Learning for Video-Based Person Re-Identification.基于时空互补学习的深度耦合卷积-Transformer用于视频人物重识别
IEEE Trans Neural Netw Learn Syst. 2024 Oct;35(10):13753-13763. doi: 10.1109/TNNLS.2023.3271353. Epub 2024 Oct 7.
6
Dual Branch Attention Network for Person Re-Identification.双分支注意力网络的行人再识别。
Sensors (Basel). 2021 Aug 30;21(17):5839. doi: 10.3390/s21175839.
7
Identifying Visible Parts via Pose Estimation for Occluded Person Re-Identification.基于姿态估计的遮挡行人再识别中可见部件的识别
IEEE Trans Neural Netw Learn Syst. 2022 Sep;33(9):4624-4634. doi: 10.1109/TNNLS.2021.3059515. Epub 2022 Aug 31.