• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于视频的人物再识别,使用图变换器融合互补的局部和全局特征。

Video-based person re-identification with complementary local and global features using a graph transformer.

作者信息

Lu Hai, Luo Enbo, Feng Yong, Wang Yifan

机构信息

Electric Power Research Institute of Yunnan Power Grid Co., Ltd., Kunming 650217, China.

出版信息

Math Biosci Eng. 2024 Jul 23;21(7):6694-6709. doi: 10.3934/mbe.2024293.

DOI:10.3934/mbe.2024293
PMID:39176415
Abstract

In recent years, significant progress has been made in video-based person re-identification (Re-ID). The key challenge in video person Re-ID lies in effectively constructing discriminative and robust person feature representations. Methods based on local regions utilize spatial and temporal attention to extract representative local features. However, prior approaches often overlook the correlations between local regions. To leverage relationships among different local regions, we have proposed a novel video person Re-ID representation learning approach based on a graph transformer, which facilitates contextual interactions between relevant region features. Specifically, we construct a local relation graph to model intrinsic relationships between nodes representing local regions. This graph employs the architecture of a transformer for feature propagation, iteratively refining region features and considering information from adjacent nodes to obtain partial feature representations. To learn compact and discriminative representations, we have further proposed a global feature learning branch based on a vision transformer to capture the relationships between different frames in a sequence. Additionally, we designed a dual-branch interaction network based on multi-head fusion attention to integrate frame-level features extracted by both local and global branches. Finally, the concatenated global and local features, after interaction, are used for testing. We evaluated the proposed method on three datasets, namely iLIDS-VID, MARS, and DukeMTMC-VideoReID. Experimental results demonstrate competitive performance, validating the effectiveness of our proposed approach.

摘要

近年来,基于视频的行人重识别(Re-ID)取得了显著进展。视频行人重识别的关键挑战在于有效地构建具有判别力和鲁棒性的行人特征表示。基于局部区域的方法利用空间和时间注意力来提取具有代表性的局部特征。然而,先前的方法往往忽略了局部区域之间的相关性。为了利用不同局部区域之间的关系,我们提出了一种基于图变换器的新颖视频行人Re-ID表示学习方法,该方法促进了相关区域特征之间的上下文交互。具体来说,我们构建了一个局部关系图来建模表示局部区域的节点之间的内在关系。该图采用变换器架构进行特征传播,迭代地细化区域特征并考虑来自相邻节点的信息以获得局部特征表示。为了学习紧凑且具有判别力的表示,我们进一步提出了一个基于视觉变换器的全局特征学习分支,以捕捉序列中不同帧之间的关系。此外,我们设计了一个基于多头融合注意力的双分支交互网络,以整合由局部和全局分支提取的帧级特征。最后,交互后的全局和局部特征连接起来用于测试。我们在三个数据集上评估了所提出的方法,即iLIDS-VID、MARS和DukeMTMC-VideoReID。实验结果表明该方法具有竞争力的性能,验证了我们所提方法的有效性。

相似文献

1
Video-based person re-identification with complementary local and global features using a graph transformer.基于视频的人物再识别,使用图变换器融合互补的局部和全局特征。
Math Biosci Eng. 2024 Jul 23;21(7):6694-6709. doi: 10.3934/mbe.2024293.
2
Adaptive Graph Representation Learning for Video Person Re-identification.用于视频人物重识别的自适应图表示学习
IEEE Trans Image Process. 2020 Jun 17;PP. doi: 10.1109/TIP.2020.3001693.
3
Exploring High-Order Spatio-Temporal Correlations From Skeleton for Person Re-Identification.从骨骼中探索用于行人重识别的高阶时空相关性。
IEEE Trans Image Process. 2023;32:949-963. doi: 10.1109/TIP.2023.3236144. Epub 2023 Jan 23.
4
Multi-Level Fusion Temporal-Spatial Co-Attention for Video-Based Person Re-Identification.用于基于视频的行人重识别的多级融合时空协同注意力
Entropy (Basel). 2021 Dec 15;23(12):1686. doi: 10.3390/e23121686.
5
Multi-granularity graph pooling for video-based person re-identification.基于视频的行人再识别的多粒度图池化。
Neural Netw. 2023 Mar;160:22-33. doi: 10.1016/j.neunet.2022.12.015. Epub 2022 Dec 28.
6
Video Person Re-identification by Temporal Residual Learning.基于时间残差学习的视频人物重识别
IEEE Trans Image Process. 2018 Oct 29. doi: 10.1109/TIP.2018.2878505.
7
Video Person Re-Identification with Frame Sampling-Random Erasure and Mutual Information-Temporal Weight Aggregation.基于帧采样-随机擦除和互信息-时间权重聚合的视频行人再识别。
Sensors (Basel). 2022 Apr 15;22(8):3047. doi: 10.3390/s22083047.
8
Video-Based Person Re-Identification by an End-To-End Learning Architecture with Hybrid Deep Appearance-Temporal Feature.基于端到端学习架构的混合深度表观-时间特征的视频人物再识别
Sensors (Basel). 2018 Oct 29;18(11):3669. doi: 10.3390/s18113669.
9
Multi-scale Temporal Cues Learning for Video Person Re-Identification.用于视频人物重识别的多尺度时间线索学习
IEEE Trans Image Process. 2020 Feb 14. doi: 10.1109/TIP.2020.2972108.
10
A Multi-Level Relation-Aware Transformer model for occluded person re-identification.一种用于遮挡行人再识别的多层次关系感知 Transformer 模型。
Neural Netw. 2024 Sep;177:106382. doi: 10.1016/j.neunet.2024.106382. Epub 2024 May 9.