基于跨视角多级字典学习的行人重识别

Person Re-Identification by Cross-View Multi-Level Dictionary Learning.

作者信息

Li Sheng, Shao Ming, Fu Yun

出版信息

IEEE Trans Pattern Anal Mach Intell. 2018 Dec;40(12):2963-2977. doi: 10.1109/TPAMI.2017.2764893. Epub 2017 Oct 26.

DOI:10.1109/TPAMI.2017.2764893

Abstract

Person re-identification plays an important role in many safety-critical applications. Existing works mainly focus on extracting patch-level features or learning distance metrics. However, the representation power of extracted features might be limited, due to the various viewing conditions of pedestrian images in complex real-world scenarios. To improve the representation power of features, we learn discriminative and robust representations via dictionary learning in this paper. First, we propose a Cross-view Dictionary Learning (CDL) model, which is a general solution to the multi-view learning problem. Inspired by the dictionary learning based domain adaptation, CDL learns a pair of dictionaries from two views. In particular, CDL adopts a projective learning strategy, which is more efficient than the optimization in traditional dictionary learning. Second, we propose a Cross-view Multi-level Dictionary Learning (CMDL) approach based on CDL. CMDL contains dictionary learning models at different representation levels, including image-level, horizontal part-level, and patch-level. The proposed models take advantages of the view-consistency information, and adaptively learn pairs of dictionaries to generate robust and compact representations for pedestrian images. Third, we incorporate a discriminative regularization term to CMDL, and propose a CMDL-Dis approach which learns pairs of discriminative dictionaries in image-level and part-level. We devise efficient optimization algorithms to solve the proposed models. Finally, a fusion strategy is utilized to generate the similarity scores for test images. Experiments on the public VIPeR, CUHK Campus, iLIDS, GRID and PRID450S datasets show that our approach achieves the state-of-the-art performance.

摘要

行人重识别在许多安全关键型应用中发挥着重要作用。现有工作主要集中在提取补丁级特征或学习距离度量。然而，由于复杂现实场景中行人图像的各种观看条件，提取特征的表示能力可能有限。为了提高特征的表示能力，我们在本文中通过字典学习来学习判别性和鲁棒性表示。首先，我们提出了一种跨视图字典学习（CDL）模型，它是多视图学习问题的通用解决方案。受基于字典学习的域适应启发，CDL从两个视图中学习一对字典。具体而言，CDL采用投影学习策略，这比传统字典学习中的优化更有效。其次，我们基于CDL提出了一种跨视图多级字典学习（CMDL）方法。CMDL包含不同表示级别的字典学习模型，包括图像级、水平部分级和补丁级。所提出的模型利用视图一致性信息，并自适应地学习字典对，以为行人图像生成鲁棒且紧凑的表示。第三，我们将判别正则化项纳入CMDL，并提出了一种CMDL-Dis方法，该方法在图像级和部分级学习判别字典对。我们设计了高效的优化算法来求解所提出的模型。最后，利用融合策略生成测试图像的相似度分数。在公共VIPeR数据集、香港中文大学校园数据集、iLIDS数据集、GRID数据集和PRID450S数据集上的实验表明，我们的方法取得了当前最优的性能。

相似文献

Person Re-Identification by Cross-View Multi-Level Dictionary Learning.基于跨视角多级字典学习的行人重识别

IEEE Trans Pattern Anal Mach Intell. 2018 Dec;40(12):2963-2977. doi: 10.1109/TPAMI.2017.2764893. Epub 2017 Oct 26.

Toward Resolution-Invariant Person Reidentification via Projective Dictionary Learning.通过投影字典学习实现分辨率不变的行人重识别

IEEE Trans Neural Netw Learn Syst. 2019 Jun;30(6):1896-1907. doi: 10.1109/TNNLS.2018.2875429. Epub 2018 Nov 2.

Cross-View Action Recognition via Transferable Dictionary Learning.跨视图动作识别的可迁移字典学习

IEEE Trans Image Process. 2016 May;25(6):2542-56. doi: 10.1109/TIP.2016.2548242.

Dictionary learning for stereo image representation.立体图像表示的字典学习。

IEEE Trans Image Process. 2011 Apr;20(4):921-34. doi: 10.1109/TIP.2010.2081679. Epub 2010 Sep 30.

Super-Resolution Person Re-Identification With Semi-Coupled Low-Rank Discriminant Dictionary Learning.基于半耦合低秩判别字典学习的超分辨率行人再识别

IEEE Trans Image Process. 2017 Mar;26(3):1363-1378. doi: 10.1109/TIP.2017.2651364. Epub 2017 Jan 10.

Multimodal Task-Driven Dictionary Learning for Image Classification.基于多模态任务驱动的图像分类词典学习

IEEE Trans Image Process. 2016 Jan;25(1):24-38. doi: 10.1109/TIP.2015.2496275. Epub 2015 Oct 30.

Discriminative dictionary learning algorithm with pairwise local constraints for histopathological image classification.基于局部约束对的鉴别字典学习算法在病理图像分类中的应用

Med Biol Eng Comput. 2021 Jan;59(1):153-164. doi: 10.1007/s11517-020-02281-y. Epub 2021 Jan 2.

Multi-Modal Convolutional Dictionary Learning.多模态卷积字典学习

IEEE Trans Image Process. 2022;31:1325-1339. doi: 10.1109/TIP.2022.3141251. Epub 2022 Jan 25.

Deep MCDL: Deep Multi-Scale Multi-Modal Convolutional Dictionary Learning Network.深度MCDL：深度多尺度多模态卷积字典学习网络。

IEEE Trans Pattern Anal Mach Intell. 2024 May;46(5):2770-2787. doi: 10.1109/TPAMI.2023.3334624. Epub 2024 Apr 3.

Slice-Based Online Convolutional Dictionary Learning.基于切片的在线卷积字典学习。

IEEE Trans Cybern. 2021 Oct;51(10):5116-5129. doi: 10.1109/TCYB.2019.2931914. Epub 2021 Oct 12.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于跨视角多级字典学习的行人重识别

Person Re-Identification by Cross-View Multi-Level Dictionary Learning.

作者信息

出版信息

相似文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献