IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):6649-6666. doi: 10.1109/TPAMI.2021.3092833. Epub 2022 Sep 14.
Person re-identification (Re-ID) via gait features within 3D skeleton sequences is a newly-emerging topic with several advantages. Existing solutions either rely on hand-crafted descriptors or supervised gait representation learning. This paper proposes a self-supervised gait encoding approach that can leverage unlabeled skeleton data to learn gait representations for person Re-ID. Specifically, we first create self-supervision by learning to reconstruct unlabeled skeleton sequences reversely, which involves richer high-level semantics to obtain better gait representations. Other pretext tasks are also explored to further improve self-supervised learning. Second, inspired by the fact that motion's continuity endows adjacent skeletons in one skeleton sequence and temporally consecutive skeleton sequences with higher correlations (referred as locality in 3D skeleton data), we propose a locality-aware attention mechanism and a locality-aware contrastive learning scheme, which aim to preserve locality-awareness on intra-sequence level and inter-sequence level respectively during self-supervised learning. Last, with context vectors learned by our locality-aware attention mechanism and contrastive learning scheme, a novel feature named Constrastive Attention-based Gait Encodings (CAGEs) is designed to represent gait effectively. Empirical evaluations show that our approach significantly outperforms skeleton-based counterparts by 15-40 percent Rank-1 accuracy, and it even achieves superior performance to numerous multi-modal methods with extra RGB or depth information. Our codes are available at https://github.com/Kali-Hac/Locality-Awareness-SGE.
基于 3D 骨骼序列的步态特征的人体重识别(Re-ID)是一个新兴的话题,具有多个优势。现有的解决方案要么依赖于手工制作的描述符,要么依赖于监督步态表示学习。本文提出了一种自监督步态编码方法,可以利用未标记的骨骼数据学习人体 Re-ID 的步态表示。具体来说,我们首先通过学习反向重建未标记的骨骼序列来创建自我监督,这涉及更丰富的高级语义,以获得更好的步态表示。还探索了其他的预训练任务,以进一步提高自监督学习的效果。其次,受运动的连续性赋予一个骨骼序列中的相邻骨骼和时间上连续的骨骼序列更高相关性的事实的启发(在 3D 骨骼数据中称为局部性),我们提出了一种局部感知注意力机制和局部感知对比学习方案,旨在在自监督学习过程中分别在序列内和序列间保持局部感知。最后,利用我们的局部感知注意力机制和对比学习方案学习到的上下文向量,设计了一种新的特征,称为基于对比注意力的步态编码(CAGEs),以有效地表示步态。实验评估表明,我们的方法在 Rank-1 准确率上比基于骨骼的方法高出 15-40%,甚至优于具有额外 RGB 或深度信息的许多多模态方法。我们的代码可以在 https://github.com/Kali-Hac/Locality-Awareness-SGE 上获得。