Sun Yifan, Zheng Liang, Li Yali, Yang Yi, Tian Qi, Wang Shengjin
IEEE Trans Pattern Anal Mach Intell. 2021 Mar;43(3):902-917. doi: 10.1109/TPAMI.2019.2938523. Epub 2021 Feb 4.
Part-level features offer fine granularity for pedestrian image description. In this article, we generally aim to learn discriminative part-informed feature for person re-identification. Our contribution is two-fold. First, we introduce a general part-level feature learning method, named Part-based Convolutional Baseline (PCB). Given an image input, it outputs a convolutional descriptor consisting of several part-level features. PCB is general in that it is able to accommodate several part partitioning strategies, including pose estimation, human parsing and uniform part partitioning. In experiment, we show that the learned descriptor has a significantly higher discriminative ability than the global descriptor. Second, based on PCB, we propose refined part pooling (RPP), which allows the parts to be more precisely located. Our idea is that pixels within a well-located part should be similar to each other while being dissimilar with pixels from other parts. We call it within-part consistency. When a pixel-wise feature vector in a part is more similar to some other part, it is then an outlier, indicating inappropriate partitioning. RPP re-assigns these outliers to the parts they are closest to, resulting in refined parts with enhanced within-part consistency. RPP requires no part labels and is trained in a weakly supervised manner. Experiment confirms that RPP allows PCB to gain another round of performance boost. For instance, on the Market-1501 dataset, we achieve (77.4+4.2) percent mAP and (92.3+1.5) percent rank-1 accuracy, a competitive performance with the state of the art.
部件级特征为行人图像描述提供了精细的粒度。在本文中,我们总体目标是学习用于行人重识别的有判别力的部件感知特征。我们的贡献有两个方面。首先,我们引入一种通用的部件级特征学习方法,名为基于部件的卷积基线(PCB)。给定一个图像输入,它输出一个由几个部件级特征组成的卷积描述符。PCB具有通用性,因为它能够适应多种部件划分策略,包括姿态估计、人体解析和均匀部件划分。在实验中,我们表明所学习的描述符比全局描述符具有显著更高的判别能力。其次,基于PCB,我们提出了精细部件池化(RPP),它能使部件定位更精确。我们的想法是,定位良好的部件内的像素应该彼此相似,而与其他部件的像素不同。我们称之为部件内一致性。当一个部件中的逐像素特征向量与其他某个部件更相似时,它就是一个离群值,表明划分不合适。RPP将这些离群值重新分配到它们最接近的部件,从而得到具有增强的部件内一致性的精细部件。RPP不需要部件标签,并且以弱监督方式进行训练。实验证实RPP能使PCB的性能再提升一轮。例如,在Market-1501数据集上,我们实现了(77.4 + 4.2)%的平均精度均值(mAP)和(92.3 + 1.5)%的秩-1准确率,与当前最优水平相比具有竞争力。