Yang Xi, Wang Xiaoqi, Wang Nannan, Gao Xinbo
IEEE Trans Neural Netw Learn Syst. 2024 Nov;35(11):16916-16926. doi: 10.1109/TNNLS.2023.3300582. Epub 2024 Oct 29.
Text attribute person search aims to identify the particular pedestrian by textual attribute information. Compared to person re- identification tasks which requires imagery samples as its query, text attribute person search is more useful under the circumstance where only witness is available. Most existing text attribute person search methods focus on improving the matching correlation and alignments by learning better representations of person-attribute instance pairs, with few consideration of the latent correlations between attributes. In this work, we propose a graph convolutional network (GCN) and pseudo-label-based text attribute person search method. Concretely, the model directly constructs the attribute correlations by label co- occurrence probability, in which the nodes are represented by attribute embedding and edges are by the filtered correlation matrix of attribute labels. In order to obtain better representations, we combine the cross-attention module (CAM) and the GCN. Furthermore, to address the unseen attribute relationships, we update the edge information through the instances through testing set with high predicted probability thus to better adapt the attribute distribution. Extensive experiments illustrate that our model outperforms the existing state-of-the-art methods on publicly available person search benchmarks: Market-1501 and PETA.
文本属性行人搜索旨在通过文本属性信息识别特定行人。与需要图像样本作为查询的行人再识别任务相比,文本属性行人搜索在仅有目击者描述的情况下更有用。大多数现有的文本属性行人搜索方法专注于通过学习更好的人物 - 属性实例对表示来提高匹配相关性和对齐度,很少考虑属性之间的潜在相关性。在这项工作中,我们提出了一种基于图卷积网络(GCN)和伪标签的文本属性行人搜索方法。具体而言,该模型通过标签共现概率直接构建属性相关性,其中节点由属性嵌入表示,边由属性标签的过滤相关矩阵表示。为了获得更好的表示,我们将交叉注意力模块(CAM)和GCN相结合。此外,为了解决未见的属性关系,我们通过具有高预测概率的测试集实例更新边信息,从而更好地适应属性分布。大量实验表明,我们的模型在公开可用的行人搜索基准数据集Market-1501和PETA上优于现有的最先进方法。