Fu Chaoyou, Wu Xiang, Hu Yibo, Huang Huaibo, He Ran
IEEE Trans Pattern Anal Mach Intell. 2022 Jun;44(6):2938-2952. doi: 10.1109/TPAMI.2021.3052549. Epub 2022 May 5.
Heterogeneous face recognition (HFR) refers to matching cross-domain faces and plays a crucial role in public security. Nevertheless, HFR is confronted with challenges from large domain discrepancy and insufficient heterogeneous data. In this paper, we formulate HFR as a dual generation problem, and tackle it via a novel dual variational generation (DVG-Face) framework. Specifically, a dual variational generator is elaborately designed to learn the joint distribution of paired heterogeneous images. However, the small-scale paired heterogeneous training data may limit the identity diversity of sampling. In order to break through the limitation, we propose to integrate abundant identity information of large-scale visible data into the joint distribution. Furthermore, a pairwise identity preserving loss is imposed on the generated paired heterogeneous images to ensure their identity consistency. As a consequence, massive new diverse paired heterogeneous images with the same identity can be generated from noises. The identity consistency and identity diversity properties allow us to employ these generated images to train the HFR network via a contrastive learning mechanism, yielding both domain-invariant and discriminative embedding features. Concretely, the generated paired heterogeneous images are regarded as positive pairs, and the images obtained from different samplings are considered as negative pairs. Our method achieves superior performances over state-of-the-art methods on seven challenging databases belonging to five HFR tasks, including NIR-VIS, Sketch-Photo, Profile-Frontal Photo, Thermal-VIS, and ID-Camera.
异质人脸识别(HFR)是指对跨域人脸进行匹配,在公共安全领域发挥着至关重要的作用。然而,HFR面临着来自大域差异和异质数据不足的挑战。在本文中,我们将HFR表述为一个双生成问题,并通过一个新颖的双变分生成(DVG-Face)框架来解决它。具体而言,精心设计了一个双变分生成器来学习配对异质图像的联合分布。然而,小规模的配对异质训练数据可能会限制采样的身份多样性。为了突破这一限制,我们建议将大规模可见数据的丰富身份信息整合到联合分布中。此外,对生成的配对异质图像施加成对身份保留损失,以确保它们的身份一致性。因此,可以从噪声中生成大量具有相同身份的新的多样配对异质图像。身份一致性和身份多样性属性使我们能够通过对比学习机制使用这些生成的图像来训练HFR网络,从而产生域不变且有区分力的嵌入特征。具体来说,将生成的配对异质图像视为正样本对,将从不同采样中获得的图像视为负样本对。我们的方法在属于五个HFR任务的七个具有挑战性的数据库上,包括近红外-可见光、草图-照片、侧面-正面照片、热成像-可见光以及身份证-相机,比现有方法取得了更优的性能。