Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China.
Computer Vision Laboratory, ETH Zürich, Zürich, Switzerland.
IEEE Trans Image Process. 2018;27(1):151-163. doi: 10.1109/TIP.2017.2746993.
To address the problem of face recognition with image sets, we aim to capture the underlying data distribution in each set and thus facilitate more robust classification. To this end, we represent image set as the Gaussian mixture model (GMM) comprising a number of Gaussian components with prior probabilities and seek to discriminate Gaussian components from different classes. Since in the light of information geometry, the Gaussians lie on a specific Riemannian manifold, this paper presents a method named discriminant analysis on Riemannian manifold of Gaussian distributions (DARG). We investigate several distance metrics between Gaussians and accordingly two discriminative learning frameworks are presented to meet the geometric and statistical characteristics of the specific manifold. The first framework derives a series of provably positive definite probabilistic kernels to embed the manifold to a high-dimensional Hilbert space, where conventional discriminant analysis methods developed in Euclidean space can be applied, and a weighted Kernel discriminant analysis is devised which learns discriminative representation of the Gaussian components in GMMs with their prior probabilities as sample weights. Alternatively, the other framework extends the classical graph embedding method to the manifold by utilizing the distance metrics between Gaussians to construct the adjacency graph, and hence the original manifold is embedded to a lower-dimensional and discriminative target manifold with the geometric structure preserved and the interclass separability maximized. The proposed method is evaluated by face identification and verification tasks on four most challenging and largest databases, YouTube Celebrities, COX, YouTube Face DB, and Point-and-Shoot Challenge, to demonstrate its superiority over the state-of-the-art.To address the problem of face recognition with image sets, we aim to capture the underlying data distribution in each set and thus facilitate more robust classification. To this end, we represent image set as the Gaussian mixture model (GMM) comprising a number of Gaussian components with prior probabilities and seek to discriminate Gaussian components from different classes. Since in the light of information geometry, the Gaussians lie on a specific Riemannian manifold, this paper presents a method named discriminant analysis on Riemannian manifold of Gaussian distributions (DARG). We investigate several distance metrics between Gaussians and accordingly two discriminative learning frameworks are presented to meet the geometric and statistical characteristics of the specific manifold. The first framework derives a series of provably positive definite probabilistic kernels to embed the manifold to a high-dimensional Hilbert space, where conventional discriminant analysis methods developed in Euclidean space can be applied, and a weighted Kernel discriminant analysis is devised which learns discriminative representation of the Gaussian components in GMMs with their prior probabilities as sample weights. Alternatively, the other framework extends the classical graph embedding method to the manifold by utilizing the distance metrics between Gaussians to construct the adjacency graph, and hence the original manifold is embedded to a lower-dimensional and discriminative target manifold with the geometric structure preserved and the interclass separability maximized. The proposed method is evaluated by face identification and verification tasks on four most challenging and largest databases, YouTube Celebrities, COX, YouTube Face DB, and Point-and-Shoot Challenge, to demonstrate its superiority over the state-of-the-art.
为了解决图像集的人脸识别问题,我们旨在捕捉每个集合中的底层数据分布,从而促进更稳健的分类。为此,我们将图像集表示为具有先验概率的高斯混合模型(GMM),并试图区分不同类别的高斯分量。由于根据信息几何,高斯位于特定的黎曼流形上,因此本文提出了一种称为高斯分布黎曼流形上的判别分析(DARG)的方法。我们研究了高斯之间的几种距离度量,并相应地提出了两种判别学习框架,以满足特定流形的几何和统计特性。第一个框架导出了一系列可证明的正定概率核,将流形嵌入到高维希尔伯特空间中,在那里可以应用欧几里得空间中开发的传统判别分析方法,并设计了加权核判别分析,该方法学习 GMM 中高斯分量的判别表示,其先验概率作为样本权重。或者,另一个框架通过利用高斯之间的距离度量来构建邻接图,将经典的图嵌入方法扩展到流形上,从而将原始流形嵌入到具有保留几何结构和最大化类间可分离性的低维和判别目标流形中。该方法通过在四个最具挑战性和最大的数据库(YouTube 名人、COX、YouTube Face DB 和 Point-and-Shoot Challenge)上的人脸识别和验证任务进行评估,证明了其优于最先进的方法。