Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing 100190, China.
IEEE Trans Image Process. 2012 Oct;21(10):4466-79. doi: 10.1109/TIP.2012.2206039. Epub 2012 Jun 26.
In this paper, we address the problem of classifying image sets for face recognition, where each set contains images belonging to the same subject and typically covering large variations. By modeling each image set as a manifold, we formulate the problem as the computation of the distance between two manifolds, called manifold-manifold distance (MMD). Since an image set can come in three pattern levels, point, subspace, and manifold, we systematically study the distance among the three levels and formulate them in a general multilevel MMD framework. Specifically, we express a manifold by a collection of local linear models, each depicted by a subspace. MMD is then converted to integrate the distances between pairs of subspaces from one of the involved manifolds. We theoretically and experimentally study several configurations of the ingredients of MMD. The proposed method is applied to the task of face recognition with image sets, where identification is achieved by seeking the minimum MMD from the probe to the gallery of image sets. Our experiments demonstrate that, as a general set similarity measure, MMD consistently outperforms other competing nondiscriminative methods and is also promisingly comparable to the state-of-the-art discriminative methods.
在本文中,我们解决了用于人脸识别的图像集分类问题,其中每个集包含属于同一主体的图像,并且通常涵盖很大的变化。通过将每个图像集建模为流形,我们将问题表述为计算两个流形之间的距离,称为流形-流形距离(MMD)。由于图像集可以有三种模式级别,即点、子空间和流形,因此我们系统地研究了这三个级别之间的距离,并在一般的多级 MMD 框架中对其进行了表述。具体来说,我们通过收集局部线性模型来表示流形,每个模型都由一个子空间表示。然后,MMD 被转换为集成来自一个所涉及的流形的子空间对之间的距离。我们从理论和实验两方面研究了 MMD 的几个组成部分的配置。所提出的方法应用于具有图像集的人脸识别任务,其中通过从探针到图像集库寻找最小 MMD 来实现识别。我们的实验表明,作为一种通用的集合相似性度量,MMD 始终优于其他竞争的非判别方法,并且与最先进的判别方法也具有可比性。