IEEE Trans Pattern Anal Mach Intell. 2016 Jan;38(1):188-94. doi: 10.1109/TPAMI.2015.2435740.
In many computer vision systems, the same object can be observed at varying viewpoints or even by different sensors, which brings in the challenging demand for recognizing objects from distinct even heterogeneous views. In this work we propose a Multi-view Discriminant Analysis (MvDA) approach, which seeks for a single discriminant common space for multiple views in a non-pairwise manner by jointly learning multiple view-specific linear transforms. Specifically, our MvDA is formulated to jointly solve the multiple linear transforms by optimizing a generalized Rayleigh quotient, i.e., maximizing the between-class variations and minimizing the within-class variations from both intra-view and inter-view in the common space. By reformulating this problem as a ratio trace problem, the multiple linear transforms are achieved analytically and simultaneously through generalized eigenvalue decomposition. Furthermore, inspired by the observation that different views share similar data structures, a constraint is introduced to enforce the view-consistency of the multiple linear transforms. The proposed method is evaluated on three tasks: face recognition across pose, photo versus. sketch face recognition, and visual light image versus near infrared image face recognition on Multi-PIE, CUFSF and HFB databases respectively. Extensive experiments show that our MvDA achieves significant improvements compared with the best known results.
在许多计算机视觉系统中,同一物体可以从不同的视角进行观察,甚至可以通过不同的传感器进行观察,这就带来了从不同的甚至异构的视角识别物体的挑战性需求。在这项工作中,我们提出了一种多视图判别分析(MvDA)方法,该方法通过联合学习多个视图特定的线性变换,以非成对的方式为多个视图寻求单一的判别公共空间。具体来说,我们的 MvDA 通过优化广义瑞利商来联合求解多个线性变换,即最大化公共空间中来自内视图和视图间的类间方差,同时最小化类内方差。通过将这个问题重新表述为一个比迹问题,通过广义特征值分解,可以解析地同时获得多个线性变换。此外,受不同视图共享相似数据结构这一观察结果的启发,我们引入了一个约束条件,以强制多个线性变换的视图一致性。我们在三个任务上评估了所提出的方法:多姿态人脸识别、照片与素描人脸识别,以及 Multi-PIE、CUFSF 和 HFB 数据库上的可见光图像与近红外图像人脸识别。广泛的实验表明,与最先进的结果相比,我们的 MvDA 取得了显著的改进。