多视图判别分析。

Multi-View Discriminant Analysis.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2016 Jan;38(1):188-94. doi: 10.1109/TPAMI.2015.2435740.

DOI:10.1109/TPAMI.2015.2435740

Abstract

In many computer vision systems, the same object can be observed at varying viewpoints or even by different sensors, which brings in the challenging demand for recognizing objects from distinct even heterogeneous views. In this work we propose a Multi-view Discriminant Analysis (MvDA) approach, which seeks for a single discriminant common space for multiple views in a non-pairwise manner by jointly learning multiple view-specific linear transforms. Specifically, our MvDA is formulated to jointly solve the multiple linear transforms by optimizing a generalized Rayleigh quotient, i.e., maximizing the between-class variations and minimizing the within-class variations from both intra-view and inter-view in the common space. By reformulating this problem as a ratio trace problem, the multiple linear transforms are achieved analytically and simultaneously through generalized eigenvalue decomposition. Furthermore, inspired by the observation that different views share similar data structures, a constraint is introduced to enforce the view-consistency of the multiple linear transforms. The proposed method is evaluated on three tasks: face recognition across pose, photo versus. sketch face recognition, and visual light image versus near infrared image face recognition on Multi-PIE, CUFSF and HFB databases respectively. Extensive experiments show that our MvDA achieves significant improvements compared with the best known results.

摘要

在许多计算机视觉系统中，同一物体可以从不同的视角进行观察，甚至可以通过不同的传感器进行观察，这就带来了从不同的甚至异构的视角识别物体的挑战性需求。在这项工作中，我们提出了一种多视图判别分析（MvDA）方法，该方法通过联合学习多个视图特定的线性变换，以非成对的方式为多个视图寻求单一的判别公共空间。具体来说，我们的 MvDA 通过优化广义瑞利商来联合求解多个线性变换，即最大化公共空间中来自内视图和视图间的类间方差，同时最小化类内方差。通过将这个问题重新表述为一个比迹问题，通过广义特征值分解，可以解析地同时获得多个线性变换。此外，受不同视图共享相似数据结构这一观察结果的启发，我们引入了一个约束条件，以强制多个线性变换的视图一致性。我们在三个任务上评估了所提出的方法：多姿态人脸识别、照片与素描人脸识别，以及 Multi-PIE、CUFSF 和 HFB 数据库上的可见光图像与近红外图像人脸识别。广泛的实验表明，与最先进的结果相比，我们的 MvDA 取得了显著的改进。

相似文献

Multi-View Discriminant Analysis.

IEEE Trans Pattern Anal Mach Intell. 2016 Jan;38(1):188-94. doi: 10.1109/TPAMI.2015.2435740.

Cross-View Action Recognition Over Heterogeneous Feature Spaces.

IEEE Trans Image Process. 2015 Nov;24(11):4096-108. doi: 10.1109/TIP.2015.2445293. Epub 2015 Jun 12.

Multi-View Linear Discriminant Analysis Network.

IEEE Trans Image Process. 2019 Nov;28(11):5352-5365. doi: 10.1109/TIP.2019.2913511. Epub 2019 May 2.

Generalized Multi-View Embedding for Visual Recognition and Cross-Modal Retrieval.

IEEE Trans Cybern. 2018 Sep;48(9):2542-2555. doi: 10.1109/TCYB.2017.2742705. Epub 2017 Sep 6.

Tensor discriminant color space for face recognition.

IEEE Trans Image Process. 2011 Sep;20(9):2490-501. doi: 10.1109/TIP.2011.2121084. Epub 2011 Feb 28.

Discriminant subspace analysis: a Fukunaga-Koontz approach.

IEEE Trans Pattern Anal Mach Intell. 2007 Oct;29(10):1732-45. doi: 10.1109/TPAMI.2007.1089.

Discriminative learning and recognition of image set classes using canonical correlations.

IEEE Trans Pattern Anal Mach Intell. 2007 Jun;29(6):1005-18. doi: 10.1109/TPAMI.2007.1037.

Multi-task pose-invariant face recognition.

IEEE Trans Image Process. 2015 Mar;24(3):980-93. doi: 10.1109/TIP.2015.2390959. Epub 2015 Jan 12.

Invariant object recognition in the visual system with novel views of 3D objects.

Neural Comput. 2002 Nov;14(11):2585-96. doi: 10.1162/089976602760407982.

Regularized discriminative spectral regression method for heterogeneous face matching.

IEEE Trans Image Process. 2013 Jan;22(1):353-62. doi: 10.1109/TIP.2012.2215617. Epub 2012 Aug 27.

引用本文的文献

Hybrid DAER Based Cross-Modal Retrieval Exploiting Deep Representation Learning.

Entropy (Basel). 2023 Aug 16;25(8):1216. doi: 10.3390/e25081216.

Multi-Modal Learning-Based Equipment Fault Prediction in the Internet of Things.

Sensors (Basel). 2022 Sep 6;22(18):6722. doi: 10.3390/s22186722.

An Improved Entropy-Weighted Topsis Method for Decision-Level Fusion Evaluation System of Multi-Source Data.

Sensors (Basel). 2022 Aug 25;22(17):6391. doi: 10.3390/s22176391.

A multimodal deep learning model for cardiac resynchronisation therapy response prediction.

Med Image Anal. 2022 Jul;79:102465. doi: 10.1016/j.media.2022.102465. Epub 2022 Apr 20.

Hierarchical Fusion Using Subsets of Multi-Features for Historical Arabic Manuscript Dating.

J Imaging. 2022 Mar 1;8(3):60. doi: 10.3390/jimaging8030060.

Some Information Geometric Aspects of Cyber Security by Face Recognition.

Entropy (Basel). 2021 Jul 9;23(7):878. doi: 10.3390/e23070878.

Kernelized Heterogeneity-Aware Cross-View Face Recognition.

Front Artif Intell. 2021 Jul 20;4:670538. doi: 10.3389/frai.2021.670538. eCollection 2021.

A Collaborative Dictionary Learning Model for Nasopharyngeal Carcinoma Segmentation on Multimodalities MR Sequences.

Comput Math Methods Med. 2020 Aug 28;2020:7562140. doi: 10.1155/2020/7562140. eCollection 2020.

Multi-View Broad Learning System for Primate Oculomotor Decision Decoding.

IEEE Trans Neural Syst Rehabil Eng. 2020 Sep;28(9):1908-1920. doi: 10.1109/TNSRE.2020.3003342. Epub 2020 Jun 18.

Multiview learning for understanding functional multiomics.

PLoS Comput Biol. 2020 Apr 2;16(4):e1007677. doi: 10.1371/journal.pcbi.1007677. eCollection 2020 Apr.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

多视图判别分析。

Multi-View Discriminant Analysis.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2016 Jan;38(1):188-94. doi: 10.1109/TPAMI.2015.2435740.

DOI:10.1109/TPAMI.2015.2435740

PMID:26656586

Abstract

摘要

多视图判别分析。

Multi-View Discriminant Analysis.

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

多视图判别分析。

Multi-View Discriminant Analysis.

出版信息

相似文献

引用本文的文献