解析未标记数据中的变异模式。

Disentangling the Modes of Variation in Unlabelled Data.

作者信息

Wang Mengjiao, Panagakis Yannis, Snape Patrick, Zafeiriou Stefanos P

出版信息

IEEE Trans Pattern Anal Mach Intell. 2018 Nov;40(11):2682-2695. doi: 10.1109/TPAMI.2017.2783940. Epub 2017 Dec 15.

DOI:10.1109/TPAMI.2017.2783940

Abstract

Statistical methods are of paramount importance in discovering the modes of variation in visual data. The Principal Component Analysis (PCA) is probably the most prominent method for extracting a single mode of variation in the data. However, in practice, several factors contribute to the appearance of visual objects including pose, illumination, and deformation, to mention a few. To extract these modes of variations from visual data, several supervised methods, such as the TensorFaces relying on multilinear (tensor) decomposition have been developed. The main drawbacks of such methods is that they require both labels regarding the modes of variations and the same number of samples under all modes of variations (e.g., the same face under different expressions, poses etc.). Therefore, their applicability is limited to well-organised data, usually captured in well-controlled conditions. In this paper, we propose a novel general multilinear matrix decomposition method that discovers the multilinear structure of possibly incomplete sets of visual data in unsupervised setting (i.e., without the presence of labels). We also propose extensions of the method with sparsity and low-rank constraints in order to handle noisy data, captured in unconstrained conditions. Besides that, a graph-regularised variant of the method is also developed in order to exploit available geometric or label information for some modes of variations. We demonstrate the applicability of the proposed method in several computer vision tasks, including Shape from Shading (SfS) (in the wild and with occlusion removal), expression transfer, and estimation of surface normals from images captured in the wild.

摘要

统计方法在发现视觉数据的变化模式方面至关重要。主成分分析（PCA）可能是提取数据中单一变化模式最突出的方法。然而，在实际中，有几个因素会影响视觉对象的外观，比如姿态、光照和变形等等。为了从视觉数据中提取这些变化模式，已经开发了几种监督方法，例如依赖多线性（张量）分解的张量脸方法。这类方法的主要缺点是，它们既需要关于变化模式的标签，又需要在所有变化模式下具有相同数量的样本（例如，同一面部在不同表情、姿态等下的样本）。因此，它们的适用性仅限于通常在良好控制条件下捕获的组织良好的数据。在本文中，我们提出了一种新颖的通用多线性矩阵分解方法，该方法在无监督设置（即没有标签）下发现可能不完整的视觉数据集的多线性结构。我们还提出了该方法的扩展，带有稀疏性和低秩约束，以处理在无约束条件下捕获的噪声数据。除此之外，还开发了该方法的一种图正则化变体，以便利用某些变化模式的可用几何或标签信息。我们在几个计算机视觉任务中展示了所提出方法的适用性，包括从阴影恢复形状（SfS）（在自然场景中以及去除遮挡的情况下）、表情迁移以及从自然场景中捕获的图像估计表面法线。

相似文献

Disentangling the Modes of Variation in Unlabelled Data.解析未标记数据中的变异模式。

IEEE Trans Pattern Anal Mach Intell. 2018 Nov;40(11):2682-2695. doi: 10.1109/TPAMI.2017.2783940. Epub 2017 Dec 15.

IEEE Trans Neural Netw. 2009 Nov;20(11):1820-36. doi: 10.1109/TNN.2009.2031144. Epub 2009 Sep 29.

Multilinear sparse principal component analysis.多元稀疏主成分分析。

IEEE Trans Neural Netw Learn Syst. 2014 Oct;25(10):1942-50. doi: 10.1109/TNNLS.2013.2297381.

Learning Low-Rank Class-Specific Dictionary and Sparse Intra-Class Variant Dictionary for Face Recognition.学习用于人脸识别的低秩特定类别字典和稀疏类内变体字典。

PLoS One. 2015 Nov 16;10(11):e0142403. doi: 10.1371/journal.pone.0142403. eCollection 2015.

Multilinear Graph Embedding: Representation and Regularization for Images.多线性图嵌入：图像的表示和正则化。

IEEE Trans Image Process. 2014 Feb;23(2):741-54. doi: 10.1109/TIP.2013.2292303.

Sparse alignment for robust tensor learning.用于鲁棒张量学习的稀疏对齐。

IEEE Trans Neural Netw Learn Syst. 2014 Oct;25(10):1779-92. doi: 10.1109/TNNLS.2013.2295717.

Simultaneously Discovering and Localizing Common Objects in Wild Images.在野外图像中同时发现和定位常见对象。

IEEE Trans Image Process. 2018 Sep;27(9):4503-4515. doi: 10.1109/TIP.2018.2839901.

Label Information Guided Graph Construction for Semi-Supervised Learning.基于标签信息引导的图构建的半监督学习方法。

IEEE Trans Image Process. 2017 Sep;26(9):4182-4192. doi: 10.1109/TIP.2017.2703120. Epub 2017 May 18.

Scene Graph Prediction with Limited Labels.基于有限标签的场景图预测

Proc IEEE Int Conf Comput Vis. 2019 Oct-Nov;2019:2580-2590. doi: 10.1109/iccv.2019.00267. Epub 2020 Feb 27.

MPCA: Multilinear Principal Component Analysis of Tensor Objects.MPCA：张量对象的多线性主成分分析

IEEE Trans Neural Netw. 2008 Jan;19(1):18-39. doi: 10.1109/TNN.2007.901277.

引用本文的文献

A Cascade Attention Based Facial Expression Recognition Network by Fusing Multi-Scale Spatio-Temporal Features.基于级联注意力的融合多尺度时空特征的面部表情识别网络。

Sensors (Basel). 2022 Feb 10;22(4):1350. doi: 10.3390/s22041350.

Hybrid Attention Cascade Network for Facial Expression Recognition.用于面部表情识别的混合注意力级联网络。

Sensors (Basel). 2021 Mar 12;21(6):2003. doi: 10.3390/s21062003.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

解析未标记数据中的变异模式。

Disentangling the Modes of Variation in Unlabelled Data.

作者信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献