Suppr超能文献

解析未标记数据中的变异模式。

Disentangling the Modes of Variation in Unlabelled Data.

作者信息

Wang Mengjiao, Panagakis Yannis, Snape Patrick, Zafeiriou Stefanos P

出版信息

IEEE Trans Pattern Anal Mach Intell. 2018 Nov;40(11):2682-2695. doi: 10.1109/TPAMI.2017.2783940. Epub 2017 Dec 15.

Abstract

Statistical methods are of paramount importance in discovering the modes of variation in visual data. The Principal Component Analysis (PCA) is probably the most prominent method for extracting a single mode of variation in the data. However, in practice, several factors contribute to the appearance of visual objects including pose, illumination, and deformation, to mention a few. To extract these modes of variations from visual data, several supervised methods, such as the TensorFaces relying on multilinear (tensor) decomposition have been developed. The main drawbacks of such methods is that they require both labels regarding the modes of variations and the same number of samples under all modes of variations (e.g., the same face under different expressions, poses etc.). Therefore, their applicability is limited to well-organised data, usually captured in well-controlled conditions. In this paper, we propose a novel general multilinear matrix decomposition method that discovers the multilinear structure of possibly incomplete sets of visual data in unsupervised setting (i.e., without the presence of labels). We also propose extensions of the method with sparsity and low-rank constraints in order to handle noisy data, captured in unconstrained conditions. Besides that, a graph-regularised variant of the method is also developed in order to exploit available geometric or label information for some modes of variations. We demonstrate the applicability of the proposed method in several computer vision tasks, including Shape from Shading (SfS) (in the wild and with occlusion removal), expression transfer, and estimation of surface normals from images captured in the wild.

摘要

统计方法在发现视觉数据的变化模式方面至关重要。主成分分析(PCA)可能是提取数据中单一变化模式最突出的方法。然而,在实际中,有几个因素会影响视觉对象的外观,比如姿态、光照和变形等等。为了从视觉数据中提取这些变化模式,已经开发了几种监督方法,例如依赖多线性(张量)分解的张量脸方法。这类方法的主要缺点是,它们既需要关于变化模式的标签,又需要在所有变化模式下具有相同数量的样本(例如,同一面部在不同表情、姿态等下的样本)。因此,它们的适用性仅限于通常在良好控制条件下捕获的组织良好的数据。在本文中,我们提出了一种新颖的通用多线性矩阵分解方法,该方法在无监督设置(即没有标签)下发现可能不完整的视觉数据集的多线性结构。我们还提出了该方法的扩展,带有稀疏性和低秩约束,以处理在无约束条件下捕获的噪声数据。除此之外,还开发了该方法的一种图正则化变体,以便利用某些变化模式的可用几何或标签信息。我们在几个计算机视觉任务中展示了所提出方法的适用性,包括从阴影恢复形状(SfS)(在自然场景中以及去除遮挡的情况下)、表情迁移以及从自然场景中捕获的图像估计表面法线。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验