Suppr超能文献

共享高斯过程潜在变量模型用于不完全多视图聚类。

Shared Gaussian Process Latent Variable Model for Incomplete Multiview Clustering.

出版信息

IEEE Trans Cybern. 2020 Jan;50(1):61-73. doi: 10.1109/TCYB.2018.2863790. Epub 2018 Aug 30.

Abstract

These days, many multiview learning methods have been proposed by integrating the complementary information of multiple views and can significantly improve the performance of machine learning tasks comparing with single-view learning methods. However, most of these methods fail to learn better models when the multiview data are unpaired (or partially paired) or incomplete (or partially complete). Although some previous attempts have been made to address these problems, these methods often lead to poor results when dealing with incomplete multiview data that contain a relatively large number of missing instances. In fact, this incomplete problem is more challenging than the unpaired problem since less shared information can be caught by the model in the former case. In this paper, we propose a shared Gaussian process (GP) latent variable model for incomplete multiview clustering to gain the merits of two worlds (i.e., GP and multiview learning). Specifically, it learns a set of intentionally aligned representative auxiliary points in individual views jointly to not only compensate for missing instances but also implement the group-level constraint. Thus, the shared information among these views can be explicitly built into the model. All of the hyper-parameters and auxiliary points are simultaneously learned by variational inference. Compared with the existing methods, our method naturally inherits the advantages of GP. Furthermore, it is also straightforwardly extended to cases with more than two views without adding any complexity in formulation. In the experiments, we compare it with the state-of-the-art methods for incomplete multiview data clustering to demonstrate its superiorities.

摘要

如今,许多多视图学习方法通过整合多视图的互补信息得到了提出,并与单视图学习方法相比,显著提高了机器学习任务的性能。然而,当多视图数据未配对(或部分配对)或不完整(或部分完整)时,大多数这些方法都无法学习到更好的模型。尽管之前已经有一些尝试来解决这些问题,但这些方法在处理包含相对大量缺失实例的不完整多视图数据时,往往会导致较差的结果。事实上,这个不完整的问题比未配对的问题更具挑战性,因为在前一种情况下,模型可以捕捉到的共享信息更少。在本文中,我们提出了一种用于不完整多视图聚类的共享高斯过程(GP)潜在变量模型,以获得两个世界(即 GP 和多视图学习)的优势。具体来说,它共同学习一组有意对齐的代表辅助点,以不仅补偿缺失实例,而且实现组级别的约束。因此,这些视图之间的共享信息可以明确地构建到模型中。所有超参数和辅助点都是通过变分推理同时学习的。与现有方法相比,我们的方法自然继承了 GP 的优势。此外,它也可以直接扩展到超过两个视图的情况,而无需在公式中增加任何复杂度。在实验中,我们将其与现有的用于不完整多视图数据聚类的方法进行比较,以证明其优越性。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验