Zhuge Wenzhang, Luo Tingjin, Fan Ruidong, Tao Hong, Hou Chenping, Yi Dongyun
IEEE Trans Cybern. 2024 Mar;54(3):1708-1721. doi: 10.1109/TCYB.2023.3241171. Epub 2024 Feb 9.
With the advent of vast data collection ways, data are often with multiple modalities or coming from multiple sources. Traditional multiview learning often assumes that each example of data appears in all views. However, this assumption is too strict in some real applications such as multisensor surveillance system, where every view suffers from some data absent. In this article, we focus on how to classify such incomplete multiview data in semisupervised scenario and a method called absent multiview semisupervised classification (AMSC) has been proposed. Specifically, partial graph matrices are constructed independently by anchor strategy to measure the relationships among between each pair of present samples on each view. And to obtain unambiguous classification results for all unlabeled data points, AMSC learns view-specific label matrices and a common label matrix simultaneously. AMSC measures the similarity between pair of view-specific label vectors on each view by partial graph matrices, and consider the similarity between view-specific label vectors and class indicator vectors based on the common label matrix. To characterize the contributions of different views, the p th root integration strategy is adopted to incorporate the losses of different views. By further analyzing the relation between the p th root integration strategy and exponential decay integration strategy, we develop an efficient algorithm with proved convergence to solve the proposed nonconvex problem. To validate the effectiveness of AMSC, comparisons are made with some benchmark methods on real-world datasets and in the document classification scenario as well. The experimental results demonstrate the advantages of our proposed approach.
随着大量数据收集方式的出现,数据通常具有多种模态或来自多个来源。传统的多视图学习通常假设数据的每个示例都出现在所有视图中。然而,在一些实际应用中,如多传感器监视系统,这种假设过于严格,因为每个视图都会出现一些数据缺失的情况。在本文中,我们专注于如何在半监督场景中对这种不完整的多视图数据进行分类,并提出了一种称为缺失多视图半监督分类(AMSC)的方法。具体来说,通过锚定策略独立构建部分图矩阵,以测量每个视图上每对现有样本之间的关系。为了获得所有未标记数据点的明确分类结果,AMSC同时学习特定于视图的标签矩阵和一个公共标签矩阵。AMSC通过部分图矩阵测量每个视图上一对特定于视图的标签向量之间的相似性,并基于公共标签矩阵考虑特定于视图的标签向量与类指示向量之间的相似性。为了表征不同视图的贡献,采用第p次根积分策略来合并不同视图的损失。通过进一步分析第p次根积分策略与指数衰减积分策略之间的关系,我们开发了一种具有收敛性证明的高效算法来解决所提出的非凸问题。为了验证AMSC的有效性,在真实世界数据集和文档分类场景中与一些基准方法进行了比较。实验结果证明了我们提出的方法的优势。