State Key Laboratory of Integrated Services Networks, Xidian University, Shaanxi 710071, China.
State Key Laboratory of Integrated Services Networks, Xidian University, Shaanxi 710071, China.
Neural Netw. 2019 Nov;119:85-92. doi: 10.1016/j.neunet.2019.07.015. Epub 2019 Aug 1.
Robust Principal Component Analysis (RPCA) is a powerful tool in machine learning and data mining problems. However, in many real-world applications, RPCA is unable to well encode the intrinsic geometric structure of data, thereby failing to obtain the lowest rank representation from the corrupted data. To cope with this problem, most existing methods impose the smooth manifold, which is artificially constructed by the original data. This reduces the flexibility of algorithms. Moreover, the graph, which is artificially constructed by the corrupted data, is inexact and does not characterize the true intrinsic structure of real data. To tackle this problem, we propose an adaptive RPCA (ARPCA) to recover the clean data from the high-dimensional corrupted data. Our proposed model is advantageous due to: (1) The graph is adaptively constructed upon the clean data such that the system is more flexible. (2) Our model simultaneously learns both clean data and similarity matrix that determines the construction of graph. (3) The clean data has the lowest-rank structure that enforces to correct the corruptions. Extensive experiments on several datasets illustrate the effectiveness of our model for clustering and low-rank recovery tasks.
鲁棒主成分分析(RPCA)是机器学习和数据挖掘问题中的强大工具。然而,在许多实际应用中,RPCA 无法很好地编码数据的内在几何结构,从而无法从损坏的数据中获得最低秩表示。为了解决这个问题,大多数现有方法都采用了平滑流形,这是由原始数据人为构造的。这降低了算法的灵活性。此外,由损坏的数据人为构造的图是不精确的,并且不能描述真实数据的真实内在结构。为了解决这个问题,我们提出了一种自适应 RPCA(ARPCA)方法,从高维损坏数据中恢复干净的数据。我们提出的模型具有以下优势:(1)图是根据干净的数据自适应构建的,因此系统更加灵活。(2)我们的模型同时学习干净的数据和相似度矩阵,这决定了图的构建。(3)干净的数据具有最低秩结构,强制纠正损坏。在几个数据集上的广泛实验表明,我们的模型在聚类和低秩恢复任务中非常有效。