Vidal René, Ma Yi, Sastry Shankar
Center for Imaging Science, Department of Biomedical Engineering, The Johns Hopkins University, 308B Clark Hall, 3400 N. Charles Street, Baltimore, MD 21218, USA.
IEEE Trans Pattern Anal Mach Intell. 2005 Dec;27(12):1945-59. doi: 10.1109/TPAMI.2005.244.
This paper presents an algebro-geometric solution to the problem of segmenting an unknown number of subspaces of unknown and varying dimensions from sample data points. We represent the subspaces with a set of homogeneous polynomials whose degree is the number of subspaces and whose derivatives at a data point give normal vectors to the subspace passing through the point. When the number of subspaces is known, we show that these polynomials can be estimated linearly from data; hence, subspace segmentation is reduced to classifying one point per subspace. We select these points optimally from the data set by minimizing certain distance function, thus dealing automatically with moderate noise in the data. A basis for the complement of each subspace is then recovered by applying standard PCA to the collection of derivatives (normal vectors). Extensions of GPCA that deal with data in a high-dimensional space and with an unknown number of subspaces are also presented. Our experiments on low-dimensional data show that GPCA outperforms existing algebraic algorithms based on polynomial factorization and provides a good initialization to iterative techniques such as K-subspaces and Expectation Maximization. We also present applications of GPCA to computer vision problems such as face clustering, temporal video segmentation, and 3D motion segmentation from point correspondences in multiple affine views.
本文提出了一种代数几何解决方案,用于从样本数据点中分割出数量未知且维度未知且变化的子空间。我们用一组齐次多项式来表示这些子空间,其次数为子空间的数量,并且在数据点处的导数给出通过该点的子空间的法向量。当子空间的数量已知时,我们表明这些多项式可以从数据中线性估计;因此,子空间分割就简化为对每个子空间的一个点进行分类。我们通过最小化某个距离函数从数据集中最优地选择这些点,从而自动处理数据中的适度噪声。然后通过对导数(法向量)的集合应用标准主成分分析(PCA)来恢复每个子空间的补空间的一个基。还提出了广义主成分分析(GPCA)的扩展,用于处理高维空间中的数据以及数量未知的子空间。我们在低维数据上的实验表明,GPCA优于基于多项式因式分解的现有代数算法,并为诸如K -子空间和期望最大化等迭代技术提供了良好的初始化。我们还展示了GPCA在计算机视觉问题中的应用,如人脸聚类、时间视频分割以及从多个仿射视图中的点对应关系进行3D运动分割。