Lin Tong, Zha Hongbin
State Key Laboratory of Machine Perception, Science Building, School of EECS, Peking University, Beijing, China.
IEEE Trans Pattern Anal Mach Intell. 2008 May;30(5):796-809. doi: 10.1109/TPAMI.2007.70735.
Recently, manifold learning has been widely exploited in pattern recognition, data analysis, and machine learning. This paper presents a novel framework, called Riemannian manifold learning (RML), based on the assumption that the input high-dimensional data lie on an intrinsically low-dimensional Riemannian manifold. The main idea is to formulate the dimensionality reduction problem as a classical problem in Riemannian geometry, i.e., how to construct coordinate charts for a given Riemannian manifold? We implement the Riemannian normal coordinate chart, which has been the most widely used in Riemannian geometry, for a set of unorganized data points. First, two input parameters (the neighborhood size k and the intrinsic dimension d) are estimated based on an efficient simplicial reconstruction of the underlying manifold. Then, the normal coordinates are computed to map the input high-dimensional data into a low-dimensional space. Experiments on synthetic data as well as real world images demonstrate that our algorithm can learn intrinsic geometric structures of the data, preserve radial geodesic distances, and yield regular embeddings.
近年来,流形学习在模式识别、数据分析和机器学习中得到了广泛应用。本文基于输入的高维数据位于本质低维黎曼流形这一假设,提出了一种名为黎曼流形学习(RML)的新颖框架。其主要思想是将降维问题表述为黎曼几何中的一个经典问题,即如何为给定的黎曼流形构建坐标图?我们针对一组无组织的数据点实现了黎曼法坐标图,它在黎曼几何中应用最为广泛。首先,基于基础流形的高效单纯形重建估计两个输入参数(邻域大小k和内在维度d)。然后,计算法坐标以将输入的高维数据映射到低维空间。对合成数据以及真实世界图像的实验表明,我们的算法能够学习数据的内在几何结构,保留径向测地距离,并产生规则嵌入。