Taskin Gulsen, Crawford Melba M
IEEE Trans Image Process. 2019 May 15. doi: 10.1109/TIP.2019.2915162.
Unsupervised manifold learning has become accepted as an important tool for reducing dimensionality of a dataset by finding its meaningful low-dimensional representation lying on an unknown nonlinear subspace. Most manifold learning methods only embed an existing dataset, but do not provide an explicit mapping function for novel out-of-sample data, thereby potentially resulting in an ineffective tool for classification purposes, particularly for iterative methods such as active learning. To address this issue, out-of-sample extension methods have been introduced to generalize an existing embedding of new samples. In this work, a novel out-of-sample method is introduced by utilizing High Dimensional Model Representation (HDMR) as a nonlinear multivariate regression with the Tikhonov regularizer for unsupervised manifold learning algorithms. The proposed method was extensively analyzed using illustrative datasets sampled from known manifolds. Several experiments with 3D synthetic datasets and face recognition datasets were also conducted, and the performance of the proposed method was compared to several well-known out-of-sample methods. The results obtained with Locally Linear Embedding (LLE), Laplacian Eigenmaps (LE), and t-Distributed Stochastic Neighbor Embedding (t-SNE) showed that the proposed method achieves competitive even better performance than the other out-of-sample methods.
无监督流形学习已被公认为是一种重要工具,可通过找到位于未知非线性子空间上的有意义的低维表示来降低数据集的维度。大多数流形学习方法仅对现有数据集进行嵌入,但未为新的样本外数据提供显式映射函数,从而可能导致该工具在分类目的上无效,特别是对于诸如主动学习之类的迭代方法。为了解决这个问题,已引入样本外扩展方法来推广新样本的现有嵌入。在这项工作中,通过将高维模型表示(HDMR)用作具有蒂霍诺夫正则化器的非线性多元回归,为无监督流形学习算法引入了一种新颖的样本外方法。使用从已知流形采样的说明性数据集对所提出的方法进行了广泛分析。还对3D合成数据集和人脸识别数据集进行了几次实验,并将所提出方法的性能与几种著名的样本外方法进行了比较。使用局部线性嵌入(LLE)、拉普拉斯特征映射(LE)和t分布随机邻域嵌入(t-SNE)获得的结果表明,所提出的方法甚至比其他样本外方法具有更好的竞争力。