Wang Li, Li Ren-Cang
IEEE Trans Neural Netw Learn Syst. 2020 Apr;31(4):1098-1112. doi: 10.1109/TNNLS.2019.2917696. Epub 2019 Jun 18.
We aim to automatically learn a latent graph structure in a low-dimensional space from high-dimensional, unsupervised data based on a unified density estimation framework for both feature extraction and feature selection, where the latent structure is considered as a compact and informative representation of the high-dimensional data. Based on this framework, two novel methods are proposed with very different but intuitive learning criteria from existing methods. The proposed feature extraction method can learn a set of embedded points in a low-dimensional space by naturally integrating the discriminative information of the input data with structure learning so that multiple disconnected embedding structures of data can be uncovered. The proposed feature selection method preserves the pairwise distances only on the optimal set of features and selects these features simultaneously. It not only obtains the optimal set of features but also learns both the structure and embeddings for visualization. Extensive experiments demonstrate that our proposed methods can achieve competitive quantitative (often better) results in terms of discriminant evaluation performance and are able to obtain the embeddings of smooth skeleton structures and select optimal features to unveil the correct graph structures of high-dimensional data sets.
我们旨在基于一个统一的密度估计框架,从高维无监督数据中自动学习低维空间中的潜在图结构,用于特征提取和特征选择,其中潜在结构被视为高维数据的紧凑且信息丰富的表示。基于此框架,提出了两种新颖的方法,其学习准则与现有方法截然不同但直观易懂。所提出的特征提取方法通过将输入数据的判别信息与结构学习自然地整合,能够在低维空间中学习一组嵌入点,从而揭示数据的多个不相连的嵌入结构。所提出的特征选择方法仅保留最优特征集上的成对距离,并同时选择这些特征。它不仅能获得最优特征集,还能学习用于可视化的结构和嵌入。大量实验表明,我们提出的方法在判别评估性能方面能够取得具有竞争力的定量(通常更好)结果,并且能够获得平滑骨架结构的嵌入,并选择最优特征以揭示高维数据集的正确图结构。