Geng Xin, Zhan De-Chuan, Zhou Zhi-Hua
National Laboratory for Novel Software Technology, Nanjing University, China.
IEEE Trans Syst Man Cybern B Cybern. 2005 Dec;35(6):1098-107. doi: 10.1109/tsmcb.2005.850151.
When performing visualization and classification, people often confront the problem of dimensionality reduction. Isomap is one of the most promising nonlinear dimensionality reduction techniques. However, when Isomap is applied to real-world data, it shows some limitations, such as being sensitive to noise. In this paper, an improved version of Isomap, namely S-Isomap, is proposed. S-Isomap utilizes class information to guide the procedure of nonlinear dimensionality reduction. Such a kind of procedure is called supervised nonlinear dimensionality reduction. In S-Isomap, the neighborhood graph of the input data is constructed according to a certain kind of dissimilarity between data points, which is specially designed to integrate the class information. The dissimilarity has several good properties which help to discover the true neighborhood of the data and, thus, makes S-Isomap a robust technique for both visualization and classification, especially for real-world problems. In the visualization experiments, S-Isomap is compared with Isomap, LLE, and WeightedIso. The results show that S-Isomap performs the best. In the classification experiments, S-Isomap is used as a preprocess of classification and compared with Isomap, WeightedIso, as well as some other well-established classification methods, including the K-nearest neighbor classifier, BP neural network, J4.8 decision tree, and SVM. The results reveal that S-Isomap excels compared to Isomap and WeightedIso in classification, and it is highly competitive with those well-known classification methods.
在进行可视化和分类时,人们常常面临降维问题。等距映射(Isomap)是最有前景的非线性降维技术之一。然而,当将等距映射应用于实际数据时,它表现出一些局限性,比如对噪声敏感。本文提出了等距映射的一种改进版本,即S - 等距映射(S-Isomap)。S - 等距映射利用类别信息来指导非线性降维过程。这种过程被称为监督非线性降维。在S - 等距映射中,根据数据点之间某种特定的差异构建输入数据的邻域图,该差异是专门为整合类别信息而设计的。这种差异具有几个良好的特性,有助于发现数据的真实邻域,因此使S - 等距映射成为一种用于可视化和分类的强大技术,尤其适用于实际问题。在可视化实验中,将S - 等距映射与等距映射、局部线性嵌入(LLE)和加权等距映射(WeightedIso)进行了比较。结果表明S - 等距映射表现最佳。在分类实验中,S - 等距映射被用作分类的预处理,并与等距映射、加权等距映射以及其他一些成熟的分类方法进行比较,包括K近邻分类器、BP神经网络、J4.8决策树和支持向量机(SVM)。结果显示,在分类方面,S - 等距映射比等距映射和加权等距映射表现更优,并且与那些知名分类方法具有很强的竞争力。