大型差异数据集的地形制图。

University of Bielefeld, 33501 Bielefeld, Germany.

Neural Comput. 2010 Sep 1;22(9):2229-84. doi: 10.1162/NECO_a_00012.

Topographic maps such as the self-organizing map (SOM) or neural gas (NG) constitute powerful data mining techniques that allow simultaneously clustering data and inferring their topological structure, such that additional features, for example, browsing, become available. Both methods have been introduced for vectorial data sets; they require a classical feature encoding of information. Often data are available in the form of pairwise distances only, such as arise from a kernel matrix, a graph, or some general dissimilarity measure. In such cases, NG and SOM cannot be applied directly. In this article, we introduce relational topographic maps as an extension of relational clustering algorithms, which offer prototype-based representations of dissimilarity data, to incorporate neighborhood structure. These methods are equivalent to the standard (vectorial) techniques if a Euclidean embedding exists, while preventing the need to explicitly compute such an embedding. Extending these techniques for the general case of non-Euclidean dissimilarities makes possible an interpretation of relational clustering as clustering in pseudo-Euclidean space. We compare the methods to well-known clustering methods for proximity data based on deterministic annealing and discuss how far convergence can be guaranteed in the general case. Relational clustering is quadratic in the number of data points, which makes the algorithms infeasible for huge data sets. We propose an approximate patch version of relational clustering that runs in linear time. The effectiveness of the methods is demonstrated in a number of examples.

地形地图，如自组织映射（SOM）或神经气体（NG），构成了强大的数据挖掘技术，允许同时对数据进行聚类并推断其拓扑结构，从而提供了其他功能，例如浏览。这两种方法都已被引入到向量数据集；它们需要对信息进行经典的特征编码。通常，数据仅以成对距离的形式出现，例如核矩阵、图或某些一般相似度度量中出现的距离。在这种情况下，NG 和 SOM 不能直接应用。在本文中，我们引入了关系地形图作为关系聚类算法的扩展，该算法为相似性数据提供基于原型的表示形式，以合并邻域结构。如果存在欧几里得嵌入，则这些方法等同于标准（向量）技术，同时避免了显式计算此类嵌入的需要。将这些技术扩展到非欧几里得相似度的一般情况，使得关系聚类可以解释为伪欧几里得空间中的聚类。我们将这些方法与基于确定性退火的基于相似性数据的知名聚类方法进行比较，并讨论在一般情况下可以保证多远的收敛性。关系聚类在数据点的数量上是二次的，这使得算法对于庞大的数据集不可行。我们提出了一种关系聚类的近似补丁版本，其运行时间为线性。在许多示例中证明了这些方法的有效性。

相似文献

Topographic mapping of large dissimilarity data sets.

Neural Comput. 2010 Sep 1;22(9):2229-84. doi: 10.1162/NECO_a_00012.

Local matrix learning in clustering and applications for manifold visualization.

Neural Netw. 2010 May;23(4):476-86. doi: 10.1016/j.neunet.2009.12.003. Epub 2009 Dec 22.

Clustering: a neural network approach.

Neural Netw. 2010 Jan;23(1):89-107. doi: 10.1016/j.neunet.2009.08.007. Epub 2009 Aug 29.

Fast algorithm and implementation of dissimilarity self-organizing maps.

Neural Netw. 2006 Jul-Aug;19(6-7):855-63. doi: 10.1016/j.neunet.2006.05.002. Epub 2006 Jun 12.

Advanced visualization of self-organizing maps with vector fields.

Neural Netw. 2006 Jul-Aug;19(6-7):911-22. doi: 10.1016/j.neunet.2006.05.013. Epub 2006 Jun 19.

Self-organizing maps and clustering methods for matrix data.

Neural Netw. 2004 Oct-Nov;17(8-9):1211-29. doi: 10.1016/j.neunet.2004.06.012.

Twin kernel embedding.

IEEE Trans Pattern Anal Mach Intell. 2008 Aug;30(8):1490-5. doi: 10.1109/TPAMI.2008.74.

LEGClust- a clustering algorithm based on layered entropic subgraphs.

IEEE Trans Pattern Anal Mach Intell. 2008 Jan;30(1):62-75. doi: 10.1109/TPAMI.2007.1142.

Detecting clusters of different geometrical shapes in microarray gene expression data.

Bioinformatics. 2005 May 1;21(9):1927-34. doi: 10.1093/bioinformatics/bti251. Epub 2005 Jan 12.

A novel kernel method for clustering.

IEEE Trans Pattern Anal Mach Intell. 2005 May;27(5):801-5. doi: 10.1109/TPAMI.2005.88.

引用本文的文献

Fractal Geometry Meets Computational Intelligence: Future Perspectives.

Adv Neurobiol. 2024;36:983-997. doi: 10.1007/978-3-031-47606-8_48.

Suppr 超能文献

核心技术专利：CN118964589B侵权必究

相似文献

Topographic mapping of large dissimilarity data sets.

Neural Comput. 2010 Sep 1;22(9):2229-84. doi: 10.1162/NECO_a_00012.

Local matrix learning in clustering and applications for manifold visualization.

Neural Netw. 2010 May;23(4):476-86. doi: 10.1016/j.neunet.2009.12.003. Epub 2009 Dec 22.

Clustering: a neural network approach.

Neural Netw. 2010 Jan;23(1):89-107. doi: 10.1016/j.neunet.2009.08.007. Epub 2009 Aug 29.

Fast algorithm and implementation of dissimilarity self-organizing maps.

Neural Netw. 2006 Jul-Aug;19(6-7):855-63. doi: 10.1016/j.neunet.2006.05.002. Epub 2006 Jun 12.

Advanced visualization of self-organizing maps with vector fields.

Neural Netw. 2006 Jul-Aug;19(6-7):911-22. doi: 10.1016/j.neunet.2006.05.013. Epub 2006 Jun 19.

Self-organizing maps and clustering methods for matrix data.

Neural Netw. 2004 Oct-Nov;17(8-9):1211-29. doi: 10.1016/j.neunet.2004.06.012.

Twin kernel embedding.

IEEE Trans Pattern Anal Mach Intell. 2008 Aug;30(8):1490-5. doi: 10.1109/TPAMI.2008.74.

LEGClust- a clustering algorithm based on layered entropic subgraphs.

IEEE Trans Pattern Anal Mach Intell. 2008 Jan;30(1):62-75. doi: 10.1109/TPAMI.2007.1142.

Detecting clusters of different geometrical shapes in microarray gene expression data.

Bioinformatics. 2005 May 1;21(9):1927-34. doi: 10.1093/bioinformatics/bti251. Epub 2005 Jan 12.

A novel kernel method for clustering.

IEEE Trans Pattern Anal Mach Intell. 2005 May;27(5):801-5. doi: 10.1109/TPAMI.2005.88.

引用本文的文献

Fractal Geometry Meets Computational Intelligence: Future Perspectives.

Adv Neurobiol. 2024;36:983-997. doi: 10.1007/978-3-031-47606-8_48.

Topographic mapping of large dissimilarity data sets.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献