基于密度核心的新型最小生成树聚类算法。

A Novel Minimum Spanning Tree Clustering Algorithm Based on Density Core.

机构信息

Key Laboratory of Dependable Service Computing in Cyber-Physical Society, Ministry of Education, Chongqing University, Chongqing 400044, China.

出版信息

Comput Intell Neurosci. 2022 Oct 5;2022:8496265. doi: 10.1155/2022/8496265. eCollection 2022.

DOI:10.1155/2022/8496265

PMID:36248922

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9556202/

Abstract

Clustering analysis is an unsupervised learning method, which has applications across many fields such as pattern recognition, machine learning, information security, and image segmentation. The density-based method, as one of the various clustering algorithms, has achieved good performance. However, it works poor in dealing with multidensity and complex-shaped datasets. Moreover, the result of this method depends heavily on the parameters we input. Thus, we propose a novel clustering algorithm (called the MST-DC) in this paper, which is based on the density core. Firstly, we employ the reverse nearest neighbors to extract core objects. Secondly, we use the minimum spanning tree algorithm to cluster the core objects. Finally, the remaining objects are assigned to the cluster to which their nearest core object belongs. The experimental results on several synthetic and real-world datasets show the superiority of the MST-DC to Kmeans, DBSCAN, DPC, DCore, SNNDPC, and LDP-MST.

摘要

聚类分析是一种无监督学习方法，它在模式识别、机器学习、信息安全和图像分割等多个领域都有应用。密度聚类方法作为聚类算法之一，已经取得了很好的效果。然而，它在处理多密度和复杂形状的数据集时表现不佳。此外，该方法的结果严重依赖于我们输入的参数。因此，我们在本文中提出了一种新的聚类算法（称为 MST-DC），它基于密度核心。首先，我们使用反向最近邻来提取核心对象。其次，我们使用最小生成树算法对核心对象进行聚类。最后，将剩余的对象分配给与其最近的核心对象所属的簇。在几个合成和真实数据集上的实验结果表明，MST-DC 优于 Kmeans、DBSCAN、DPC、DCore、SNNDPC 和 LDP-MST。