一种用于确定最优聚类数的自适应模糊均值算法

A Self-Adaptive Fuzzy -Means Algorithm for Determining the Optimal Number of Clusters.

作者信息

Ren Min, Liu Peiyu, Wang Zhihao, Yi Jing

机构信息

School of Information Science and Engineering, Shandong Normal University, Jinan, Shandong, China; School of Mathematic and Quantitative Economics, Shandong University of Finance and Economics, Jinan, Shandong, China; Shandong Provincial Key Laboratory for Distributed Computer Software Novel Technology, Jinan, Shandong, China.

School of Information Science and Engineering, Shandong Normal University, Jinan, Shandong, China; Shandong Provincial Key Laboratory for Distributed Computer Software Novel Technology, Jinan, Shandong, China.

出版信息

Comput Intell Neurosci. 2016;2016:2647389. doi: 10.1155/2016/2647389. Epub 2016 Nov 29.

DOI:10.1155/2016/2647389

PMID:28042291

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5153549/

Abstract

For the shortcoming of fuzzy -means algorithm (FCM) needing to know the number of clusters in advance, this paper proposed a new self-adaptive method to determine the optimal number of clusters. Firstly, a density-based algorithm was put forward. The algorithm, according to the characteristics of the dataset, automatically determined the possible maximum number of clusters instead of using the empirical rule [Formula: see text] and obtained the optimal initial cluster centroids, improving the limitation of FCM that randomly selected cluster centroids lead the convergence result to the local minimum. Secondly, this paper, by introducing a penalty function, proposed a new fuzzy clustering validity index based on fuzzy compactness and separation, which ensured that when the number of clusters verged on that of objects in the dataset, the value of clustering validity index did not monotonically decrease and was close to zero, so that the optimal number of clusters lost robustness and decision function. Then, based on these studies, a self-adaptive FCM algorithm was put forward to estimate the optimal number of clusters by the iterative trial-and-error process. At last, experiments were done on the UCI, KDD Cup 1999, and synthetic datasets, which showed that the method not only effectively determined the optimal number of clusters, but also reduced the iteration of FCM with the stable clustering result.

摘要

针对模糊均值算法（FCM）需要预先知道聚类数量的缺点，本文提出了一种新的自适应方法来确定最优聚类数。首先，提出了一种基于密度的算法。该算法根据数据集的特征，自动确定可能的最大聚类数，而不是使用经验规则[公式：见原文]，并获得最优的初始聚类中心，改善了FCM随机选择聚类中心导致收敛结果陷入局部最小值的局限性。其次，本文通过引入惩罚函数，提出了一种基于模糊紧致性和分离度的新的模糊聚类有效性指标，确保当聚类数接近数据集中对象的数量时，聚类有效性指标的值不会单调下降并接近零，从而使最优聚类数失去鲁棒性和决策功能。然后，基于这些研究，提出了一种自适应FCM算法，通过迭代试错过程来估计最优聚类数。最后，在UCI、1999年KDD杯和合成数据集上进行了实验，结果表明该方法不仅能有效地确定最优聚类数，还能减少FCM的迭代次数，且聚类结果稳定。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7c0c/5153549/94d4a685bcef/CIN2016-2647389.001.jpg

相似文献

A Self-Adaptive Fuzzy -Means Algorithm for Determining the Optimal Number of Clusters.一种用于确定最优聚类数的自适应模糊均值算法

Comput Intell Neurosci. 2016;2016:2647389. doi: 10.1155/2016/2647389. Epub 2016 Nov 29.

An improved fuzzy c-means clustering algorithm based on shadowed sets and PSO.一种基于阴影集和粒子群优化算法的改进型模糊C均值聚类算法

Comput Intell Neurosci. 2014;2014:368628. doi: 10.1155/2014/368628. Epub 2014 Nov 12.

A simple and fast method to determine the parameters for fuzzy c-means cluster analysis.一种用于确定模糊 C 均值聚类分析参数的简单快速方法。

Bioinformatics. 2010 Nov 15;26(22):2841-8. doi: 10.1093/bioinformatics/btq534. Epub 2010 Sep 29.

Alpha-cut implemented fuzzy clustering algorithms and switching regressions.实现了阿尔法切割的模糊聚类算法和切换回归。

IEEE Trans Syst Man Cybern B Cybern. 2008 Jun;38(3):588-603. doi: 10.1109/TSMCB.2008.915537.

[MR brain image segmentation based on modified fuzzy C-means clustering using fuzzy GIbbs random field].基于使用模糊吉布斯随机场的改进模糊C均值聚类的磁共振脑图像分割

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2008 Dec;25(6):1264-70.

A New Validity Index Based on Fuzzy Energy and Fuzzy Entropy Measures in Fuzzy Clustering Problems.基于模糊聚类问题中模糊能量和模糊熵测度的一种新有效性指标。

Entropy (Basel). 2020 Oct 23;22(11):1200. doi: 10.3390/e22111200.

A robust fuzzy local information C-Means clustering algorithm.一种鲁棒的模糊局部信息 C-均值聚类算法。

IEEE Trans Image Process. 2010 May;19(5):1328-37. doi: 10.1109/TIP.2010.2040763. Epub 2010 Jan 19.

A Hybrid Method for Image Segmentation Based on Artificial Fish Swarm Algorithm and Fuzzy c-Means Clustering.一种基于人工鱼群算法和模糊c均值聚类的图像分割混合方法。

Comput Math Methods Med. 2015;2015:120495. doi: 10.1155/2015/120495. Epub 2015 Nov 16.

Generalized fuzzy C-means clustering algorithm with improved fuzzy partitions.具有改进模糊划分的广义模糊C均值聚类算法

IEEE Trans Syst Man Cybern B Cybern. 2009 Jun;39(3):578-91. doi: 10.1109/TSMCB.2008.2004818. Epub 2009 Jan 23.

An improved fuzzy C-means clustering algorithm for assisted therapy of chronic bronchitis.一种用于慢性支气管炎辅助治疗的改进型模糊C均值聚类算法。

Technol Health Care. 2015;23(6):699-713. doi: 10.3233/THC-151023.

引用本文的文献

Magnetic Resonance Features of Acquired Immune Deficiency Syndrome Involving Central Nervous System Diseases by Intelligent Fuzzy C-Means Clustering (FCM) Algorithm.智能模糊 C-均值聚类（FCM）算法在获得性免疫缺陷综合征涉及中枢神经系统疾病中的磁共振成像特征。

Comput Math Methods Med. 2022 Jul 5;2022:4955555. doi: 10.1155/2022/4955555. eCollection 2022.

Clustering of fMRI data: the elusive optimal number of clusters.功能磁共振成像（fMRI）数据的聚类：难以捉摸的最佳聚类数

PeerJ. 2018 Oct 3;6:e5416. doi: 10.7717/peerj.5416. eCollection 2018.

Automated detection of photoreceptor disruption in mild diabetic retinopathy on volumetric optical coherence tomography.基于容积光学相干断层扫描技术自动检测轻度糖尿病视网膜病变中的光感受器破坏情况。

Biomed Opt Express. 2017 Nov 7;8(12):5384-5398. doi: 10.1364/BOE.8.005384. eCollection 2017 Dec 1.

本文引用的文献

Machine learning. Clustering by fast search and find of density peaks.机器学习。基于密度峰值的快速搜索和发现的聚类。

Science. 2014 Jun 27;344(6191):1492-6. doi: 10.1126/science.1242072.

A new validity measure for a correlation-based fuzzy c-means clustering algorithm.一种基于相关性的模糊 c 均值聚类算法的新有效性度量。

Annu Int Conf IEEE Eng Med Biol Soc. 2009;2009:3865-8. doi: 10.1109/IEMBS.2009.5332582.

Mercer kernel-based clustering in feature space.特征空间中基于 Mercer 核的聚类

IEEE Trans Neural Netw. 2002;13(3):780-4. doi: 10.1109/TNN.2002.1000150.

Clustering by passing messages between data points.通过在数据点之间传递信息进行聚类。

Science. 2007 Feb 16;315(5814):972-6. doi: 10.1126/science.1136800. Epub 2007 Jan 11.

IEEE Trans Pattern Anal Mach Intell. 2004 Apr;26(4):434-48. doi: 10.1109/TPAMI.2004.1265860.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

一种用于确定最优聚类数的自适应模糊均值算法

A Self-Adaptive Fuzzy -Means Algorithm for Determining the Optimal Number of Clusters.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献