On the impact of dissimilarity measure in k-modes clustering algorithm.

Authors

Ng Michael K, Li Mark Junjie, Huang Joshua Zhexue, He Zengyou

Affiliation

Department of Mathematics, Hong Kong Baptist University, Kowloon Tong, Hong Kong.

Publication information

IEEE Trans Pattern Anal Mach Intell. 2007 Mar;29(3):503-7. doi: 10.1109/TPAMI.2007.53.

DOI:10.1109/TPAMI.2007.53

PMID:17224620

Abstract

This correspondence describes extensions to the k-modes algorithm for clustering categorical data. By modifying a simple matching dissimilarity measure for categorical objects, a heuristic approach was developed in [4], [12] which allows the use of the k-modes paradigm to obtain a cluster with strong intrasimilarity and to efficiently cluster large categorical data sets. The main aim of this paper is to rigorously derive the updating formula of the k-modes clustering algorithm with the new dissimilarity measure and the convergence of the algorithm under the optimization framework.
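To make the setting concrete, here is a minimal Python sketch of a k-modes loop for categorical tuples. The abstract does not state the modified measure, so `frequency_weighted` is an assumption: it charges 1 for a mismatch and, for a match, 1 minus the share of current cluster members holding that category. The function names, the `weighted` flag, and the choice to keep the most-frequent-category mode update in both cases are hypothetical illustration choices, not the authors' derivation.

```python
from collections import Counter


def simple_matching(mode, obj):
    """Count the attributes on which mode and obj disagree."""
    return sum(m != x for m, x in zip(mode, obj))


def frequency_weighted(mode, obj, cluster):
    """Assumed modified measure (not taken from the abstract).

    A mismatch costs 1. A match on attribute j costs
    1 - (fraction of cluster members sharing the mode's value on j),
    so matching a rare category is penalised more than matching a common one.
    Falls back to simple matching when the cluster is empty.
    """
    if not cluster:
        return float(simple_matching(mode, obj))
    total = 0.0
    for j, (m, x) in enumerate(zip(mode, obj)):
        if m != x:
            total += 1.0
        else:
            share = sum(member[j] == m for member in cluster) / len(cluster)
            total += 1.0 - share
    return total


def update_mode(cluster):
    """Return the per-attribute most frequent category of a non-empty cluster."""
    return tuple(Counter(column).most_common(1)[0][0] for column in zip(*cluster))


def k_modes(data, initial_modes, weighted=False, max_iter=100):
    """Alternate assignment and mode update until the labels stop changing."""
    modes = [tuple(m) for m in initial_modes]
    clusters = [[] for _ in modes]  # members from the previous assignment
    labels = None

    def cost(l, x):
        if weighted:
            return frequency_weighted(modes[l], x, clusters[l])
        return simple_matching(modes[l], x)

    for _ in range(max_iter):
        # Assignment step: each object goes to its least dissimilar mode.
        new_labels = [min(range(len(modes)), key=lambda l: cost(l, x)) for x in data]
        if new_labels == labels:
            break
        labels = new_labels
        # Update step: recompute members and modes; empty clusters keep their mode.
        clusters = [[x for x, lab in zip(data, labels) if lab == l]
                    for l in range(len(modes))]
        for l, members in enumerate(clusters):
            if members:
                modes[l] = update_mode(members)
    return labels, modes
```

Calling `k_modes(data, [data[0], data[3]], weighted=True)` on a list of equal-length tuples runs the assumed frequency-weighted variant. On the first pass the clusters are still empty, so the assignment falls back to simple matching.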

Similar articles

On the impact of dissimilarity measure in k-modes clustering algorithm.

IEEE Trans Pattern Anal Mach Intell. 2007 Mar;29(3):503-7. doi: 10.1109/TPAMI.2007.53.

Automated variable weighting in k-means type clustering.

IEEE Trans Pattern Anal Mach Intell. 2005 May;27(5):657-68. doi: 10.1109/TPAMI.2005.95.

Dimensionality reduction of clustered data sets.

IEEE Trans Pattern Anal Mach Intell. 2008 Mar;30(3):535-40. doi: 10.1109/TPAMI.2007.70819.

General C-means clustering model.

IEEE Trans Pattern Anal Mach Intell. 2005 Aug;27(8):1197-211. doi: 10.1109/TPAMI.2005.160.

A novel kernel method for clustering.

IEEE Trans Pattern Anal Mach Intell. 2005 May;27(5):801-5. doi: 10.1109/TPAMI.2005.88.

Iterative RELIEF for feature weighting: algorithms, theories, and applications.

IEEE Trans Pattern Anal Mach Intell. 2007 Jun;29(6):1035-51. doi: 10.1109/TPAMI.2007.1093.

A modified K-means algorithm for circular invariant clustering.

IEEE Trans Pattern Anal Mach Intell. 2005 Dec;27(12):1856-65. doi: 10.1109/TPAMI.2005.230.

An optimization criterion for generalized discriminant analysis on undersampled problems.

IEEE Trans Pattern Anal Mach Intell. 2004 Aug;26(8):982-94. doi: 10.1109/TPAMI.2004.37.

Online clustering algorithms for radar emitter classification.

IEEE Trans Pattern Anal Mach Intell. 2005 Aug;27(8):1185-96. doi: 10.1109/TPAMI.2005.166.

A new distance measure for model-based sequence clustering.

IEEE Trans Pattern Anal Mach Intell. 2009 Jul;31(7):1325-31. doi: 10.1109/TPAMI.2008.268.

Cited by

Subtyping of common complex diseases and disorders by integrating heterogeneous data. Identifying clusters among women with lower urinary tract symptoms in the LURN study.

PLoS One. 2022 Jun 10;17(6):e0268547. doi: 10.1371/journal.pone.0268547. eCollection 2022.

A Memory-Efficient Encoding Method for Processing Mixed-Type Data on Machine Learning.

Entropy (Basel). 2020 Dec 9;22(12):1391. doi: 10.3390/e22121391.

Automatic and Fast Recognition of On-Road High-Emitting Vehicles Using an Optical Remote Sensing System.

Sensors (Basel). 2019 Aug 13;19(16):3540. doi: 10.3390/s19163540.

Clustering Categorical Data Using Community Detection Techniques.

Comput Intell Neurosci. 2017;2017:8986360. doi: 10.1155/2017/8986360. Epub 2017 Dec 21.

A Global-Relationship Dissimilarity Measure for the k-Modes Clustering Algorithm.

Comput Intell Neurosci. 2017;2017:3691316. doi: 10.1155/2017/3691316. Epub 2017 Mar 28.

Manipulating measurement scales in medical statistical analysis and data mining: A review of methodologies.

J Res Med Sci. 2014 Jan;19(1):47-56.

An efficient clustering algorithm for partitioning Y-short tandem repeats data.

BMC Res Notes. 2012 Oct 6;5:557. doi: 10.1186/1756-0500-5-557.
