Bicriterion cluster analysis.

Affiliations

Universitaire Catholique de Mons, Mons, Belgium; Institut d'Economie Scientifique et de Gestion, Lille, France.

Publication information

IEEE Trans Pattern Anal Mach Intell. 1980 Apr;2(4):277-91. doi: 10.1109/tpami.1980.4767027.

Abstract

Cluster analysis is concerned with the problem of partitioning a given set of entities into homogeneous and well-separated subsets called clusters. The concepts of homogeneity and of separation can be made precise when a measure of dissimilarity between the entities is given. Let us define the diameter of a partition of the given set of entities into clusters as the maximum dissimilarity between any pair of entities in the same cluster and the split of a partition as the minimum dissimilarity between entities in different clusters. The problems of determining a partition into a given number of clusters with minimum diameter (i.e., a partition of maximum homogeneity) or with maximum split (i.e., a partition of maximum separation) are first considered. It is shown that the latter problem can be solved by the classical single-link clustering algorithm, while the former can be solved by a graph-theoretic algorithm involving the optimal coloration of a sequence of partial graphs, described in more detail in a previous paper. A partition into a given number of clusters will be called efficient if and only if there exists no partition into at most the same number of clusters with smaller diameter and not smaller split or with larger split and not larger diameter. Two efficient partitions are called equivalent if and only if they have the same values for the split and for the diameter.
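
The definitions in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`diameter`, `split`, `dominates`, `max_split_partition`) and the four-point example are assumptions made here for illustration, and the single-link step is written as a naive merge loop rather than an efficient algorithm. The graph-coloring method for the minimum-diameter problem is not reproduced, since the abstract only refers to a previous paper for it.

```python
from itertools import combinations


def diameter(D, labels):
    """Largest dissimilarity between two entities in the same cluster."""
    pairs = combinations(range(len(labels)), 2)
    return max((D[i][j] for i, j in pairs if labels[i] == labels[j]), default=0)


def split(D, labels):
    """Smallest dissimilarity between two entities in different clusters."""
    pairs = combinations(range(len(labels)), 2)
    return min((D[i][j] for i, j in pairs if labels[i] != labels[j]),
               default=float("inf"))


def dominates(a, b):
    """True if (diameter, split) pair a beats pair b in the abstract's sense:
    smaller diameter and not smaller split, or larger split and not larger
    diameter."""
    (da, sa), (db, sb) = a, b
    return (da < db and sa >= sb) or (sa > sb and da <= db)


def max_split_partition(D, k):
    """Naive single-link clustering: repeatedly merge the two clusters joined
    by the smallest remaining dissimilarity until k clusters are left."""
    n = len(D)
    labels = list(range(n))  # every entity starts in its own cluster
    edges = sorted((D[i][j], i, j) for i, j in combinations(range(n), 2))
    clusters = n
    for _, i, j in edges:
        if clusters == k:
            break
        if labels[i] != labels[j]:
            old, new = labels[j], labels[i]
            labels = [new if lab == old else lab for lab in labels]
            clusters -= 1
    return labels


# Hypothetical example: four entities on a line, dissimilarity = distance.
points = [0, 1, 3, 7]
D = [[abs(p - q) for q in points] for p in points]

single_link = max_split_partition(D, 2)  # groups {0, 1, 3} and {7}
other = [0, 0, 1, 1]                     # groups {0, 1} and {3, 7}

sl_scores = (diameter(D, single_link), split(D, single_link))
other_scores = (diameter(D, other), split(D, other))
```

On this toy data the single-link partition has diameter 3 and split 4, while the alternative partition has diameter 4 and split 2, so the alternative is not efficient: `dominates(sl_scores, other_scores)` is true. Establishing that a partition is efficient would require comparing it against every partition into at most the same number of clusters, which this sketch does not attempt.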

