Brusco Michael J, Cradit J Dennis
Department of Marketing, College of Business, Florida State University, Tallahassee 32306-1110, USA.
Br J Math Stat Psychol. 2005 Nov;58(Pt 2):319-32. doi: 10.1348/000711005X63890.
Partitioning indices associated with the within-cluster sums of pairwise dissimilarities often exhibit a systematic bias towards clusters of a particular size, whereas minimization of the partition diameter (i.e. the maximum dissimilarity element across all pairs of objects within the same cluster) does not typically have this problem. However, when the partition-diameter criterion is used, there is often a myriad of alternative optimal solutions that can vary significantly with respect to their substantive interpretation. We propose a bicriterion partitioning approach that considers both diameter and within-cluster sums in the optimization problem and facilitates selection from among the alternative optima. We developed several MATLAB-based exchange algorithms that rapidly provide excellent solutions to bicriterion partitioning problems. These algorithms were evaluated using synthetic data sets, as well as an empirical dissimilarity matrix.
与簇内成对差异之和相关的划分指标通常会对特定大小的簇表现出系统性偏差,而划分直径(即同一簇内所有对象对之间的最大差异元素)的最小化通常不存在这个问题。然而,当使用划分直径准则时,通常会有大量的替代最优解,这些解在实质解释方面可能会有很大差异。我们提出了一种双准则划分方法,该方法在优化问题中同时考虑直径和簇内和,并便于从替代最优解中进行选择。我们开发了几种基于MATLAB的交换算法,这些算法能快速为双准则划分问题提供出色的解决方案。使用合成数据集以及经验差异矩阵对这些算法进行了评估。