Milligan G W, Cooper M C
Multivariate Behav Res. 1986 Oct 1;21(4):441-58. doi: 10.1207/s15327906mbr2104_5.
Five external criteria were used to evaluate the extent of recovery of the true structure in a hierarchical clustering solution. This was accomplished by comparing the partitions produced by the clustering algorithm with the partition that indicates the true cluster structure known to exist in the data. The five criteria examined were the Rand, the Morey and Agresti adjusted Rand, the Hubert and Arabie adjusted Rand, the Jaccard, and the Fowlkes and Mallows measures. The results of the study indicated that the Hubert and Arabie adjusted Rank index was best suited to the task of comparison across hierarchy levels. Deficiencies with the other measures are noted.
使用五个外部标准来评估层次聚类解决方案中真实结构的恢复程度。这是通过将聚类算法产生的划分与表示数据中已知存在的真实聚类结构的划分进行比较来实现的。所检验的五个标准是兰德指数、莫雷和阿格雷斯蒂调整兰德指数、休伯特和阿拉比调整兰德指数、杰卡德指数以及福克尔斯和马洛斯度量。研究结果表明,休伯特和阿拉比调整兰德指数最适合跨层次级别进行比较的任务。文中还指出了其他度量的不足之处。