Suppr
超能文献

层次聚类算法的混合模型检验：对每个人进行分类的问题。

Mixture Model Tests Of Hierarchical Clustering Algorithms: The Problem Of Classifying Everybody.

出版信息

Multivariate Behav Res. 1979 Jul 1;14(3):367-84. doi: 10.1207/s15327906mbr1403_6.

Abstract

Due to the effects of outliers, mixture model tests that require all objects to be classified can severely underestimate the accuracy of hierarchical clustering algorithms. More valid and relevant comparisons between algorithms can be made by calculating accuracy at several levels in the hierarchical tree and considering accuracy as a function of the coverage of the classification. Using this procedure, several algorithms were compared on their ability to resolve ten multivariate normal mixtures. All of the algorithms were significantly more accurate than a random linkage algorithm, and accuracy was inversely related to coverage. Algorithms using correlation as the similarity measure were significantly more accurate than those using Euclidean distance (p < .001). A subset of high accuracy algorithms, including single, average, and centroid linkage using correlation, and Ward's minimum variance technique, was identified.

摘要

由于异常值的影响，要求所有对象都被分类的混合模型测试可能会严重低估层次聚类算法的准确性。通过在层次树的几个级别计算准确性，并将准确性视为分类的覆盖范围的函数，可以对算法进行更有效和更相关的比较。使用此过程，在其解析十个多元正态混合的能力上对几种算法进行了比较。所有算法的准确性都明显高于随机链接算法，并且准确性与覆盖范围成反比。使用相关性作为相似性度量的算法比使用欧几里得距离的算法（p<.001）更为准确。确定了一组高精度算法，包括使用相关性的单链接、平均链接和质心链接，以及 Ward 的最小方差技术。

相似文献

Mixture Model Tests Of Hierarchical Clustering Algorithms: The Problem Of Classifying Everybody.

Multivariate Behav Res. 1979 Jul 1;14(3):367-84. doi: 10.1207/s15327906mbr1403_6.

Hierarchical Cluster Analysis Using Intraclass Correlations: A Mixture Model Study.

Multivariate Behav Res. 1980 Jul 1;15(3):299-318. doi: 10.1207/s15327906mbr1503_5.

Monte Carlo Tests of the Accuracy of Cluster Analysis Algorithms: A Comparison of Hierarchical and Nonhierarchical Methods.

Multivariate Behav Res. 1985 Jul 1;20(3):283-304. doi: 10.1207/s15327906mbr2003_4.

Generalising Ward's Method for Use with Manhattan Distances.

PLoS One. 2017 Jan 13;12(1):e0168288. doi: 10.1371/journal.pone.0168288. eCollection 2017.

On the quality of tree-based protein classification.

Bioinformatics. 2005 May 1;21(9):1876-90. doi: 10.1093/bioinformatics/bti244. Epub 2005 Jan 12.

Clustering Molecular Dynamics Trajectories: 1. Characterizing the Performance of Different Clustering Algorithms.

J Chem Theory Comput. 2007 Nov;3(6):2312-34. doi: 10.1021/ct700119m.

Aerosol time-of-flight mass spectrometry data analysis: a benchmark of clustering algorithms.

Anal Chim Acta. 2007 Feb 28;585(1):38-54. doi: 10.1016/j.aca.2006.12.009. Epub 2006 Dec 10.

Effectiveness of environmental cluster analysis in representing regional species diversity.

Conserv Biol. 2006 Aug;20(4):1087-98. doi: 10.1111/j.1523-1739.2006.00500.x.

Evaluation of clustering algorithms for gene expression data using gene ontology annotations.

Chin Med J (Engl). 2012 Sep;125(17):3048-52.

Six clustering algorithms applied to the WAIS-R: the problem of dissimilar cluster results.

J Clin Psychol. 1989 Nov;45(6):932-5. doi: 10.1002/1097-4679(198911)45:6<932::aid-jclp2270450617>3.0.co;2-t.

引用本文的文献

Metabolic signatures of Arabidopsis thaliana abiotic stress responses elucidate patterns in stress priming, acclimation, and recovery.

Stress Biol. 2022 Feb 15;2(1):11. doi: 10.1007/s44154-022-00034-5.

Shape complexity in cluster analysis.

PLoS One. 2023 May 26;18(5):e0286312. doi: 10.1371/journal.pone.0286312. eCollection 2023.

Examining the effect of initialization strategies on the performance of Gaussian mixture modeling.

Behav Res Methods. 2017 Feb;49(1):282-293. doi: 10.3758/s13428-015-0697-6.

A mixture model approach for the analysis of small exploratory microarray experiments.

Comput Stat Data Anal. 2009 Mar 15;53(5):1566-1576. doi: 10.1016/j.csda.2008.06.011.

Patterns of dysmorphic features in schizophrenia.

Am J Med Genet. 2001 Dec 8;105(8):713-23. doi: 10.1002/ajmg.1612.

A typology of child behavior profile patterns: distribution and correlates for disturbed children aged 6--16.

J Abnorm Child Psychol. 1980 Dec;8(4):441-70. doi: 10.1007/BF00916500.

A cluster-analytically derived typology: feasible alternative to clinical diagnostic classification of children?

J Abnorm Child Psychol. 1982 Dec;10(4):451-82. doi: 10.1007/BF00920748.

Cluster analytic identification of autistic preschoolers.

J Autism Dev Disord. 1988 Dec;18(4):475-92. doi: 10.1007/BF02211868.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

层次聚类算法的混合模型检验：对每个人进行分类的问题。

Mixture Model Tests Of Hierarchical Clustering Algorithms: The Problem Of Classifying Everybody.

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译