Suppr超能文献

通过修正的聚类不稳定性估计聚类数。

Estimating the number of clusters via a corrected clustering instability.

作者信息

Haslbeck Jonas M B, Wulff Dirk U

机构信息

Psychological Methods Group, University of Amsterdam, Amsterdam, The Netherlands.

Center for Cognitive and Decision Science, University of Basel, Basel, Switzerland.

出版信息

Comput Stat. 2020;35(4):1879-1894. doi: 10.1007/s00180-020-00981-5. Epub 2020 May 18.

Abstract

We improve instability-based methods for the selection of the number of clusters in cluster analysis by developing a corrected clustering distance that corrects for the unwanted influence of the distribution of cluster sizes on cluster instability. We show that our corrected instability measure outperforms current instability-based measures across the whole sequence of possible , overcoming limitations of current insability-based methods for large . We also compare, for the first time, model-based and model-free approaches to determining cluster-instability and find their performance to be comparable. We make our method available in the R-package cstab.

摘要

我们通过开发一种校正聚类距离来改进聚类分析中基于不稳定性的聚类数量选择方法,该距离可校正聚类大小分布对聚类不稳定性的不良影响。我们表明,在整个可能的序列中,我们校正后的不稳定性度量优于当前基于不稳定性的度量,克服了当前基于不稳定性方法在大数据集时的局限性。我们还首次比较了基于模型和无模型的确定聚类不稳定性的方法,发现它们的性能相当。我们将我们的方法以R包cstab的形式提供。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/365a/7550318/25d42b86789f/180_2020_981_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验