Lancichinetti Andrea, Radicchi Filippo, Ramasco José J
Complex Networks Lagrange Laboratory, Turin, Italy and Physics Department, Politecnico di Torino, ISI Foundation, Turin, Italy.
Phys Rev E Stat Nonlin Soft Matter Phys. 2010 Apr;81(4 Pt 2):046110. doi: 10.1103/PhysRevE.81.046110. Epub 2010 Apr 20.
Nodes in real-world networks are usually organized in local modules. These groups, called communities, are intuitively defined as subgraphs with a larger density of internal connections than of external links. In this work, we define a measure aimed at quantifying the statistical significance of single communities. Extreme and order statistics are used to predict the statistics associated with individual clusters in random graphs. These distributions allows us to define one community significance as the probability that a generic clustering algorithm finds such a group in a random graph. The method is successfully applied in the case of real-world networks for the evaluation of the significance of their communities.
现实世界网络中的节点通常按局部模块组织。这些组被称为社区,直观地定义为内部连接密度大于外部链接密度的子图。在这项工作中,我们定义了一种旨在量化单个社区统计显著性的度量。极值统计和顺序统计用于预测随机图中与单个聚类相关的统计量。这些分布使我们能够将一个社区显著性定义为通用聚类算法在随机图中找到这样一个组的概率。该方法成功应用于现实世界网络的案例,用于评估其社区的显著性。