Li Hui-Jia, Daniels Jasmine J
School of Management Science and Engineering, Central University of Finance and Economics, Beijing 100080, China and Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China.
Department of Applied Physics, Stanford University, Stanford, California 94305, USA.
Phys Rev E Stat Nonlin Soft Matter Phys. 2015 Jan;91(1):012801. doi: 10.1103/PhysRevE.91.012801. Epub 2015 Jan 5.
Community structure analysis is a powerful tool for social networks that can simplify their topological and functional analysis considerably. However, since community detection methods have random factors and real social networks obtained from complex systems always contain error edges, evaluating the significance of a partitioned community structure is an urgent and important question. In this paper, integrating the specific characteristics of real society, we present a framework to analyze the significance of a social community. The dynamics of social interactions are modeled by identifying social leaders and corresponding hierarchical structures. Instead of a direct comparison with the average outcome of a random model, we compute the similarity of a given node with the leader by the number of common neighbors. To determine the membership vector, an efficient community detection algorithm is proposed based on the position of the nodes and their corresponding leaders. Then, using a log-likelihood score, the tightness of the community can be derived. Based on the distribution of community tightness, we establish a connection between p-value theory and network analysis, and then we obtain a significance measure of statistical form . Finally, the framework is applied to both benchmark networks and real social networks. Experimental results show that our work can be used in many fields, such as determining the optimal number of communities, analyzing the social significance of a given community, comparing the performance among various algorithms, etc.
社区结构分析是社交网络的一种强大工具,它可以极大地简化社交网络的拓扑和功能分析。然而,由于社区检测方法存在随机因素,并且从复杂系统中获得的真实社交网络总是包含错误边,因此评估划分后的社区结构的重要性是一个紧迫且重要的问题。在本文中,结合现实社会的具体特征,我们提出了一个分析社会社区重要性的框架。通过识别社会领导者和相应的层次结构来对社会互动的动态进行建模。我们不是直接与随机模型的平均结果进行比较,而是通过共同邻居的数量来计算给定节点与领导者的相似度。为了确定成员向量,基于节点及其相应领导者的位置提出了一种高效的社区检测算法。然后,使用对数似然分数,可以得出社区的紧密程度。基于社区紧密程度的分布,我们在p值理论和网络分析之间建立联系,进而获得一种统计形式的重要性度量。最后,将该框架应用于基准网络和真实社交网络。实验结果表明,我们的工作可用于许多领域,如确定社区的最优数量、分析给定社区的社会重要性、比较各种算法之间的性能等。