Finding community structure in very large networks.

Author information

Aaron Clauset, M. E. J. Newman, Cristopher Moore

Affiliation

Department of Computer Science, University of New Mexico, Albuquerque, NM 87131, USA.

Publication information

Phys Rev E Stat Nonlin Soft Matter Phys. 2004 Dec;70(6 Pt 2):066111. doi: 10.1103/PhysRevE.70.066111. Epub 2004 Dec 6.

DOI:10.1103/PhysRevE.70.066111

PMID:15697438

Abstract

The discovery and analysis of community structure in networks is a topic of considerable recent interest within the physics community, but most methods proposed so far are unsuitable for very large networks because of their computational cost. Here we present a hierarchical agglomeration algorithm for detecting community structure which is faster than many competing algorithms: its running time on a network with n vertices and m edges is O(md log n), where d is the depth of the dendrogram describing the community structure. Many real-world networks are sparse and hierarchical, with m approximately n and d approximately log n, in which case our algorithm runs in essentially linear time, O(n log^2 n). As an example of the application of this algorithm we use it to analyze a network of items for sale on the web site of a large on-line retailer, items in the network being linked if they are frequently purchased by the same buyer. The network has more than 400,000 vertices and 2 × 10^6 edges. We show that our algorithm can extract meaningful communities from this network, revealing large-scale patterns present in the purchasing habits of customers.
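The abstract does not spell out the merge rule, so the following is only a minimal sketch of hierarchical agglomeration, assuming the criterion is greedy modularity maximization (the approach commonly associated with this paper): start with every vertex as its own community and repeatedly merge the connected pair whose merge increases the modularity Q the most. The function name `greedy_modularity` and the input format are hypothetical. The sketch rescans all community pairs at every step instead of using heap-based bookkeeping, so it illustrates the merge rule only and does not achieve the O(md log n) running time stated above. It assumes a simple undirected graph (no self-loops) with sortable vertex labels.

```python
def greedy_modularity(edges):
    """Naive greedy agglomeration (illustrative sketch, not the paper's code).

    edges: list of (u, v) pairs of an undirected simple graph.
    Returns (communities, Q) for the highest-modularity level of the dendrogram.
    """
    m = len(edges)
    w = 1.0 / (2 * m)
    e = {}  # e[i][j]: half the fraction of edges joining communities i and j
    a = {}  # a[i]: fraction of edge ends attached to community i
    for u, v in edges:
        for x, y in ((u, v), (v, u)):
            e.setdefault(x, {})
            e[x][y] = e[x].get(y, 0.0) + w
            a[x] = a.get(x, 0.0) + w

    members = {v: {v} for v in a}
    q = -sum(x * x for x in a.values())  # Q when every vertex is alone
    best_q = q
    best = [set(s) for s in members.values()]

    while True:
        # Modularity change for merging connected communities i and j:
        # dQ = 2 * (e_ij - a_i * a_j)
        candidates = [
            (2 * (e[i][j] - a[i] * a[j]), i, j)
            for i in e for j in e[i] if i < j
        ]
        if not candidates:
            break
        dq, i, j = max(candidates)

        # Merge community j into community i.
        for k, weight in list(e[j].items()):
            del e[k][j]
            if k != i:
                e[i][k] = e[i].get(k, 0.0) + weight
                e[k][i] = e[k].get(i, 0.0) + weight
        del e[j]
        a[i] += a.pop(j)
        members[i] |= members.pop(j)
        q += dq

        # Remember the best level seen so far in the dendrogram.
        if q > best_q:
            best_q = q
            best = [set(s) for s in members.values()]

    return best, best_q


# Usage: two triangles joined by a single edge.
toy_edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
communities, q = greedy_modularity(toy_edges)
print(sorted(sorted(c) for c in communities), round(q, 4))
```

The loop keeps merging even when the best available change is negative, so the full dendrogram is built and the level with the highest Q is returned.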

Similar articles

Finding local community structure in networks.

Phys Rev E Stat Nonlin Soft Matter Phys. 2005 Aug;72(2 Pt 2):026132. doi: 10.1103/PhysRevE.72.026132. Epub 2005 Aug 29.

Unfolding communities in large complex networks: combining defensive and offensive label propagation for core extraction.

Phys Rev E Stat Nonlin Soft Matter Phys. 2011 Mar;83(3 Pt 2):036103. doi: 10.1103/PhysRevE.83.036103. Epub 2011 Mar 8.

Towards real-time community detection in large networks.

Phys Rev E Stat Nonlin Soft Matter Phys. 2009 Jun;79(6 Pt 2):066107. doi: 10.1103/PhysRevE.79.066107. Epub 2009 Jun 16.

Adaptive clustering algorithm for community detection in complex networks.

Phys Rev E Stat Nonlin Soft Matter Phys. 2008 Oct;78(4 Pt 2):046115. doi: 10.1103/PhysRevE.78.046115. Epub 2008 Oct 30.

Fast algorithm for detecting community structure in networks.

Phys Rev E Stat Nonlin Soft Matter Phys. 2004 Jun;69(6 Pt 2):066133. doi: 10.1103/PhysRevE.69.066133. Epub 2004 Jun 18.

Sequential algorithm for fast clique percolation.

Phys Rev E Stat Nonlin Soft Matter Phys. 2008 Aug;78(2 Pt 2):026109. doi: 10.1103/PhysRevE.78.026109. Epub 2008 Aug 15.

Tolerating the community detection resolution limit with edge weighting.

Phys Rev E Stat Nonlin Soft Matter Phys. 2011 May;83(5 Pt 2):056119. doi: 10.1103/PhysRevE.83.056119. Epub 2011 May 25.

A DC programming approach for finding communities in networks.

Neural Comput. 2014 Dec;26(12):2827-54. doi: 10.1162/NECO_a_00673. Epub 2014 Sep 23.

Multiscale community geometry in a network and its application.

Phys Rev E Stat Nonlin Soft Matter Phys. 2012 Oct;86(4 Pt 1):041120. doi: 10.1103/PhysRevE.86.041120. Epub 2012 Oct 12.

Cited by

Mapping Vaccine Sentiment by Analyzing Spanish-Language Social Media Posts and Survey-Based Public Opinion: Dual Methods Study.

JMIR Infodemiology. 2025 Aug 29;5:e63223. doi: 10.2196/63223.

Resource use of dusky and Galapagos sharks in response to fishing activities at a remote Pacific Island.

Sci Rep. 2025 Aug 28;15(1):31743. doi: 10.1038/s41598-025-15677-3.

Reversing Preeclampsia Pathology: AXL Inhibition Restores Mitochondrial Function and ECM Balance.

Cells. 2025 Aug 8;14(16):1229. doi: 10.3390/cells14161229.

Overlapping community detection based on bridging structural features and fuzzy C-means.

PLoS One. 2025 Aug 26;20(8):e0328825. doi: 10.1371/journal.pone.0328825. eCollection 2025.

Comorbidities associated with fetal alcohol spectrum disorders in the United States.

Sci Rep. 2025 Aug 13;15(1):29704. doi: 10.1038/s41598-025-13366-9.

Microbiome composition and co-occurrence dynamics in wild are influenced by host crop, fly sex, and sampling location.

Microbiol Spectr. 2025 Aug 12:e0260824. doi: 10.1128/spectrum.02608-24.

Disease clusters and their genetic determinants following a diagnosis of depression: analyses based on a novel three-dimensional disease network approach.

Mol Psychiatry. 2025 Jul 18. doi: 10.1038/s41380-025-03120-y.

Monthly Alternations of Core Plant Species in Dynamic Plant-Pollinator Networks of an Urban Botanical Garden.

Ecol Evol. 2025 Jul 17;15(7):e71822. doi: 10.1002/ece3.71822. eCollection 2025 Jul.

GCN-based unsupervised community detection with refined structure centers and expanded pseudo-labeled set.

PLoS One. 2025 Jul 1;20(7):e0327022. doi: 10.1371/journal.pone.0327022. eCollection 2025.

The human gut microbiota in IBD, characterizing hubs, the core microbiota and terminal nodes: a network-based approach.

BMC Microbiol. 2025 Jun 26;25(1):371. doi: 10.1186/s12866-025-04106-0.
