Peixoto Tiago P
Institut für Theoretische Physik, Universität Bremen, Germany.
Phys Rev E Stat Nonlin Soft Matter Phys. 2012 May;85(5 Pt 2):056122. doi: 10.1103/PhysRevE.85.056122. Epub 2012 May 30.
Stochastic blockmodels are generative network models where the vertices are separated into discrete groups, and the probability of an edge existing between two vertices is determined solely by their group membership. In this paper, we derive expressions for the entropy of stochastic blockmodel ensembles. We consider several ensemble variants, including the traditional model as well as the newly introduced degree-corrected version [Karrer et al., Phys. Rev. E 83, 016107 (2011)], which imposes a degree sequence on the vertices, in addition to the block structure. The imposed degree sequence is implemented both as "soft" constraints, where only the expected degrees are imposed, and as "hard" constraints, where they are required to be the same on all samples of the ensemble. We also consider generalizations to multigraphs and directed graphs. We illustrate one of many applications of this measure by directly deriving a log-likelihood function from the entropy expression, and using it to infer latent block structure in observed data. Due to the general nature of the ensembles considered, the method works well for ensembles with intrinsic degree correlations (i.e., with entropic origin) as well as extrinsic degree correlations, which go beyond the block structure.
随机块模型是一种生成式网络模型,其中顶点被划分为离散的组,两个顶点之间存在边的概率仅由它们所属的组决定。在本文中,我们推导了随机块模型系综熵的表达式。我们考虑了几种系综变体,包括传统模型以及新引入的度校正版本[卡勒等人,《物理评论E》83,016107(2011年)],除了块结构外,该版本还对顶点施加了度序列。施加的度序列既可以作为“软”约束来实现,即只施加期望度,也可以作为“硬”约束来实现,即要求在系综的所有样本上度都相同。我们还考虑了对多重图和有向图的推广。我们通过直接从熵表达式推导对数似然函数,并使用它来推断观测数据中的潜在块结构,说明了这种度量的众多应用之一。由于所考虑系综的一般性,该方法对于具有内在度相关性(即具有熵起源)以及超出块结构的外在度相关性的系综都适用。