Stumpf Michael P H, Wiuf Carsten
Centre for Bioinformatics, Division of Molecular Biosciences, Imperial College London, Wolfson Building, London SW7 2AZ, United Kingdom.
Phys Rev E Stat Nonlin Soft Matter Phys. 2005 Sep;72(3 Pt 2):036118. doi: 10.1103/PhysRevE.72.036118. Epub 2005 Sep 19.
We discuss two sampling schemes for selecting random subnets from a network, random sampling and connectivity dependent sampling, and investigate how the degree distribution of a node in the network is affected by the two types of sampling. Here we derive a necessary and sufficient condition that guarantees that the degree distributions of the subnet and the true network belong to the same family of probability distributions. For completely random sampling of nodes we find that this condition is satisfied by classical random graphs; for the vast majority of networks this condition will, however, not be met. We furthermore discuss the case where the probability of sampling a node depends on the degree of a node and we find that even classical random graphs are no longer closed under this sampling regime. We conclude by relating the results to real Eschericia coli protein interaction network data.
我们讨论了从网络中选择随机子网的两种抽样方案,即随机抽样和依赖连通性的抽样,并研究了这两种抽样类型如何影响网络中节点的度分布。在此,我们推导出一个充要条件,该条件保证子网和真实网络的度分布属于同一概率分布族。对于节点的完全随机抽样,我们发现经典随机图满足此条件;然而,对于绝大多数网络,此条件将不满足。我们还讨论了节点抽样概率取决于节点度的情况,并且发现即使是经典随机图在这种抽样方式下也不再封闭。最后,我们将结果与真实的大肠杆菌蛋白质相互作用网络数据相关联。