Sadegh Sepideh, Nazarieh Maryam, Spaniol Christian, Helms Volkhard
.
J Integr Bioinform. 2017 Jul 4;14(2):20170017. doi: 10.1515/jib-2017-0017.
Gene-regulatory networks are an abstract way of capturing the regulatory connectivity between transcription factors, microRNAs, and target genes in biological cells. Here, we address the problem of identifying enriched co-regulatory three-node motifs that are found significantly more often in real network than in randomized networks. First, we compare two randomization strategies, that either only conserve the degree distribution of the nodes' in- and out-links, or that also conserve the degree distributions of different regulatory edge types. Then, we address the issue how convergence of randomization can be measured. We show that after at most 10 × |E| edge swappings, converged motif counts are obtained and the memory of initial edge identities is lost.
基因调控网络是一种抽象的方式,用于捕捉生物细胞中转录因子、微小RNA和靶基因之间的调控连接性。在这里,我们解决了识别富集的共调控三节点基序的问题,这些基序在真实网络中出现的频率明显高于随机网络。首先,我们比较了两种随机化策略,一种是仅保留节点入边和出边的度分布,另一种是还保留不同调控边类型的度分布。然后,我们解决了如何衡量随机化收敛的问题。我们表明,在最多进行10×|E|次边交换后,可获得收敛的基序计数,并且初始边身份的记忆会丢失。