Lim James T, Chen Chen, Grant Adam D, Padi Megha
Department of Molecular and Cellular Biology, The University of Arizona, Tucson, AZ, United States.
Department of Epidemiology and Biostatistics, Mel and Enid Zuckerman College of Public Health, The University of Arizona, Tucson, AZ, United States.
Front Genet. 2021 Jan 14;11:603264. doi: 10.3389/fgene.2020.603264. eCollection 2020.
The use of biological networks such as protein-protein interaction and transcriptional regulatory networks is becoming an integral part of genomics research. However, these networks are not static, and during phenotypic transitions like disease onset, they can acquire new "communities" (or highly interacting groups) of genes that carry out cellular processes. Disease communities can be detected by maximizing a modularity-based score, but since biological systems and network inference algorithms are inherently noisy, it remains a challenge to determine whether these changes represent real cellular responses or whether they appeared by random chance. Here, we introduce Constrained Random Alteration of Network Edges (CRANE), a method for randomizing networks with fixed node strengths. CRANE can be used to generate a null distribution of gene regulatory networks that can in turn be used to rank the most significant changes in candidate disease communities. Compared to other approaches, such as consensus clustering or commonly used generative models, CRANE emulates biologically realistic networks and recovers simulated disease modules with higher accuracy. When applied to breast and ovarian cancer networks, CRANE improves the identification of cancer-relevant GO terms while reducing the signal from non-specific housekeeping processes.
诸如蛋白质-蛋白质相互作用和转录调控网络等生物网络的使用正成为基因组学研究不可或缺的一部分。然而,这些网络并非一成不变,在诸如疾病发作等表型转变过程中,它们能够获得执行细胞过程的新的基因“群落”(或高度相互作用的基因群组)。通过最大化基于模块度的评分可以检测到疾病群落,但由于生物系统和网络推理算法本质上存在噪声,确定这些变化是代表真实的细胞反应还是随机出现的,仍然是一项挑战。在此,我们介绍网络边的约束随机改变(CRANE),一种用于在固定节点强度下随机化网络的方法。CRANE可用于生成基因调控网络的零分布,进而用于对候选疾病群落中最显著的变化进行排名。与其他方法(如共识聚类或常用的生成模型)相比,CRANE模拟生物现实网络,并能以更高的准确性恢复模拟的疾病模块。当应用于乳腺癌和卵巢癌网络时,CRANE在减少来自非特异性看家过程信号的同时,提高了对癌症相关基因本体(GO)术语的识别。