University of Texas Southwestern Medical Center, Dallas, Texas, United States of America.
PLoS One. 2012;7(1):e29348. doi: 10.1371/journal.pone.0029348. Epub 2012 Jan 17.
The gene regulatory network (GRN) reveals the regulatory relationships among genes and can provide a systematic understanding of molecular mechanisms underlying biological processes. The importance of computer simulations in understanding cellular processes is now widely accepted; a variety of algorithms have been developed to study these biological networks. The goal of this study is to provide a comprehensive evaluation and a practical guide to aid in choosing statistical methods for constructing large scale GRNs. Using both simulation studies and a real application in E. coli data, we compare different methods in terms of sensitivity and specificity in identifying the true connections and the hub genes, the ease of use, and computational speed. Our results show that these algorithms performed reasonably well, and each method has its own advantages: (1) GeneNet, WGCNA (Weighted Correlation Network Analysis), and ARACNE (Algorithm for the Reconstruction of Accurate Cellular Networks) performed well in constructing the global network structure; (2) GeneNet and SPACE (Sparse PArtial Correlation Estimation) performed well in identifying a few connections with high specificity.
基因调控网络(GRN)揭示了基因之间的调控关系,可以提供对生物过程分子机制的系统理解。计算机模拟在理解细胞过程中的重要性现在已被广泛接受;已经开发了多种算法来研究这些生物网络。本研究的目的是提供全面的评估和实用指南,以帮助选择构建大规模 GRN 的统计方法。我们使用模拟研究和大肠杆菌数据的实际应用,根据识别真实连接和枢纽基因的灵敏度和特异性、易用性和计算速度,比较了不同方法。我们的结果表明,这些算法表现相当不错,每种方法都有其自身的优势:(1)GeneNet、WGCNA(加权相关网络分析)和 ARACNE(准确细胞网络重建算法)在构建全局网络结构方面表现良好;(2)GeneNet 和 SPACE(稀疏部分相关估计)在识别具有高特异性的少数连接方面表现良好。