Ma Shisong, Gong Qingqiu, Bohnert Hans J
Physiological and Molecular Plant Biology Program, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA.
Genome Res. 2007 Nov;17(11):1614-25. doi: 10.1101/gr.6911207. Epub 2007 Oct 5.
We describe a gene network for the Arabidopsis thaliana transcriptome based on a modified graphical Gaussian model (GGM). Through partial correlation (pcor), GGM infers coregulation patterns between gene pairs conditional on the behavior of other genes. Regularized GGM calculated pcor between gene pairs among approximately 2000 input genes at a time. Regularized GGM coupled with iterative random samplings of genes was expanded into a network that covered the Arabidopsis genome (22,266 genes). This resulted in a network of 18,625 interactions (edges) among 6760 genes (nodes) with high confidence and connections representing approximately 0.01% of all possible edges. When queried for selected genes, locally coherent subnetworks mainly related to metabolic functions, and stress responses emerged. Examples of networks for biochemical pathways, cell wall metabolism, and cold responses are presented. GGM displayed known coregulation pathways as subnetworks and added novel components to known edges. Finally, the network reconciled individual subnetworks in a topology joined at the whole-genome level and provided a general framework that can instruct future studies on plant metabolism and stress responses. The network model is included.
我们基于改进的图形高斯模型(GGM)描述了拟南芥转录组的基因网络。通过偏相关(pcor),GGM在其他基因行为的条件下推断基因对之间的共调控模式。正则化GGM一次计算约2000个输入基因中基因对之间的pcor。将正则化GGM与基因的迭代随机抽样相结合,扩展成一个覆盖拟南芥基因组(22266个基因)的网络。这产生了一个由6760个基因(节点)之间的18625个相互作用(边)组成的网络,这些相互作用具有高可信度,连接数约占所有可能边的0.01%。当查询选定基因时,出现了主要与代谢功能和应激反应相关的局部连贯子网。展示了生化途径、细胞壁代谢和冷反应的网络示例。GGM将已知的共调控途径显示为子网,并为已知边添加了新的组成部分。最后,该网络在全基因组水平连接的拓扑结构中协调了各个子网,并提供了一个可指导未来植物代谢和应激反应研究的通用框架。网络模型也包含在内。