Fujita A, Sato J R, Garay-Malpartida H M, Morettin P A, Sogayar M C, Ferreira C E
Institute of Mathematics and Statistics, University of São Paulo, Rua do Matão, 1010-São Paulo, 05508-090, SP, Brazil.
Bioinformatics. 2007 Jul 1;23(13):1623-30. doi: 10.1093/bioinformatics/btm151. Epub 2007 Apr 26.
A variety of biological cellular processes are achieved through a variety of extracellular regulators, signal transduction, protein-protein interactions and differential gene expression. Understanding of the mechanisms underlying these processes requires detailed molecular description of the protein and gene networks involved. To better understand these molecular networks, we propose a statistical method to estimate time-varying gene regulatory networks from time series microarray data. One well known problem when inferring connectivity in gene regulatory networks is the fact that the relationships found constitute correlations that do not allow inferring causation, for which, a priori biological knowledge is required. Moreover, it is also necessary to know the time period at which this causation occurs. Here, we present the Dynamic Vector Autoregressive model as a solution to these problems.
We have applied the Dynamic Vector Autoregressive model to estimate time-varying gene regulatory networks based on gene expression profiles obtained from microarray experiments. The network is determined entirely based on gene expression profiles data, without any prior biological knowledge. Through construction of three gene regulatory networks (of p53, NF-kappaB and c-myc) for HeLa cells, we were able to predict the connectivity, Granger-causality and dynamics of the information flow in these networks.
Additional figures may be found at http://mariwork.iq.usp.br/dvar/.
多种生物细胞过程是通过多种细胞外调节因子、信号转导、蛋白质 - 蛋白质相互作用和差异基因表达来实现的。要理解这些过程背后的机制,需要对所涉及的蛋白质和基因网络进行详细的分子描述。为了更好地理解这些分子网络,我们提出了一种统计方法,用于从时间序列微阵列数据中估计随时间变化的基因调控网络。在推断基因调控网络的连通性时,一个众所周知的问题是所发现的关系构成的是相关性,无法据此推断因果关系,为此需要先验生物学知识。此外,还需要知道这种因果关系发生的时间段。在此,我们提出动态向量自回归模型来解决这些问题。
我们已应用动态向量自回归模型,基于从微阵列实验获得的基因表达谱来估计随时间变化的基因调控网络。该网络完全基于基因表达谱数据确定,无需任何先验生物学知识。通过构建针对HeLa细胞的三个基因调控网络(p53、NF - κB和c - myc的网络),我们能够预测这些网络中的连通性、格兰杰因果关系和信息流动力学。