Gat-Viks Irit, Shamir Ron
School of Computer Science, Tel-Aviv University, Tel-Aviv 69978, Israel.
Genome Res. 2007 Mar;17(3):358-67. doi: 10.1101/gr.5750507. Epub 2007 Jan 31.
The analysis of large-scale genome-wide experiments carries the promise of dramatically broadening our understanding on biological networks. The challenge of systematic integration of experimental results with established biological knowledge on a pathway is still unanswered. Here we present a methodology that attempts to answer this challenge when investigating signaling pathways. We formalize existing qualitative knowledge as a probabilistic model that depicts known interactions between molecules (genes, proteins, etc.) as a network and known regulatory relations as logics. We present algorithms that analyze experimental results (e.g., transcription profiles) vis-à-vis the model and propose improvements to the model based on the fit to the experimental data. These algorithms refine the relations between model components, as well as expand the model to include new components that are regulated by components of the original network. Using our methodology, we have modeled together the knowledge on four established signaling pathways related to osmotic shock response in Saccharomyces cerevisiae. Using over 100 published transcription profiles, our refinement methodology revealed three cross talks in the network. The expansion procedure identified with high confidence large groups of genes that are coregulated by transcription factors from the original network via a common logic. The results reveal a novel delicate repressive effect of the HOG pathway on many transcriptional target genes and suggest an unexpected alternative functional mode of the MAP kinase Hog1. These results demonstrate that, by integrated analysis of data and of well-defined knowledge, one can generate concrete biological hypotheses about signaling cascades and their downstream regulatory programs.
大规模全基因组实验分析有望极大地拓宽我们对生物网络的理解。将实验结果与已有的关于某一通路的生物学知识进行系统整合这一挑战仍未得到解决。在此,我们提出一种方法,试图在研究信号通路时应对这一挑战。我们将现有的定性知识形式化为一个概率模型,该模型将分子(基因、蛋白质等)之间已知的相互作用描绘为一个网络,将已知的调控关系描绘为逻辑关系。我们提出算法,相对于该模型分析实验结果(例如转录谱),并根据与实验数据的拟合情况对模型提出改进。这些算法优化模型组件之间的关系,以及扩展模型以纳入由原始网络组件调控的新组件。使用我们的方法,我们共同构建了与酿酒酵母渗透休克反应相关的四条已确立信号通路的知识模型。利用100多篇已发表的转录谱,我们的优化方法揭示了网络中的三个相互作用。扩展程序以高置信度识别出由原始网络中的转录因子通过共同逻辑共同调控的大量基因。结果揭示了高渗甘油(HOG)通路对许多转录靶基因的一种新的微妙抑制作用,并暗示了丝裂原活化蛋白激酶Hog1一种意想不到的替代功能模式。这些结果表明,通过对数据和明确知识的综合分析,可以生成关于信号级联及其下游调控程序的具体生物学假设。