Huang Shao-shan Carol, Fraenkel Ernest
Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA.
Methods Cell Biol. 2012;110:57-80. doi: 10.1016/B978-0-12-388403-9.00003-5.
Signaling and transcription are tightly integrated processes that underlie many cellular responses to the environment. A network of signaling events, often mediated by post-translational modification on proteins, can lead to long-term changes in cellular behavior by altering the activity of specific transcriptional regulators and consequently the expression level of their downstream targets. As many high-throughput, "-omics" methods are now available that can simultaneously measure changes in hundreds of proteins and thousands of transcripts, it should be possible to systematically reconstruct cellular responses to perturbations in order to discover previously unrecognized signaling pathways. This chapter describes a computational method for discovering such pathways that aims to compensate for the varying levels of noise present in these diverse data sources. Based on the concept of constraint optimization on networks, the method seeks to achieve two conflicting aims: (1) to link together many of the signaling proteins and differentially expressed transcripts identified in the experiments "constraints" using previously reported protein-protein and protein-DNA interactions, while (2) keeping the resulting network small and ensuring it is composed of the highest confidence interactions "optimization". A further distinctive feature of this approach is the use of transcriptional data as evidence of upstream signaling events that drive changes in gene expression, rather than as proxies for downstream changes in the levels of the encoded proteins. We recently demonstrated that by applying this method to phosphoproteomic and transcriptional data from the pheromone response in yeast, we were able to recover functionally coherent pathways and to reveal many components of the cellular response that are not readily apparent in the original data. Here, we provide a more detailed description of the method, explore the robustness of the solution to the noise level of input data and discuss the effect of parameter values.
信号传导和转录是紧密整合的过程,是许多细胞对环境反应的基础。一个通常由蛋白质翻译后修饰介导的信号事件网络,可以通过改变特定转录调节因子的活性,进而改变其下游靶标的表达水平,导致细胞行为的长期变化。由于现在有许多高通量的“组学”方法可以同时测量数百种蛋白质和数千种转录本的变化,因此应该有可能系统地重建细胞对扰动的反应,以发现以前未被认识的信号通路。本章描述了一种发现此类通路的计算方法,旨在补偿这些不同数据源中存在的不同水平的噪声。基于网络约束优化的概念,该方法试图实现两个相互冲突的目标:(1)利用先前报道的蛋白质-蛋白质和蛋白质-DNA相互作用,将实验中鉴定的许多信号蛋白和差异表达的转录本“约束”联系在一起,同时(2)保持所得网络较小,并确保其由最高置信度的相互作用“优化”组成。这种方法的另一个显著特点是使用转录数据作为驱动基因表达变化的上游信号事件的证据,而不是作为编码蛋白质水平下游变化的代理。我们最近证明,通过将这种方法应用于酵母中信息素反应的磷酸蛋白质组学和转录数据,我们能够恢复功能上连贯的通路,并揭示原始数据中不易察觉的细胞反应的许多组成部分。在这里,我们提供了该方法更详细的描述,探讨了解决方案对输入数据噪声水平的稳健性,并讨论了参数值的影响。