Center for Biophysics and Computational Biology, Institute for Genomic Biology, Department of Chemical and Biomolecular Engineering, University of Illinois, Urbana, IL 61801, USA.
Proc Natl Acad Sci U S A. 2010 Oct 12;107(41):17845-50. doi: 10.1073/pnas.1005139107. Epub 2010 Sep 27.
Prediction of metabolic changes that result from genetic or environmental perturbations has several important applications, including diagnosing metabolic disorders and discovering novel drug targets. A cardinal challenge in obtaining accurate predictions is the integration of transcriptional regulatory networks with the corresponding metabolic network. We propose a method called probabilistic regulation of metabolism (PROM) that achieves this synthesis and enables straightforward, automated, and quantitative integration of high-throughput data into constraint-based modeling, making it an ideal tool for constructing genome-scale regulatory-metabolic network models for less-studied organisms. PROM introduces probabilities to represent gene states and gene-transcription factor interactions. By using PROM, we constructed an integrated regulatory-metabolic network for the model organism, Escherichia coli, and demonstrated that our method based on automated inference is more accurate and comprehensive than the current state of the art, which is based on manual curation of literature. After validating the approach, we used PROM to build a genome-scale integrated metabolic-regulatory model for Mycobacterium tuberculosis, a critically important human pathogen. This study incorporated data from more than 1,300 microarrays, 2,000 transcription factor-target interactions regulating 3,300 metabolic reactions, and 1,905 KO phenotypes for E. coli and M. tuberculosis. PROM identified KO phenotypes with accuracies as high as 95%, and predicted growth rates quantitatively with correlation of 0.95. Importantly, PROM represents the successful integration of a top-down reconstructed, statistically inferred regulatory network with a bottom-up reconstructed, biochemically detailed metabolic network, bridging two important classes of systems biology models that are rarely combined quantitatively.
预测遗传或环境扰动导致的代谢变化有几个重要的应用,包括诊断代谢紊乱和发现新的药物靶点。获得准确预测的一个主要挑战是将转录调控网络与相应的代谢网络整合起来。我们提出了一种称为代谢概率调控(PROM)的方法,实现了这种综合,并能够将高通量数据直接、自动地纳入基于约束的建模中,使其成为构建针对研究较少的生物体的基因组规模调控代谢网络模型的理想工具。PROM 引入概率来表示基因状态和基因-转录因子相互作用。通过使用 PROM,我们为模型生物大肠杆菌构建了一个集成的调控代谢网络,并证明了我们基于自动推理的方法比基于文献手动注释的当前最先进方法更准确和全面。在验证了该方法后,我们使用 PROM 为重要的人类病原体结核分枝杆菌构建了一个基因组规模的代谢调控整合模型。该研究整合了来自 1300 多个微阵列、调控 3300 个代谢反应的 2000 个转录因子-靶标相互作用以及大肠杆菌和结核分枝杆菌的 1905 个 KO 表型的数据。PROM 鉴定 KO 表型的准确率高达 95%,并对生长速率进行了定量预测,相关系数为 0.95。重要的是,PROM 代表了自上而下重建的、统计推断的调控网络与自下而上重建的、生化详细的代谢网络的成功整合,这两种网络模型很少被定量结合。