Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka, 820-8502, Japan.
J Biosci Bioeng. 2010 Aug;110(2):254-61. doi: 10.1016/j.jbiosc.2010.01.015. Epub 2010 Feb 6.
Elementary mode (EM) analysis is potentially effective in integrating transcriptome or proteome data into metabolic network analyses and in exploring the mechanism of how phenotypic or metabolic flux distribution is changed with respect to environmental and genetic perturbations. The EM coefficients (EMCs) indicate the quantitative contribution of their associated EMs and can be estimated by maximizing Shannon's entropy as a general objective function in our previous study, but the use of EMCs is still restricted to a relatively small-scale networks. We propose a fast and universal method that optimizes hundreds of thousands of EMCs under the constraint of the Maximum entropy principle (MEP). Lagrange multipliers (LMs) are applied to maximize the Shannon's entropy-based objective function, analytically solving each EMC as the function of LMs. Consequently, the number of such search variables, the EMC number, is dramatically reduced to the reaction number. To demonstrate the feasibility of the MEP with Lagrange multipliers (MEPLM), it is coupled with enzyme control flux (ECF) to predict the flux distributions of Escherichia coli and Saccharomycescerevisiae for different conditions (gene deletion, adaptive evolution, temperature, and dilution rate) and to provide a quantitative understanding of how metabolic or physiological states are changed in response to these genetic or environmental perturbations at the elementary mode level. It is shown that the ECF-based method is a feasible framework for the prediction of metabolic flux distribution by integrating enzyme activity data into EMs to genetic and environmental perturbations.
基本模式 (EM) 分析在将转录组或蛋白质组数据整合到代谢网络分析中以及探索表型或代谢通量分布如何随环境和遗传扰动而变化的机制方面具有潜在的有效性。EM 系数 (EMC) 表示与其相关的 EMs 的定量贡献,可以通过在我们之前的研究中最大化香农熵作为一般目标函数来估计,但 EMC 的使用仍然限于相对较小规模的网络。我们提出了一种快速且通用的方法,该方法可以在最大熵原理 (MEP) 的约束下优化数十万个 EMC。拉格朗日乘数 (LM) 用于最大化基于香农熵的目标函数,通过解析求解每个 EMC 作为 LM 的函数。因此,这样的搜索变量的数量,即 EMC 数量,大大减少到反应数量。为了证明具有拉格朗日乘数的 MEP(MEPLM)的可行性,它与酶控制通量(ECF)耦合,以预测不同条件(基因缺失、适应性进化、温度和稀释率)下大肠杆菌和酿酒酵母的通量分布,并在基本模式水平上提供对代谢或生理状态如何响应这些遗传或环境扰动的定量理解。结果表明,通过将酶活性数据整合到 EMs 中以应对遗传和环境扰动,基于 ECF 的方法是预测代谢通量分布的可行框架。