Meister Arwen, Du Chao, Li Ye Henry, Wong Wing Hung
Computational Biology Lab, Bio-X Program, Stanford University, Stanford, CA 94305, USA.
Quant Biol. 2014 Mar;2(1):1-29. doi: 10.1007/s40484-014-0025-7.
The Master equation is considered the gold standard for modeling the stochastic mechanisms of gene regulation in molecular detail, but it is too complex to solve exactly in most cases, so approximation and simulation methods are essential. However, there is still a lack of consensus about the best way to carry these out. To help clarify the situation, we review Master equation models of gene regulation, theoretical approximations based on an expansion method due to N.G. van Kampen and R. Kubo, and simulation algorithms due to D.T. Gillespie and P. Langevin. Expansion of the Master equation shows that for systems with a single stable steady-state, the stochastic model reduces to a deterministic model in a first-order approximation. Additional theory, also due to van Kampen, describes the asymptotic behavior of multistable systems. To support and illustrate the theory and provide further insight into the complex behavior of multistable systems, we perform a detailed simulation study comparing the various approximation and simulation methods applied to synthetic gene regulatory systems with various qualitative characteristics. The simulation studies show that for large stochastic systems with a single steady-state, deterministic models are quite accurate, since the probability distribution of the solution has a single peak tracking the deterministic trajectory whose variance is inversely proportional to the system size. In multistable stochastic systems, large fluctuations can cause individual trajectories to escape from the domain of attraction of one steady-state and be attracted to another, so the system eventually reaches a multimodal probability distribution in which all stable steady-states are represented proportional to their relative stability. However, since the escape time scales exponentially with system size, this process can take a very long time in large systems.
主方程被认为是在分子层面详细建模基因调控随机机制的黄金标准,但在大多数情况下它过于复杂而无法精确求解,因此近似和模拟方法至关重要。然而,对于实施这些方法的最佳方式仍缺乏共识。为了帮助厘清情况,我们回顾了基因调控的主方程模型、基于N.G. 范坎彭和R. 久保的展开方法的理论近似,以及D.T. 吉莱斯皮和P. 朗之万的模拟算法。主方程的展开表明,对于具有单一稳定稳态的系统,随机模型在一阶近似下简化为确定性模型。同样由范坎彭提出的其他理论描述了多稳态系统的渐近行为。为了支持和说明该理论,并进一步深入了解多稳态系统的复杂行为,我们进行了一项详细的模拟研究,比较了应用于具有各种定性特征的合成基因调控系统的各种近似和模拟方法。模拟研究表明,对于具有单一稳态的大型随机系统,确定性模型相当准确,因为解的概率分布有一个单峰跟踪确定性轨迹,其方差与系统大小成反比。在多稳态随机系统中,大的涨落会导致个体轨迹从一个稳态的吸引域逃逸并被另一个稳态吸引,因此系统最终会达到一个多峰概率分布,其中所有稳定稳态都按其相对稳定性成比例地表示。然而,由于逃逸时间与系统大小呈指数关系,在大型系统中这个过程可能需要很长时间。