McCollum James M, Peterson Gregory D, Cox Chris D, Simpson Michael L, Samatova Nagiza F
Computational Biology Institute, Oak Ridge National Laboratory, P.O. Box 2008 MS6164, Oak Ridge, TN 37831, USA.
Comput Biol Chem. 2006 Feb;30(1):39-49. doi: 10.1016/j.compbiolchem.2005.10.007.
A key to advancing the understanding of molecular biology in the post-genomic age is the development of accurate predictive models for genetic regulation, protein interaction, metabolism, and other biochemical processes. To facilitate model development, simulation algorithms must provide an accurate representation of the system, while performing the simulation in a reasonable amount of time. Gillespie's stochastic simulation algorithm (SSA) accurately depicts spatially homogeneous models with small populations of chemical species and properly represents noise, but it is often abandoned when modeling larger systems because of its computational complexity. In this work, we examine the performance of different versions of the SSA when applied to several biochemical models. Through our analysis, we discover that transient changes in reaction execution frequencies, which are typical of biochemical models with gene induction and repression, can dramatically affect simulator performance. To account for these shifts, we propose a new algorithm called the sorting direct method that maintains a loosely sorted order of the reactions as the simulation executes. Our measurements show that the sorting direct method performs favorably when compared to other well-known exact stochastic simulation algorithms.
在后基因组时代,推进对分子生物学理解的关键在于开发用于基因调控、蛋白质相互作用、新陈代谢及其他生化过程的精确预测模型。为便于模型开发,模拟算法必须在合理时间内执行模拟的同时,准确呈现系统。吉莱斯皮的随机模拟算法(SSA)能精确描绘化学物种数量较少的空间均匀模型,并恰当地体现噪声,但由于其计算复杂性,在对较大系统建模时常常被弃用。在这项工作中,我们研究了不同版本的SSA应用于多个生化模型时的性能。通过分析,我们发现反应执行频率的瞬态变化(这在具有基因诱导和抑制的生化模型中很典型)会显著影响模拟器性能。为考虑这些变化,我们提出一种名为排序直接法的新算法,该算法在模拟执行时保持反应的松散排序顺序。我们的测量结果表明,与其他知名的精确随机模拟算法相比,排序直接法表现良好。