Department of Computer Science, University of California Santa Barbara, Santa Barbara, California 93106, USA.
J Chem Phys. 2011 Jan 28;134(4):044110. doi: 10.1063/1.3522769.
In biochemical systems, the occurrence of a rare event can be accompanied by catastrophic consequences. Precise characterization of these events using Monte Carlo simulation methods is often intractable, as the number of realizations needed to witness even a single rare event can be very large. The weighted stochastic simulation algorithm (wSSA) [J. Chem. Phys. 129, 165101 (2008)] and its subsequent extension [J. Chem. Phys. 130, 174103 (2009)] alleviate this difficulty with importance sampling, which effectively biases the system toward the desired rare event. However, extensive computation coupled with substantial insight into a given system is required, as there is currently no automatic approach for choosing wSSA parameters. We present a novel modification of the wSSA--the doubly weighted SSA (dwSSA)--that makes possible a fully automated parameter selection method. Our approach uses the information-theoretic concept of cross entropy to identify parameter values yielding minimum variance rare event probability estimates. We apply the method to four examples: a pure birth process, a birth-death process, an enzymatic futile cycle, and a yeast polarization model. Our results demonstrate that the proposed method (1) enables probability estimation for a class of rare events that cannot be interrogated with the wSSA, and (2) for all examples tested, reduces the number of runs needed to achieve comparable accuracy by multiple orders of magnitude. For a particular rare event in the yeast polarization model, our method transforms a projected simulation time of 600 years to three hours. Furthermore, by incorporating information-theoretic principles, our approach provides a framework for the development of more sophisticated influencing schemes that should further improve estimation accuracy.
在生化系统中,罕见事件的发生可能伴随着灾难性的后果。使用蒙特卡罗模拟方法精确地描述这些事件通常是难以处理的,因为即使要观察到一个罕见事件,所需的实现数量也可能非常大。加权随机模拟算法(wSSA)[J. Chem. Phys. 129, 165101 (2008)]及其随后的扩展[J. Chem. Phys. 130, 174103 (2009)]通过重要性抽样来缓解这一困难,这有效地使系统偏向于所需的罕见事件。然而,需要广泛的计算和对给定系统的大量了解,因为目前还没有自动选择 wSSA 参数的方法。我们提出了 wSSA 的一种新的修改方法——双重加权 SSA(dwSSA)——它使得全自动参数选择方法成为可能。我们的方法使用信息论中的交叉熵概念来确定产生最小方差罕见事件概率估计的参数值。我们将该方法应用于四个示例:纯生过程、生死过程、酶无效循环和酵母极化模型。我们的结果表明,该方法(1)能够对一类不能用 wSSA 询问的罕见事件进行概率估计;(2)对于所有测试的示例,通过多个数量级减少了达到可比精度所需的运行次数。对于酵母极化模型中的一个特定罕见事件,我们的方法将模拟时间从 600 年减少到 3 小时。此外,通过纳入信息论原理,我们的方法为开发更复杂的影响方案提供了一个框架,这应该进一步提高估计精度。