Warne David J, Baker Ruth E, Simpson Matthew J
School of Mathematical Sciences, Queensland University of Technology, Brisbane, Queensland 4001, Australia.
Mathematical Institute, University of Oxford, Oxford, OX2 6GG, United Kingdom.
J Theor Biol. 2020 Jul 7;496:110255. doi: 10.1016/j.jtbi.2020.110255. Epub 2020 Mar 26.
For many stochastic models of interest in systems biology, such as those describing biochemical reaction networks, exact quantification of parameter uncertainty through statistical inference is intractable. Likelihood-free computational inference techniques enable parameter inference when the likelihood function for the model is intractable but the generation of many sample paths is feasible through stochastic simulation of the forward problem. The most common likelihood-free method in systems biology is approximate Bayesian computation that accepts parameters that result in low discrepancy between stochastic simulations and measured data. However, it can be difficult to assess how the accuracy of the resulting inferences are affected by the choice of acceptance threshold and discrepancy function. The pseudo-marginal approach is an alternative likelihood-free inference method that utilises a Monte Carlo estimate of the likelihood function. This approach has several advantages, particularly in the context of noisy, partially observed, time-course data typical in biochemical reaction network studies. Specifically, the pseudo-marginal approach facilitates exact inference and uncertainty quantification, and may be efficiently combined with particle filters for low variance, high-accuracy likelihood estimation. In this review, we provide a practical introduction to the pseudo-marginal approach using inference for biochemical reaction networks as a series of case studies. Implementations of key algorithms and examples are provided using the Julia programming language; a high performance, open source programming language for scientific computing (https://github.com/davidwarne/Warne2019_GuideToPseudoMarginal).
对于系统生物学中许多感兴趣的随机模型,例如那些描述生化反应网络的模型,通过统计推断对参数不确定性进行精确量化是难以处理的。当模型的似然函数难以处理但通过正向问题的随机模拟生成许多样本路径是可行时,无似然计算推断技术能够进行参数推断。系统生物学中最常见的无似然方法是近似贝叶斯计算,它接受那些在随机模拟和测量数据之间导致低差异的参数。然而,可能很难评估所得推断的准确性是如何受到接受阈值和差异函数选择的影响的。伪边缘方法是一种替代的无似然推断方法,它利用似然函数的蒙特卡罗估计。这种方法有几个优点,特别是在生化反应网络研究中典型的有噪声、部分观测的时间序列数据的背景下。具体来说,伪边缘方法有助于精确推断和不确定性量化,并且可以与粒子滤波器有效地结合以进行低方差、高精度的似然估计。在这篇综述中,我们通过将生化反应网络的推断作为一系列案例研究,对伪边缘方法进行了实际介绍。使用Julia编程语言提供了关键算法的实现和示例;Julia是一种用于科学计算的高性能开源编程语言(https://github.com/davidwarne/Warne2019_GuideToPseudoMarginal)。