School of Mathematical Sciences, Queensland University of Technology, Brisbane, QLD 4001, Australia.
Mathematical Institute, University of Oxford, Oxford, UK.
J Theor Biol. 2022 Sep 21;549:111201. doi: 10.1016/j.jtbi.2022.111201. Epub 2022 Jun 22.
Stochastic individual-based mathematical models are attractive for modelling biological phenomena because they naturally capture the stochasticity and variability that is often evident in biological data. Such models also allow us to track the motion of individuals within the population of interest. Unfortunately, capturing this microscopic detail means that simulation and parameter inference can become computationally expensive. One approach for overcoming this computational limitation is to coarse-grain the stochastic model to provide an approximate continuum model that can be solved using far less computational effort. However, coarse-grained continuum models can be biased or inaccurate, particularly for certain parameter regimes. In this work, we combine stochastic and continuum mathematical models in the context of lattice-based models of two-dimensional cell biology experiments by demonstrating how to simulate two commonly used experiments: cell proliferation assays and barrier assays. Our approach involves building a simple statistical model of the discrepancy between the expensive stochastic model and the associated computationally inexpensive coarse-grained continuum model. We form this statistical model based on a limited number of expensive stochastic model evaluations at design points sampled from a user-chosen distribution, corresponding to a computer experiment design problem. With straightforward design point selection schemes, we show that using the statistical model of the discrepancy in tandem with the computationally inexpensive continuum model allows us to carry out prediction and inference while correcting for biases and inaccuracies due to the continuum approximation. We demonstrate this approach by simulating a proliferation assay, where the continuum limit model is the well-known logistic ordinary differential equation, as well as a barrier assay where the continuum limit model is closely related to the well-known Fisher-KPP partial differential equation. We construct an approximate likelihood function for parameter inference, both with and without discrepancy correction terms. Using maximum likelihood estimation, we provide point estimates of the unknown parameters, and use the profile likelihood to characterise the uncertainty in these estimates and form approximate confidence intervals. For the range of inference problems considered, working with the continuum limit model alone leads to biased parameter estimation and confidence intervals with poor coverage. In contrast, incorporating correction terms arising from the statistical model of the model discrepancy allows us to recover the parameters accurately with minimal computational overhead. The main tradeoff is that the associated confidence intervals are typically broader, reflecting the additional uncertainty introduced by the approximation process. All algorithms required to replicate the results in this work are written in the open source Julia language and are available at GitHub.
随机个体基础数学模型在生物现象建模方面具有吸引力,因为它们可以自然地捕捉到生物数据中经常出现的随机性和可变性。这种模型还允许我们跟踪感兴趣的种群中个体的运动。不幸的是,捕捉这种微观细节意味着模拟和参数推断可能变得计算昂贵。克服这种计算限制的一种方法是对随机模型进行粗化,以提供可以使用更少计算工作量来解决的近似连续模型。然而,粗化连续模型可能存在偏差或不准确,特别是对于某些参数范围。在这项工作中,我们在二维细胞生物学实验的基于晶格模型的上下文中结合了随机和连续数学模型,展示了如何模拟两个常用的实验:细胞增殖测定和屏障测定。我们的方法涉及构建昂贵的随机模型与相关计算上便宜的粗化连续模型之间差异的简单统计模型。我们基于从用户选择的分布中采样的设计点的有限数量的昂贵随机模型评估来形成这种统计模型,对应于计算机实验设计问题。通过简单的设计点选择方案,我们表明,使用差异的统计模型与计算上便宜的连续模型结合使用,可以在纠正连续近似引起的偏差和不准确的同时进行预测和推断。我们通过模拟增殖测定来演示这种方法,其中连续极限模型是众所周知的 logistic 常微分方程,以及屏障测定,其中连续极限模型与著名的 Fisher-KPP 偏微分方程密切相关。我们为参数推断构建了一个近似似然函数,包括和不包括差异校正项。使用最大似然估计,我们提供未知参数的点估计,并使用轮廓似然来描述这些估计的不确定性并形成近似置信区间。对于考虑的推断问题范围,单独使用连续极限模型会导致参数估计有偏差,置信区间覆盖率差。相比之下,包含源自模型差异的统计模型的校正项允许我们以最小的计算开销准确地恢复参数。主要的权衡是关联的置信区间通常更宽,反映了逼近过程引入的额外不确定性。要复制本工作中的结果所需的所有算法均用开源 Julia 语言编写,并可在 GitHub 上获得。