Department of Computational Biology, Institut Pasteur, Paris, France.
Biosciences, Living Systems Institute, University of Exeter, Exeter, The United Kingdom.
PLoS Comput Biol. 2022 Mar 18;18(3):e1009950. doi: 10.1371/journal.pcbi.1009950. eCollection 2022 Mar.
Understanding and characterising biochemical processes inside single cells requires experimental platforms that allow one to perturb and observe the dynamics of such processes as well as computational methods to build and parameterise models from the collected data. Recent progress with experimental platforms and optogenetics has made it possible to expose each cell in an experiment to an individualised input and automatically record cellular responses over days with fine time resolution. However, methods to infer parameters of stochastic kinetic models from single-cell longitudinal data have generally been developed under the assumption that experimental data is sparse and that responses of cells to at most a few different input perturbations can be observed. Here, we investigate and compare different approaches for calculating parameter likelihoods of single-cell longitudinal data based on approximations of the chemical master equation (CME) with a particular focus on coupling the linear noise approximation (LNA) or moment closure methods to a Kalman filter. We show that, as long as cells are measured sufficiently frequently, coupling the LNA to a Kalman filter allows one to accurately approximate likelihoods and to infer model parameters from data even in cases where the LNA provides poor approximations of the CME. Furthermore, the computational cost of filtering-based iterative likelihood evaluation scales advantageously in the number of measurement times and different input perturbations and is thus ideally suited for data obtained from modern experimental platforms. To demonstrate the practical usefulness of these results, we perform an experiment in which single cells, equipped with an optogenetic gene expression system, are exposed to various different light-input sequences and measured at several hundred time points and use parameter inference based on iterative likelihood evaluation to parameterise a stochastic model of the system.
理解和描述单个细胞内的生化过程需要实验平台,这些平台允许人们干扰和观察这些过程的动态,以及构建和参数化模型的计算方法,这些模型是基于收集的数据。最近,实验平台和光遗传学的进展使得人们有可能将实验中的每个细胞暴露于个性化的输入,并自动以精细的时间分辨率记录多天的细胞反应。然而,从单细胞纵向数据推断随机动力学模型参数的方法通常是在实验数据稀疏的假设下开发的,并且可以观察到细胞对最多几个不同输入扰动的反应。在这里,我们研究和比较了基于化学主方程(CME)近似的不同方法来计算单细胞纵向数据的参数似然,特别关注将线性噪声近似(LNA)或矩闭合法耦合到卡尔曼滤波器。我们表明,只要细胞被足够频繁地测量,将 LNA 耦合到卡尔曼滤波器可以允许从数据中准确地近似似然,并推断模型参数,即使在 LNA 对 CME 提供较差近似的情况下也是如此。此外,基于滤波的迭代似然评估的计算成本在测量次数和不同输入扰动的数量上具有优势,因此非常适合从现代实验平台获得的数据。为了演示这些结果的实际有用性,我们进行了一项实验,其中单个细胞配备了光遗传学基因表达系统,然后将其暴露于各种不同的光输入序列,并在几百个时间点进行测量,并使用基于迭代似然评估的参数推断来参数化系统的随机模型。