Institute of Molecular Life Sciences, University of Zürich, Zürich, Switzerland.
Swiss Institue of Bioinformatics, University of Zürich, Zürich, Switzerland.
Bioinformatics. 2018 Sep 1;34(17):i647-i655. doi: 10.1093/bioinformatics/bty568.
Transcription in single cells is an inherently stochastic process as mRNA levels vary greatly between cells, even for genetically identical cells under the same experimental and environmental conditions. We present a stochastic two-state switch model for the population of mRNA molecules in single cells where genes stochastically alternate between a more active ON state and a less active OFF state. We prove that the stationary solution of such a model can be written as a mixture of a Poisson and a Poisson-beta probability distribution. This finding facilitates inference for single cell expression data, observed at a single time point, from flow cytometry experiments such as FACS or fluorescence in situ hybridization (FISH) as it allows one to sample directly from the equilibrium distribution of the mRNA population. We hence propose a Bayesian inferential methodology using a pseudo-marginal approach and a recent approximation to integrate over unobserved states associated with measurement error.
We provide a general inferential framework which can be widely used to study transcription in single cells from the kind of data arising in flow cytometry experiments. The approach allows us to separate between the intrinsic stochasticity of the molecular dynamics and the measurement noise. The methodology is tested in simulation studies and results are obtained for experimental multiple single cell expression data from FISH flow cytometry experiments.
All analyses were implemented in R. Source code and the experimental data are available at https://github.com/SimoneTiberi/Bayesian-inference-on-stochastic-gene-transcription-from-flow-cytometry-data.
Supplementary data are available at Bioinformatics online.
转录在单细胞中是一个固有随机的过程,因为即使在相同的实验和环境条件下,不同细胞之间的 mRNA 水平差异也很大。我们提出了一种随机二态开关模型,用于单个细胞中的 mRNA 分子群体,其中基因随机地在更活跃的 ON 状态和不那么活跃的 OFF 状态之间交替。我们证明了这种模型的平稳解可以表示为泊松和泊松-β概率分布的混合。这一发现促进了来自流式细胞术实验(如 FACS 或荧光原位杂交(FISH))的单个细胞表达数据的推断,因为它允许直接从 mRNA 群体的平衡分布中采样,这些数据在单个时间点进行了观测。因此,我们提出了一种基于伪边际方法和最近的近似方法的贝叶斯推断方法,用于整合与测量误差相关的未观测状态。
我们提供了一个通用的推断框架,可以广泛用于从流式细胞术实验中产生的数据来研究单细胞中的转录。该方法允许我们区分分子动力学的固有随机性和测量噪声。该方法在模拟研究中进行了测试,并从 FISH 流式细胞术实验的多个单个细胞表达数据中获得了结果。
所有分析均在 R 中实现。源代码和实验数据可在 https://github.com/SimoneTiberi/Bayesian-inference-on-stochastic-gene-transcription-from-flow-cytometry-data 上获得。
补充数据可在生物信息学在线获得。