The Key Laboratory of Advanced Control and Optimization for Chemical Processes, Ministry of Education, East China University of Science and Technology, Shanghai 200237, People's Republic of China.
School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3BF, United Kingdom.
Proc Natl Acad Sci U S A. 2020 Mar 3;117(9):4682-4692. doi: 10.1073/pnas.1910888117. Epub 2020 Feb 18.
The stochasticity of gene expression presents significant challenges to the modeling of genetic networks. A two-state model describing promoter switching, transcription, and messenger RNA (mRNA) decay is the standard model of stochastic mRNA dynamics in eukaryotic cells. Here, we extend this model to include mRNA maturation, cell division, gene replication, dosage compensation, and growth-dependent transcription. We derive expressions for the time-dependent distributions of nascent mRNA and mature mRNA numbers, provided two assumptions hold: 1) nascent mRNA dynamics are much faster than those of mature mRNA; and 2) gene-inactivation events occur far more frequently than gene-activation events. We confirm that thousands of eukaryotic genes satisfy these assumptions by using data from yeast, mouse, and human cells. We use the expressions to perform a sensitivity analysis of the coefficient of variation of mRNA fluctuations averaged over the cell cycle, for a large number of genes in mouse embryonic stem cells, identifying degradation and gene-activation rates as the most sensitive parameters. Furthermore, it is shown that, despite the model's complexity, the time-dependent distributions predicted by our model are generally well approximated by the negative binomial distribution. Finally, we extend our model to include translation, protein decay, and auto-regulatory feedback, and derive expressions for the approximate time-dependent protein-number distributions, assuming slow protein decay. Our expressions enable us to study how complex biological processes contribute to the fluctuations of gene products in eukaryotic cells, as well as allowing a detailed quantitative comparison with experimental data via maximum-likelihood methods.
基因表达的随机性给遗传网络的建模带来了重大挑战。描述启动子开关、转录和信使 RNA(mRNA)降解的两状态模型是真核细胞中随机 mRNA 动力学的标准模型。在这里,我们将这个模型扩展到包括 mRNA 成熟、细胞分裂、基因复制、剂量补偿和依赖生长的转录。我们推导出了新生 mRNA 和成熟 mRNA 数量的时变分布表达式,前提是两个假设成立:1)新生 mRNA 动力学比成熟 mRNA 快得多;2)基因失活事件发生的频率远高于基因激活事件。我们通过使用来自酵母、小鼠和人类细胞的数据证实了数千个真核基因满足这些假设。我们使用这些表达式对大量小鼠胚胎干细胞中的基因进行了 mRNA 波动的变异系数的敏感性分析,确定降解和基因激活率是最敏感的参数。此外,尽管模型很复杂,但我们的模型预测的时变分布通常可以很好地用负二项分布来近似。最后,我们将模型扩展到包括翻译、蛋白质降解和自调节反馈,并推导出了假设蛋白质降解缓慢时近似的时变蛋白质数量分布的表达式。我们的表达式使我们能够研究复杂的生物过程如何导致真核细胞中基因产物的波动,并且可以通过最大似然方法与实验数据进行详细的定量比较。