Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA.
Department of Genetics, Rutgers University, Piscataway, NJ, USA.
Mol Biol Evol. 2023 Aug 3;40(8). doi: 10.1093/molbev/msad169.
Variation in gene expression across lineages is thought to explain much of the observed phenotypic variation and adaptation. The protein is closer to the target of natural selection but gene expression is typically measured as the amount of mRNA. The broad assumption that mRNA levels are good proxies for protein levels has been undermined by a number of studies reporting moderate or weak correlations between the two measures across species. One biological explanation for this discrepancy is that there has been compensatory evolution between the mRNA level and regulation of translation. However, we do not understand the evolutionary conditions necessary for this to occur nor the expected strength of the correlation between mRNA and protein levels. Here, we develop a theoretical model for the coevolution of mRNA and protein levels and investigate the dynamics of the model over time. We find that compensatory evolution is widespread when there is stabilizing selection on the protein level; this observation held true across a variety of regulatory pathways. When the protein level is under directional selection, the mRNA level of a gene and the translation rate of the same gene were negatively correlated across lineages but positively correlated across genes. These findings help explain results from comparative studies of gene expression and potentially enable researchers to disentangle biological and statistical hypotheses for the mismatch between transcriptomic and proteomic data.
基因表达在谱系间的变化被认为可以解释大部分观察到的表型变异和适应。蛋白质更接近自然选择的目标,但基因表达通常是作为 mRNA 的量来测量的。大量研究报告称,在不同物种之间,这两种测量方法之间存在中度或弱相关性,这一广泛的假设表明 mRNA 水平是蛋白质水平的良好替代物,但这一假设已被推翻。这种差异的一个生物学解释是,mRNA 水平和翻译调节之间存在补偿性进化。然而,我们并不了解这种情况发生所需的进化条件,也不了解 mRNA 和蛋白质水平之间的相关性的预期强度。在这里,我们为 mRNA 和蛋白质水平的共同进化开发了一个理论模型,并研究了模型随时间的动态变化。我们发现,当蛋白质水平受到稳定选择时,补偿性进化是普遍存在的;这种观察结果在各种调节途径中都是成立的。当蛋白质水平受到定向选择时,一个基因的 mRNA 水平和同一基因的翻译率在谱系间呈负相关,但在基因间呈正相关。这些发现有助于解释基因表达的比较研究结果,并可能使研究人员能够区分转录组和蛋白质组数据不匹配的生物学和统计学假设。