Department of Bioengineering, University of California, Berkeley, CA 94720, USA, California Institute for Quantitative Biosciences, University of California, Berkeley, CA 94720, USA, Computer Science and Technology Center, School of Engineering, University of Minho, Campus de Gualtar, 4710-057 Braga, Portugal and Physical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.
Nucleic Acids Res. 2014 Apr;42(8):4791-9. doi: 10.1093/nar/gku126. Epub 2014 Feb 7.
The range over which a protein is expressed, and its cell-to-cell variability, is often thought to be linked to the demand for its activity. Steady-state protein level is determined by multiple mechanisms controlling transcription and translation, many of which are limited by DNA- and RNA-encoded signals that affect initiation, elongation and termination of polymerases and ribosomes. We performed a comprehensive analysis of >100 sequence features to derive a predictive model composed of a minimal non-redundant set of factors explaining 66% of the total variation of protein abundance observed in >800 genes in Escherichia coli. The model suggests that protein abundance is primarily determined by the transcript level (53%) and by effectors of translation elongation (12%), whereas only a small fraction of the variation is explained by translational initiation (1%). Our analyses uncover a new sequence determinant, not previously described, affecting translation initiation and suggest that elongation rate is affected by both codon biases and specific amino acid composition. We also show that transcription and translation efficiency may have an effect on expression noise, which is more similar than previously assumed.
蛋白质表达的范围及其细胞间的可变性,通常被认为与其活性的需求有关。稳态蛋白质水平由控制转录和翻译的多种机制决定,其中许多机制受到影响聚合酶和核糖体起始、延伸和终止的 DNA 和 RNA 编码信号的限制。我们对超过 100 个序列特征进行了全面分析,得出了一个预测模型,该模型由一组最小的非冗余因子组成,解释了在大肠杆菌中超过 800 个基因中观察到的蛋白质丰度总变异的 66%。该模型表明,蛋白质丰度主要由转录水平(53%)和翻译延伸的效应物(12%)决定,而翻译起始(1%)仅能解释一小部分变异。我们的分析揭示了一个以前未描述的新的序列决定因素,它影响翻译起始,并表明延伸率受密码子偏性和特定氨基酸组成的影响。我们还表明,转录和翻译效率可能对表达噪声有影响,其相似性比以前假设的要高。