Coghlan A, Wolfe K H
Department of Genetics, Smurfit Institute, University of Dublin, Trinity College, Dublin 2, Ireland.
Yeast. 2000 Sep 15;16(12):1131-45. doi: 10.1002/1097-0061(20000915)16:12<1131::AID-YEA609>3.0.CO;2-F.
In 1982, Ikemura reported a strikingly unequal usage of different synonymous codons, in five Saccharomyces cerevisiae nuclear genes having high protein levels. To study this trend in detail, we examined data from three independent studies that used oligonucleotide arrays or SAGE to estimate mRNA concentrations for nearly all genes in the genome. Correlation coefficients were calculated for the relationship of mRNA concentration to four commonly used measures of synonymous codon usage bias: the codon adaptation index (CAI), the codon bias index (CBI), the frequency of optimal codons (F(op)), and the effective number of codons (N(c)). mRNA concentration was best approximated as an exponential function of each of these four measures. Of the four, the CAI was the most strongly correlated with mRNA concentration (r(s)=0.62+/-0.01, n=2525, p<10(-17)). When we controlled for CAI, mRNA concentration and protein length were negatively correlated (partial r(s)=-0.23+/-0.01, n=4765, p<10(-17)). This may result from selection to reduce the size of abundant proteins to minimize transcriptional and translational costs. When we controlled for mRNA concentration, protein length and CAI were positively correlated (partial r(s)=0.16+/-0.01, n=4765, p<10(-17)). This may reflect more effective selection in longer genes against missense errors during translation. The correlation coefficients between the mRNA levels of individual genes, as measured by different investigators and methods, were low, in the range r(s)=0.39-0.68.
1982年,池村报道了在五个蛋白质水平较高的酿酒酵母核基因中,不同同义密码子的使用存在显著差异。为了详细研究这一趋势,我们检查了三项独立研究的数据,这些研究使用寡核苷酸阵列或SAGE来估计基因组中几乎所有基因的mRNA浓度。计算了mRNA浓度与四种常用的同义密码子使用偏好测量指标之间的相关系数:密码子适应指数(CAI)、密码子偏好指数(CBI)、最优密码子频率(F(op))和有效密码子数(N(c))。mRNA浓度最好近似为这四种测量指标中每一种的指数函数。在这四种指标中,CAI与mRNA浓度的相关性最强(r(s)=0.62±0.01,n=2525,p<10(-17))。当我们控制CAI时,mRNA浓度与蛋白质长度呈负相关(偏相关系数r(s)=-0.23±0.01,n=4765,p<10(-17))。这可能是由于选择减少丰富蛋白质的大小,以最小化转录和翻译成本。当我们控制mRNA浓度时,蛋白质长度与CAI呈正相关(偏相关系数r(s)=0.16±0.01,n=4765,p<10(-17))。这可能反映了在较长基因中,针对翻译过程中的错义错误进行了更有效的选择。不同研究者和方法测量的单个基因mRNA水平之间的相关系数较低,范围在r(s)=0.39-0.68之间。