Comeron J M, Aguadé M
Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, Av. Diagonal 645, 08071 Barcelona, Spain. jcomeron@midway
J Mol Evol. 1998 Sep;47(3):268-74. doi: 10.1007/pl00006384.
Synonymous codons are not generally used at equal frequencies, and this trend is observed for most genes and organisms. Several methods have been proposed and used to estimate the degree of the nonrandom use of the different synonymous codons. The estimates obtained by these methods, however, show different levels of both precision and dispersion when coding regions of a finite number of codons are under analysis. Here, we present a study, based on computer simulation, of how the different methods proposed to evaluate the nonrandom use of synonymous codons are affected by the length of the coding region analyzed. The results show that some of these methods are heavily influenced by the number of codons and that the comparison of codon usage bias between coding regions of different lengths shows a methodological bias under different conditions of nonrandom use of synonymous codons. The study of the dispersion of the estimates obtained by the different methods gives, on the other hand, an indication of the methods to be applied to compare values of codon usage bias among coding regions of equivalent length.
同义密码子的使用频率通常并不相同,这种趋势在大多数基因和生物体中都能观察到。人们已经提出并使用了几种方法来估计不同同义密码子的非随机使用程度。然而,当分析有限数量密码子的编码区域时,通过这些方法获得的估计值在精度和离散度上都表现出不同的水平。在此,我们基于计算机模拟进行了一项研究,探究用于评估同义密码子非随机使用的不同方法是如何受到所分析编码区域长度影响的。结果表明,其中一些方法受密码子数量的影响很大,并且在同义密码子非随机使用的不同条件下,不同长度编码区域之间密码子使用偏好的比较显示出方法上的偏差。另一方面,对不同方法获得的估计值离散度的研究,为比较等长编码区域之间密码子使用偏好值时应采用的方法提供了一个指示。