Frappat L, Minichini C, Sciarrino A, Sorba P
Laboratoire d'Annecy-le-Vieux de Physique Théorique LAPTH, CNRS, UMR 5108 associée à l'Université de Savoie, Boîte Postale 110, F-74941 Annecy-le-Vieux Cedex, France.
Phys Rev E Stat Nonlin Soft Matter Phys. 2003 Dec;68(6 Pt 1):061910. doi: 10.1103/PhysRevE.68.061910. Epub 2003 Dec 24.
The distribution functions of codon usage probabilities, computed over all the available GenBank data for 40 eukaryotic biological species and five chloroplasts, are best fitted by the sum of a constant, an exponential, and a linear function in the rank of usage. For mitochondria the analysis is not conclusive. These functions are characterized by parameters that strongly depend on the total guanine and cytosine (GC) content of the coding regions of biological species. It is predicted that the codon usage is the same in all exonic genes with the same GC content. The Shannon entropy for codons, also strongly dependent on the exonic GC content, is computed.
通过对40种真核生物物种和5种叶绿体的所有可用GenBank数据计算密码子使用概率的分布函数,发现其最佳拟合为一个常数、一个指数函数和一个与使用排名相关的线性函数之和。对于线粒体,分析尚无定论。这些函数的特征参数强烈依赖于生物物种编码区的总鸟嘌呤和胞嘧啶(GC)含量。据预测,具有相同GC含量的所有外显子基因中的密码子使用情况相同。还计算了同样强烈依赖于外显子GC含量的密码子香农熵。