Department of Microbiology, University of Illinois at Urbana-Champaign, IL, USA.
Mol Biol Evol. 2010 Apr;27(4):800-10. doi: 10.1093/molbev/msp281. Epub 2009 Dec 17.
Most genomes are heterogeneous in codon usage, so a codon usage study should start by defining the codon usage that is typical to the genome. Although this is commonly taken to be the genomewide average, we propose that the mode-the codon usage that matches the most genes-provides a more useful approximation of the typical codon usage of a genome. We provide a method for estimating the modal codon usage, which utilizes a continuous approximation to the number of matching genes and a simplex optimization. In a survey of bacterial and archaeal genomes, as many as 20% more of the genes in a given genome match the modal codon usage than the average codon usage. We use the mode to examine the evolution of the multireplicon genomes of Agrobacterium tumefaciens C58 and Borrelia burgdorferi B31. In A. tumefaciens, the circular and linear chromosomes are characterized by a common "chromosome-like" codon usage, whereas both plasmids share a distinct "plasmid-like" codon usage. In B. burgdorferi, in addition to different codon-usage biases on the leading and lagging strands of DNA replication found by McInerney (McInerney JO. 1998. Replicational and transcriptional selection on codon usage in Borrelia burgdorferi. Proc Natl Acad Sci USA. 95:10698-10703), we also detect a codon-usage similarity between linear plasmid lp38 and the leading strand of the chromosome and a high similarity among the cp32 family of plasmids.
大多数基因组在密码子使用上具有异质性,因此密码子使用研究应该首先定义典型的基因组密码子使用。尽管通常认为这是全基因组的平均值,但我们建议模式——与最多基因匹配的密码子使用——为基因组典型密码子使用提供了更有用的近似值。我们提供了一种估计模式密码子使用的方法,该方法利用了与匹配基因数量的连续近似值和单纯形优化。在对细菌和古细菌基因组的调查中,给定基因组中多达 20%的基因与模式密码子使用匹配,而不是平均密码子使用。我们使用模式来研究根癌农杆菌 C58 和伯氏疏螺旋体 B31 的多复制子基因组的进化。在根癌农杆菌中,圆形和线性染色体的特征是共同的“染色体样”密码子使用,而两个质粒则具有独特的“质粒样”密码子使用。在伯氏疏螺旋体中,除了 McInerney(McInerney JO. 1998. Replicational and transcriptional selection on codon usage in Borrelia burgdorferi. Proc Natl Acad Sci USA. 95:10698-10703)发现的 DNA 复制的前导链和滞后链上的不同密码子使用偏向性之外,我们还检测到线性质粒 lp38 与染色体的前导链之间的密码子使用相似性,以及 cp32 家族质粒之间的高度相似性。