一个基于突变和选择的简单模型解释了密码子和氨基酸使用的趋势以及基因组内部和之间的GC组成。

A simple model based on mutation and selection explains trends in codon and amino-acid usage and GC composition within and across genomes.

作者信息

Knight R D, Freeland S J, Landweber L F

机构信息

Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ 08544, USA.

出版信息

Genome Biol. 2001;2(4):RESEARCH0010. doi: 10.1186/gb-2001-2-4-research0010. Epub 2001 Mar 22.

DOI:10.1186/gb-2001-2-4-research0010

PMID:11305938

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC31479/

Abstract

BACKGROUND

Correlations between genome composition (in terms of GC content) and usage of particular codons and amino acids have been widely reported, but poorly explained. We show here that a simple model of processes acting at the nucleotide level explains codon usage across a large sample of species (311 bacteria, 28 archaea and 257 eukaryotes). The model quantitatively predicts responses (slope and intercept of the regression line on genome GC content) of individual codons and amino acids to genome composition.

RESULTS

Codons respond to genome composition on the basis of their GC content relative to their synonyms (explaining 71-87% of the variance in response among the different codons, depending on measure). Amino-acid responses are determined by the mean GC content of their codons (explaining 71-79% of the variance). Similar trends hold for genes within a genome. Position-dependent selection for error minimization explains why individual bases respond differently to directional mutation pressure.

CONCLUSIONS

Our model suggests that GC content drives codon usage (rather than the converse). It unifies a large body of empirical evidence concerning relationships between GC content and amino-acid or codon usage in disparate systems. The relationship between GC content and codon and amino-acid usage is ahistorical; it is replicated independently in the three domains of living organisms, reinforcing the idea that genes and genomes at mutation/selection equilibrium reproduce a unique relationship between nucleic acid and protein composition. Thus, the model may be useful in predicting amino-acid or nucleotide sequences in poorly characterized taxa.

摘要

背景

基因组组成（以GC含量衡量）与特定密码子和氨基酸使用之间的相关性已被广泛报道，但解释不足。我们在此表明，一个作用于核苷酸水平的简单过程模型可以解释大量物种样本（311种细菌、28种古细菌和257种真核生物）中的密码子使用情况。该模型定量预测了各个密码子和氨基酸对基因组组成的响应（回归线在基因组GC含量上的斜率和截距）。

结果

密码子根据其相对于同义密码子的GC含量对基因组组成做出响应（根据测量方法不同，解释了不同密码子间71%-87%的响应差异）。氨基酸的响应由其密码子的平均GC含量决定（解释了71%-79%的差异）。基因组内的基因也呈现类似趋势。为使错误最小化而进行的位置依赖性选择解释了为什么单个碱基对定向突变压力的响应不同。

结论

我们的模型表明GC含量驱动密码子使用（而非相反）。它统一了大量关于不同系统中GC含量与氨基酸或密码子使用之间关系的实证证据。GC含量与密码子及氨基酸使用之间的关系不依赖于进化历史；它在生物的三个域中独立复制，强化了这样一种观点，即处于突变/选择平衡状态的基因和基因组再现了核酸与蛋白质组成之间的独特关系。因此，该模型可能有助于预测特征描述不足的分类群中的氨基酸或核苷酸序列。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4467/31479/6935fe3640df/gb-2001-2-4-research0010-1.jpg

相似文献

A simple model based on mutation and selection explains trends in codon and amino-acid usage and GC composition within and across genomes.

Genome Biol. 2001;2(4):RESEARCH0010. doi: 10.1186/gb-2001-2-4-research0010. Epub 2001 Mar 22.

Across bacterial phyla, distantly-related genomes with similar genomic GC content have similar patterns of amino acid usage.

PLoS One. 2011 Mar 10;6(3):e17677. doi: 10.1371/journal.pone.0017677.

The genome of Campylobacter jejuni: codon and amino acid usage.

APMIS. 2003 Jun;111(6):605-18. doi: 10.1034/j.1600-0463.2003.1110603.x.

Thermophilic prokaryotes have characteristic patterns of codon usage, amino acid composition and nucleotide content.

Gene. 2003 Oct 23;317(1-2):39-47. doi: 10.1016/s0378-1119(03)00660-7.

Codon Usage Optimization in the Prokaryotic Tree of Life: How Synonymous Codons Are Differentially Selected in Sequence Domains with Different Expression Levels and Degrees of Conservation.

mBio. 2020 Jul 21;11(4):e00766-20. doi: 10.1128/mBio.00766-20.

Selection on codon usage for error minimization at the protein level.

J Mol Evol. 2004 Sep;59(3):400-15. doi: 10.1007/s00239-004-2634-7.

Coupling between protein level selection and codon usage optimization in the evolution of bacteria and archaea.

mBio. 2014 Mar 25;5(2):e00956-14. doi: 10.1128/mBio.00956-14.

Comparative genomic analysis for nucleotide, codon, and amino acid usage patterns of mycoplasmas.

J Basic Microbiol. 2018 May;58(5):425-439. doi: 10.1002/jobm.201700490. Epub 2018 Mar 14.

Amino acid usage is asymmetrically biased in AT- and GC-rich microbial genomes.

PLoS One. 2013 Jul 26;8(7):e69878. doi: 10.1371/journal.pone.0069878. Print 2013.

GC-biased gene conversion and selection affect GC content in the Oryza genus (rice).

Mol Biol Evol. 2011 Sep;28(9):2695-706. doi: 10.1093/molbev/msr104. Epub 2011 Apr 18.

引用本文的文献

Comparative mitogenomics reveals evolutionary drivers of Strongyloidea nematodes dwelling in gastrointestinal tract.

BMC Genomics. 2025 Sep 1;26(1):793. doi: 10.1186/s12864-025-11980-5.

Codon usage bias is presumably affected by tRNA selection effects in Actinidia polyploidization events.

BMC Genomics. 2025 Jul 23;26(1):685. doi: 10.1186/s12864-025-11873-7.

Macroevolutionary changes in natural selection on codon usage reflect evolution of the tRNA pool across a budding yeast subphylum.

Proc Natl Acad Sci U S A. 2025 Jul 8;122(27):e2419889122. doi: 10.1073/pnas.2419889122. Epub 2025 Jul 1.

Lost in translation: conserved amino acid usage despite extreme codon bias in foraminifera.

mBio. 2025 Apr 9;16(4):e0391624. doi: 10.1128/mbio.03916-24. Epub 2025 Mar 5.

Does metabolic rate influence genome-wide amino acid composition in the course of animal evolution?

Evol Lett. 2024 Nov 8;9(1):137-149. doi: 10.1093/evlett/qrae061. eCollection 2025 Feb.

Phylogeny and divergence time estimation of the subfamily Amphipsyllinae based on the mitogenome.

Front Vet Sci. 2024 Dec 11;11:1494204. doi: 10.3389/fvets.2024.1494204. eCollection 2024.

The first mitogenome of the genus (Siphonaptera: Ceratophyllidae) and its phylogenetic implications.

Parasitology. 2024 Sep;151(10):1085-1095. doi: 10.1017/S0031182024000635. Epub 2024 Dec 3.

Comprehensive Genomics Investigation of Neboviruses Reveals Distinct Codon Usage Patterns and Host Specificity.

Microorganisms. 2024 Mar 29;12(4):696. doi: 10.3390/microorganisms12040696.

Genetic background of adaptation of Crimean-Congo haemorrhagic fever virus to the different tick hosts.

PLoS One. 2024 Apr 25;19(4):e0302224. doi: 10.1371/journal.pone.0302224. eCollection 2024.

How do bacterial endosymbionts work with so few genes?

PLoS Biol. 2024 Apr 16;22(4):e3002577. doi: 10.1371/journal.pbio.3002577. eCollection 2024 Apr.

本文引用的文献

On the probability of fixation of mutant genes in a population.

Genetics. 1962 Jun;47(6):713-9. doi: 10.1093/genetics/47.6.713.

On the genetic basis of variation and heterogeneity of DNA base composition.

Proc Natl Acad Sci U S A. 1962 Apr 15;48(4):582-92. doi: 10.1073/pnas.48.4.582.

Compositional correlation between deoxyribonucleic acid and protein.

Cold Spring Harb Symp Quant Biol. 1961;26:35-43. doi: 10.1101/sqb.1961.026.01.009.

Absence of translationally selected synonymous codon usage bias in Helicobacter pylori.

Microbiology (Reading). 2000 Apr;146 ( Pt 4):851-860. doi: 10.1099/00221287-146-4-851.

Studies on the relationships between the synonymous codon usage and protein secondary structural units.

Biochem Biophys Res Commun. 2000 Mar 24;269(3):692-6. doi: 10.1006/bbrc.2000.2351.

Isochores and the evolutionary genomics of vertebrates.

Gene. 2000 Jan 4;241(1):3-17. doi: 10.1016/s0378-1119(99)00485-0.

Codon usage tabulated from international DNA sequence databases: status for the year 2000.

Nucleic Acids Res. 2000 Jan 1;28(1):292. doi: 10.1093/nar/28.1.292.

Studies of codon usage and tRNA genes of 18 unicellular organisms and quantification of Bacillus subtilis tRNAs: gene expression level and species-specific diversity of codon usage based on multivariate analysis.

Gene. 1999 Sep 30;238(1):143-55. doi: 10.1016/s0378-1119(99)00225-5.

The correlation of protein hydropathy with the base composition of coding sequences.

Gene. 1999 Sep 30;238(1):3-14. doi: 10.1016/s0378-1119(99)00257-7.

Two aspects of DNA base composition: G+C content and translation-coupled deviation from intra-strand rule of A = T and G = C.

J Mol Evol. 1999 Jul;49(1):49-62. doi: 10.1007/pl00006534.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一个基于突变和选择的简单模型解释了密码子和氨基酸使用的趋势以及基因组内部和之间的GC组成。

A simple model based on mutation and selection explains trends in codon and amino-acid usage and GC composition within and across genomes.

作者信息

Knight R D, Freeland S J, Landweber L F

机构信息

Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ 08544, USA.