Chaney Julie L, Steele Aaron, Carmichael Rory, Rodriguez Anabel, Specht Alicia T, Ngo Kim, Li Jun, Emrich Scott, Clark Patricia L
Department of Chemistry & Biochemistry, University of Notre Dame, Notre Dame, Indiana, United States of America.
Department of Computer Science & Engineering, University of Notre Dame, Notre Dame, Indiana, United States of America.
PLoS Comput Biol. 2017 May 5;13(5):e1005531. doi: 10.1371/journal.pcbi.1005531. eCollection 2017 May.
Synonymous rare codons are considered to be sub-optimal for gene expression because they are translated more slowly than common codons. Yet surprisingly, many protein coding sequences include large clusters of synonymous rare codons. Rare codons at the 5' terminus of coding sequences have been shown to increase translational efficiency. Although a general functional role for synonymous rare codons farther within coding sequences has not yet been established, several recent reports have identified rare-to-common synonymous codon substitutions that impair folding of the encoded protein. Here we test the hypothesis that although the usage frequencies of synonymous codons change from organism to organism, codon rarity will be conserved at specific positions in a set of homologous coding sequences, for example to tune translation rate without altering a protein sequence. Such conservation of rarity-rather than specific codon identity-could coordinate co-translational folding of the encoded protein. We demonstrate that many rare codon cluster positions are indeed conserved within homologous coding sequences across diverse eukaryotic, bacterial, and archaeal species, suggesting they result from positive selection and have a functional role. Most conserved rare codon clusters occur within rather than between conserved protein domains, challenging the view that their primary function is to facilitate co-translational folding after synthesis of an autonomous structural unit. Instead, many conserved rare codon clusters separate smaller protein structural motifs within structural domains. These smaller motifs typically fold faster than an entire domain, on a time scale more consistent with translation rate modulation by synonymous codon usage. While proteins with conserved rare codon clusters are structurally and functionally diverse, they are enriched in functions associated with organism growth and development, suggesting an important role for synonymous codon usage in organism physiology. The identification of conserved rare codon clusters advances our understanding of distinct, functional roles for otherwise synonymous codons and enables experimental testing of the impact of synonymous codon usage on the production of functional proteins.
同义稀有密码子被认为对基因表达而言并非最优选择,因为它们的翻译速度比常见密码子更慢。然而令人惊讶的是,许多蛋白质编码序列包含大量同义稀有密码子簇。编码序列5'端的稀有密码子已被证明可提高翻译效率。虽然编码序列中更靠后的同义稀有密码子的一般功能作用尚未确定,但最近的几份报告已经鉴定出一些从稀有到常见的同义密码子替换,这些替换会损害所编码蛋白质的折叠。在这里,我们检验这样一个假设:尽管同义密码子的使用频率因生物体而异,但在一组同源编码序列的特定位置,密码子的稀有性将得以保留,例如用于调节翻译速率而不改变蛋白质序列。这种稀有性的保留——而非特定密码子的一致性——可以协调所编码蛋白质的共翻译折叠。我们证明,许多稀有密码子簇位置在不同的真核生物、细菌和古细菌物种的同源编码序列中确实是保守的,这表明它们是正选择的结果并且具有功能作用。大多数保守的稀有密码子簇出现在保守蛋白质结构域内部而非之间,这对它们的主要功能是在自主结构单元合成后促进共翻译折叠这一观点提出了挑战。相反,许多保守的稀有密码子簇在结构域内分隔较小的蛋白质结构基序。这些较小的基序通常比整个结构域折叠得更快,其时间尺度与同义密码子使用对翻译速率的调节更一致。虽然具有保守稀有密码子簇的蛋白质在结构和功能上各不相同,但它们在与生物体生长和发育相关的功能中富集,这表明同义密码子使用在生物体生理学中具有重要作用。保守稀有密码子簇的鉴定推进了我们对原本同义密码子的不同功能作用的理解,并使得对同义密码子使用对功能性蛋白质产生的影响进行实验测试成为可能。