Institute of Organismic and Molecular Evolution (iOME), Anthropology, Johannes Gutenberg University Mainz, Anselm-Franz-von-Bentzel-Weg 7, 55099 Mainz, Germany.
Open Biol. 2019 May 31;9(5):190020. doi: 10.1098/rsob.190020. Epub 2019 May 29.
Codon composition, GC content and local RNA secondary structures can have a profound effect on gene expression, and mutations affecting these parameters, even though they do not alter the protein sequence, are not neutral in terms of selection. Although evidence exists that, in some cases, selection favours more stable RNA secondary structures, we currently lack a concrete idea of how many genes are affected within a species, and whether this is a universal phenomenon in nature. We searched for signs of structural selection in a global manner, analysing a set of 1 million coding sequences from 73 species representing all domains of life, as well as viruses, by means of our newly developed software PACKEIS. We show that codon composition and amino acid identity are main determinants of RNA secondary structure. In addition, we show that the arrangement of synonymous codons within coding sequences is non-random, yielding extremely high, but also extremely low, RNA structuredness significantly more often than expected by chance. Taken together, we demonstrate that selection for high and low levels of secondary structure is a widespread phenomenon. Our results provide another line of evidence that synonymous mutations are less neutral than commonly thought, which is of importance for many evolutionary models.
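The test for non-random arrangement of synonymous codons described above can be illustrated with a small permutation sketch. This is not PACKEIS and makes no claim about how that software works. It is a minimal example under stated assumptions: codons are shuffled only among positions encoding the same amino acid, so protein sequence and codon composition are preserved, and a toy Nussinov-style count of nested base pairs stands in for a proper thermodynamic structure prediction. All function names (`shuffle_synonymous`, `max_base_pairs`, `structure_p_values`) are hypothetical.

```python
import random
from collections import defaultdict

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}


def translate(cds):
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def shuffle_synonymous(cds, rng):
    """Permute codons among positions that encode the same amino acid.

    Assumption of this sketch: this keeps the protein sequence and the
    overall codon composition fixed and changes only codon arrangement.
    """
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    positions = defaultdict(list)
    for idx, codon in enumerate(codons):
        positions[CODON_TABLE[codon]].append(idx)
    shuffled = codons[:]
    for idxs in positions.values():
        pool = [codons[i] for i in idxs]
        rng.shuffle(pool)
        for i, codon in zip(idxs, pool):
            shuffled[i] = codon
    return "".join(shuffled)


def max_base_pairs(seq, min_loop=3):
    """Toy structuredness proxy: maximum number of nested base pairs.

    Nussinov-style dynamic programme with Watson-Crick and G-U pairs.
    A real analysis would use a thermodynamic folding tool instead.
    """
    rna = seq.replace("T", "U")
    n = len(rna)
    if n == 0:
        return 0
    pairs = {("A", "U"), ("U", "A"), ("G", "C"),
             ("C", "G"), ("G", "U"), ("U", "G")}
    dp = [[0] * n for _ in range(n)]
    for span in range(min_loop + 1, n):
        for i in range(n - span):
            j = i + span
            best = dp[i][j - 1]  # case: position j stays unpaired
            for k in range(i, j - min_loop):  # case: j pairs with k
                if (rna[k], rna[j]) in pairs:
                    left = dp[i][k - 1] if k > i else 0
                    best = max(best, left + 1 + dp[k + 1][j - 1])
            dp[i][j] = best
    return dp[0][n - 1]


def structure_p_values(cds, n_shuffles=200, seed=0):
    """Empirical one-sided p-values for unusually high or low structuredness."""
    rng = random.Random(seed)
    observed = max_base_pairs(cds)
    null = [max_base_pairs(shuffle_synonymous(cds, rng))
            for _ in range(n_shuffles)]
    p_high = (1 + sum(s >= observed for s in null)) / (n_shuffles + 1)
    p_low = (1 + sum(s <= observed for s in null)) / (n_shuffles + 1)
    return observed, p_high, p_low
```

A small `p_high` would suggest that the native codon arrangement is more structured than expected by chance, and a small `p_low` that it is less structured. Because the null sequences share the protein sequence and codon composition of the original, any difference can only come from the order of synonymous codons, which is the effect the abstract attributes to selection.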