Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Uppsala, Sweden.
Genome Biol Evol. 2023 Aug 1;15(8). doi: 10.1093/gbe/evad150.
Coding sequence evolution is influenced by both natural selection and neutral evolutionary forces. In many species, the effects of mutation bias, codon usage, and GC-biased gene conversion (gBGC) on gene sequence evolution have not been detailed. Quantification of how these forces shape substitution patterns is therefore necessary to understand the strength and direction of natural selection. Here, we used comparative genomics to investigate the association between base composition and codon usage bias on gene sequence evolution in butterflies and moths (Lepidoptera), including an in-depth analysis of underlying patterns and processes in one species, Leptidea sinapis. The data revealed significant G/C to A/T substitution bias at third codon position with some variation in the strength among different butterfly lineages. However, the substitution bias was lower than expected from previously estimated mutation rate ratios, partly due to the influence of gBGC. We found that A/T-ending codons were overrepresented in most species, but there was a positive association between the magnitude of codon usage bias and GC-content in third codon positions. In addition, the tRNA-gene population in L. sinapis showed higher GC-content at third codon positions compared to coding sequences in general and less overrepresentation of A/T-ending codons. There was an inverse relationship between synonymous substitutions and codon usage bias indicating selection on synonymous sites. We conclude that the evolutionary rate in Lepidoptera is affected by a complex interaction between underlying G/C -> A/T mutation bias and partly counteracting fixation biases, predominantly conferred by overall purifying selection, gBGC, and selection on codon usage.
编码序列的进化受到自然选择和中性进化力量的影响。在许多物种中,突变偏向、密码子使用和 GC 偏向基因转换 (gBGC) 对基因序列进化的影响尚未详细研究。因此,量化这些力量如何塑造替代模式对于理解自然选择的强度和方向是必要的。在这里,我们使用比较基因组学来研究蝴蝶和蛾类(鳞翅目)中碱基组成和密码子使用偏好与基因序列进化之间的关联,包括对一个物种(Leptidea sinapis)的基础模式和过程进行深入分析。数据显示,第三密码子位置存在显著的 G/C 到 A/T 替代偏向,不同蝴蝶谱系之间的强度存在一些差异。然而,替代偏向低于先前估计的突变率比,部分原因是 gBGC 的影响。我们发现,大多数物种中 A/T 结尾的密码子过表达,但密码子使用偏好的程度与第三密码子位置的 GC 含量之间存在正相关。此外,与一般编码序列相比,L. sinapis 的 tRNA 基因群体在第三密码子位置显示出更高的 GC 含量,并且 A/T 结尾的密码子过表达较少。同义替换和密码子使用偏好之间存在反相关关系,表明同义位点存在选择。我们得出结论,鳞翅目昆虫的进化率受到潜在的 G/C->A/T 突变偏向和部分抵消的固定偏向之间的复杂相互作用的影响,主要由整体纯化选择、gBGC 和密码子使用选择引起。