The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath BA2 7AY, United Kingdom.
Genome Res. 2018 Oct;28(10):1442-1454. doi: 10.1101/gr.233999.117. Epub 2018 Aug 24.
What proportion of coding sequence nucleotides have roles in splicing, and how strong is the selection that maintains them? Despite a large body of research into exonic splice regulatory signals, these questions have not been answered. This is because, to our knowledge, previous investigations have not explicitly disentangled the frequency of splice regulatory elements from the strength of the evolutionary constraint under which they evolve. Current data are consistent both with a scenario of weak and diffuse constraint, enveloping large swaths of sequence, as well as with well-defined pockets of strong purifying selection. In the former case, natural selection on exonic splice enhancers (ESEs) might primarily act as a slight modifier of codon usage bias. In the latter, mutations that disrupt ESEs are likely to have large fitness and, potentially, clinical effects. To distinguish between these scenarios, we used several different methods to determine the distribution of selection coefficients for new mutations within ESEs. The analyses converged to suggest that ∼15%-20% of fourfold degenerate sites are part of functional ESEs. Most of these sites are under strong evolutionary constraint. Therefore, exonic splice regulation does not simply impose a weak bias that gently nudges coding sequence evolution in a particular direction. Rather, the selection to preserve these motifs is a strong force that severely constrains the evolution of a substantial proportion of coding nucleotides. Thus synonymous mutations that disrupt ESEs should be considered as a potentially common cause of single-locus genetic disorders.
有多少编码序列核苷酸在剪接中起作用,维持它们的选择压力有多强?尽管有大量关于外显子剪接调控信号的研究,但这些问题仍未得到解答。这是因为据我们所知,以前的研究并没有明确区分剪接调控元件的频率与它们进化所经历的进化约束强度。目前的数据与以下两种情况都一致:一种是约束较弱且弥散,覆盖了大片序列;另一种是具有明确的强净化选择的“热点”。在前一种情况下,外显子剪接增强子 (ESE) 上的自然选择可能主要作为密码子使用偏性的轻微修饰。在后一种情况下,破坏 ESE 的突变很可能对适应性有较大影响,并且可能具有临床意义。为了区分这两种情况,我们使用了几种不同的方法来确定 ESE 内新突变的选择系数分布。分析结果表明,四核苷酸简并位点约有 15%-20%是功能性 ESE 的一部分。这些位点中的大多数都受到强烈的进化约束。因此,外显子剪接调控并不是简单地施加一个微弱的偏向,将编码序列的进化推向特定的方向。相反,保留这些模体的选择是一种强大的力量,严重限制了相当一部分编码核苷酸的进化。因此,破坏 ESE 的同义突变应被视为单基因遗传疾病的一个潜在常见原因。