Corvelo André, Eyras Eduardo
Computational Genomics, Universitat Pompeu Fabra, Barcelona, Spain.
Genome Biol. 2008;9(9):R141. doi: 10.1186/gb-2008-9-9-r141.
A large proportion of species-specific exons are alternatively spliced. In primates, Alu elements play a crucial role in the process of exon creation but many new exons have appeared through other mechanisms. Despite many recent studies, it is still unclear which are the splicing regulatory requirements for de novo exonization and how splicing regulation changes throughout an exon's lifespan.
Using comparative genomics, we have defined sets of exons with different evolutionary ages. Younger exons have weaker splice-sites and lower absolute values for the relative abundance of putative splicing regulators between exonic and adjacent intronic regions, indicating a less consolidated splicing regulation. This relative abundance is shown to increase with exon age, leading to higher exon inclusion. We show that this local difference in the density of regulators might be of biological significance, as it outperforms other measures in real exon versus pseudo-exon classification. We apply this new measure to the specific case of the exonization of anti-sense Alu elements and show that they are characterized by a general lack of exonic splicing silencers.
Our results suggest that specific sequence environments are required for exonization and that these can change with time. We propose a model of exon creation and establishment in human genes, in which splicing decisions depend on the relative local abundance of regulatory motifs. Using this model, we provide further explanation as to why Alu elements serve as a major substrate for exon creation in primates. Finally, we discuss the benefits of integrating such information in gene prediction.
很大一部分物种特异性外显子存在可变剪接。在灵长类动物中,Alu元件在新外显子的产生过程中发挥着关键作用,但许多新外显子是通过其他机制出现的。尽管最近有许多研究,但对于从头外显子化的剪接调控要求以及剪接调控在整个外显子生命周期中如何变化仍不清楚。
通过比较基因组学,我们定义了具有不同进化年龄的外显子集。较年轻的外显子具有较弱的剪接位点,外显子区域与相邻内含子区域之间假定剪接调节因子的相对丰度绝对值较低,这表明剪接调控不太稳定。这种相对丰度随着外显子年龄的增加而增加,导致外显子包含率更高。我们表明,调节因子密度的这种局部差异可能具有生物学意义,因为在实际外显子与假外显子分类中,它优于其他指标。我们将这一新指标应用于反义Alu元件外显子化的具体情况,结果表明它们的特征是普遍缺乏外显子剪接沉默子。
我们的结果表明,外显子化需要特定的序列环境,并且这些环境会随时间变化。我们提出了一个人类基因中外显子产生和确立的模型,其中剪接决定取决于调节基序的相对局部丰度。利用这个模型,我们进一步解释了为什么Alu元件是灵长类动物新外显子产生的主要底物。最后,我们讨论了将这些信息整合到基因预测中的好处。