Voelker Rodger B, Berglund J Andrew
Institute of Molecular Biology, University of Oregon, Eugene, Oregon 97403, USA.
Genome Res. 2007 Jul;17(7):1023-33. doi: 10.1101/gr.6017807. Epub 2007 May 24.
Orthologous mammalian introns contain many highly conserved sequences. Of these sequences, many are likely to represent protein binding sites that are under strong positive selection. In order to identify conserved protein binding sites that are important for splicing, we analyzed the composition of intronic sequences that are conserved between human and six eutherian mammals. We focused on all completely conserved sequences of seven or more nucleotides located in the regions adjacent to splice-junctions. We found that these conserved intronic sequences are enriched in specific motifs, and that many of these motifs are statistically associated with either alternative or constitutive splicing. In validation of our methods, we identified several motifs that are known to play important roles in alternative splicing. In addition, we identified several novel motifs containing GCT that are abundant and are associated with alternative splicing. Furthermore, we demonstrate that, for some of these motifs, conservation is a strong indicator of potential functionality since conserved instances are associated with alternative splicing while nonconserved instances are not. A surprising outcome of this analysis was the identification of a large number of AT-rich motifs that are strongly associated with constitutive splicing. Many of these appear to be novel and may represent conserved intronic splicing enhancers (ISEs). Together these data show that conservation provides important insights into the identification and possible roles of cis-acting intronic sequences important for alternative and constitutive splicing.
直系同源的哺乳动物内含子包含许多高度保守的序列。在这些序列中,许多可能代表处于强烈正选择下的蛋白质结合位点。为了鉴定对剪接重要的保守蛋白质结合位点,我们分析了人类与六种真兽亚纲哺乳动物之间保守的内含子序列的组成。我们关注位于剪接连接点相邻区域的所有七个或更多核苷酸的完全保守序列。我们发现这些保守的内含子序列富含特定基序,并且其中许多基序在统计学上与可变剪接或组成型剪接相关。在对我们方法的验证中,我们鉴定出了几个已知在可变剪接中起重要作用的基序。此外,我们鉴定出了几个富含GCT且与可变剪接相关的新基序。此外,我们证明,对于其中一些基序,保守性是潜在功能的有力指标,因为保守的实例与可变剪接相关,而非保守的实例则不然。该分析的一个令人惊讶的结果是鉴定出大量与组成型剪接密切相关的富含AT的基序。其中许多似乎是新的,可能代表保守的内含子剪接增强子(ISE)。这些数据共同表明,保守性为鉴定对可变剪接和组成型剪接重要的顺式作用内含子序列及其可能的作用提供了重要见解。