Long M, de Souza S J, Rosenberg C, Gilbert W
Department of Molecular and Cellular Biology, The Biological Laboratories, Harvard University, Cambridge, MA 02138, USA.
Proc Natl Acad Sci U S A. 1998 Jan 6;95(1):219-23. doi: 10.1073/pnas.95.1.219.
The coding sequence at the boundaries of exons flanking nuclear introns shows some degree of conservation. To the extent that such sequences might be recognized by the splicing machinery, this conservation may be a derived result of evolution for efficient splicing. Alternatively, such conserved sequences might be remnants of proto-splice sites, which might have existed early in eukaryotic genes and served as the targets for the insertion of introns, as has been proposed by the introns-late theory. The distribution of intron phases, the position of the intron within a codon, is biased with an over-representation of phase 0 introns. Could any distribution of proto-splice sites account for today's intron phase distribution? Here, we examine the dicodon usage in six model organisms, based on current sequences in the GenBank database, and predict the phase distribution that would be expected if introns had been inserted into proto-splice sites. However, these predictions differ between the various model organisms and disagree with the observed intron phase distributions. Thus, we reject the hypothesis that introns are inserted into hypothetical proto-splice sites. Finally, we analyze the sequences around the splice sites of introns in all six of the species to show that the actual conservation of sequence in exon regions near introns is very small and differs considerably between these species, which is inconsistent with a general proto-splice sites model.
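The prediction step described above can be illustrated with a minimal sketch. It is not the authors' procedure, which works from dicodon usage tabulated from GenBank. Here a hypothetical proto-splice motif is simply counted in in-frame coding sequences, and each occurrence is assigned the phase of its insertion point (phase 0 between codons, phase 1 after the first codon base, phase 2 after the second). The motif `AGG`, the cut position, and the function name `expected_phase_distribution` are assumptions made for illustration only.

```python
from collections import Counter


def expected_phase_distribution(coding_seqs, motif="AGG", cut=2):
    """Estimate the intron phase distribution expected if introns were
    inserted into every occurrence of a hypothetical proto-splice motif.

    coding_seqs: iterable of coding sequences, each starting in frame
                 (index 0 is the first base of a codon).
    motif:       assumed proto-splice sequence (illustrative, not from the paper).
    cut:         number of motif bases lying upstream of the insertion point,
                 e.g. cut=2 models AG|G.
    """
    counts = Counter({0: 0, 1: 0, 2: 0})
    for seq in coding_seqs:
        seq = seq.upper()
        start = seq.find(motif)
        while start != -1:
            # Number of coding bases that precede the insertion point.
            bases_before = start + cut
            # 0 -> between codons, 1 -> after first base, 2 -> after second base.
            counts[bases_before % 3] += 1
            # Step by one so overlapping occurrences are also counted.
            start = seq.find(motif, start + 1)
    total = sum(counts.values())
    if total == 0:
        return {0: 0.0, 1: 0.0, 2: 0.0}
    return {phase: counts[phase] / total for phase in (0, 1, 2)}


if __name__ == "__main__":
    # Toy sequences, invented for the example; real input would be
    # annotated coding sequences from a database such as GenBank.
    toy = ["ATGCAGGTTTAA", "ATGAGGCCCTAA", "ATAGGCTAA", "ATGCAGGTTTAA"]
    print(expected_phase_distribution(toy))
```

A predicted distribution obtained this way for each organism could then be compared with the observed intron phase frequencies, which is the comparison on which the abstract's rejection of the proto-splice site hypothesis rests.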