Iida Y, Sasaki F
J Biochem. 1983 Dec;94(6):1731-8. doi: 10.1093/oxfordjournals.jbchem.a134524.
Most genes of higher eukaryotes are discontinuous. The DNA which codes for a protein is interrupted by introns. In the nucleus, certain machinery is supposed to recognize the exon-intron and intron-exon junctions. Applying computer searching to eighteen genes from various biological species, we examined what kinds of patterns or nucleotide sequences are necessary and sufficient to recognize the splice junctions. We propose that four common patterns of AG/GTA, /GTAAGT, RG/GTGAG and AG/GTXXGT, where R = A or G and X = A, T, G, or C, are often used as signals for exon-intron junctions. This proposal is based on the facts that they are not found in the exons of the genes examined and on the assumption that the recognition machinery scans the mRNA precursor from the 5'-end to the 3'-end.
大多数高等真核生物的基因是不连续的。编码蛋白质的DNA被内含子打断。在细胞核中,某些机制应该能够识别外显子-内含子和内含子-外显子的连接处。通过对来自不同生物物种的18个基因进行计算机搜索,我们研究了识别剪接连接处需要哪些模式或核苷酸序列,以及这些模式或序列是否足够。我们提出,AG/GTA、/GTAAGT、RG/GTGAG和AG/GTXXGT这四种常见模式(其中R = A或G,X = A、T、G或C)常被用作外显子-内含子连接处的信号。这一观点基于以下事实:在所研究基因的外显子中未发现这些模式,并且基于这样的假设,即识别机制从5'端到3'端扫描mRNA前体。