Barrette I H, McKenna S, Taylor D R, Forsdyke D R
Department of Biochemistry, Queen's University, Kingston, K7L3N6, Ontario, Canada.
Gene. 2001 May 30;270(1-2):181-9. doi: 10.1016/s0378-1119(01)00477-2.
Many eukaryotic genes are split into exons and introns, the latter being removed post-transcriptionally so that only exon sequences appear in cytoplasmic RNAs. Since introns appear in both protein-encoding RNAs and non-protein-coding RNAs, they interrupt genetic information per se, not just protein-encoding information. A DNA sequence has the potential to carry more than one type of genetic information, but different types may conflict. Thus, it has been proposed that introns arose because sequences were unable to contain concomitantly complete information for the encoding both of stem-loops and of cytoplasmic products (protein and/or RNA). Stem-loop potential is held to be selectively advantageous since it promotes the recombination-dependent correction of genetic errors. Stem-loop potential, the best local measure of which is base order-dependent stem-loop potential, tends to be less in exons than in introns. This is particularly evident in genes evolving rapidly under positive Darwinian selection, where the protein-encoding function is dominant. Evidence is now presented that the rare regions where genes overlap also impose excessive encoding demands so that the concomitant coding of base order-dependent stem-loop potential is decreased. Our results are consistent with the hypothesis that sequences with high stem-loop potential arose in the early 'RNA world'. Ancestors of modern genes would have entered this world when sequences (exons) encoding cytoplasmic products, were interspersed with sequences (introns) encoding selectively advantageous stem-loops. Purine-loading pressure would also have favoured intron formation.
许多真核基因被分割成外显子和内含子,内含子在转录后被去除,因此只有外显子序列出现在细胞质RNA中。由于内含子既出现在蛋白质编码RNA中,也出现在非蛋白质编码RNA中,它们打断的是遗传信息本身,而不仅仅是蛋白质编码信息。一个DNA序列有可能携带不止一种类型的遗传信息,但不同类型的信息可能会相互冲突。因此,有人提出内含子的出现是因为序列无法同时包含完整的茎环编码信息和细胞质产物(蛋白质和/或RNA)编码信息。茎环潜力被认为具有选择优势,因为它促进了依赖重组的遗传错误校正。茎环潜力,其最佳的局部衡量标准是碱基顺序依赖的茎环潜力,在外显子中往往比在内含子中要低。这在正达尔文选择下快速进化的基因中尤为明显,在这些基因中蛋白质编码功能占主导地位。现在有证据表明,基因重叠的罕见区域也会带来过多的编码需求,从而导致碱基顺序依赖的茎环潜力的伴随编码减少。我们的结果与以下假设一致:具有高茎环潜力的序列出现在早期的“RNA世界”。现代基因的祖先在编码细胞质产物的序列(外显子)与编码具有选择优势的茎环的序列(内含子)相互穿插时进入了这个世界。嘌呤加载压力也有利于内含子的形成。