Collins Lesley, Penny David
Allan Wilson Centre for Molecular Ecology and Evolution, Massey University, Palmerston North, New Zealand.
Mol Biol Evol. 2006 May;23(5):901-10. doi: 10.1093/molbev/msj084. Epub 2005 Dec 21.
Recent studies indicate that many introns, as well as the complex spliceosomal mechanism to remove them, were present early in eukaryotic evolution. This study examines intron and exon characteristics from annotations of whole genomes to investigate the intron recognition mechanism. Exon definition uses the exon as the unit of recognition, placing length constraints on the exon but not on the intron (allowing it a greater range of lengths). In contrast, intron definition uses the intron itself as the unit of recognition and thus removes constraints on internal exon length forced by the use of an exon definition mechanism. Thus, intron and exon lengths within a genome can reflect the constraints imposed by its splicing. This study shows that it is possible firstly to recover valid intron and exon information from genome annotation. We then compare internal intron and exon information from a range of eukaryotic genomes and investigate possible evolutionary length constraints on introns and exons and how they can impact on the intron recognition mechanism. Results indicate that exon definition-based mechanisms may predominate in vertebrates although the exact system in fish is expected to show some differences with the better characterized system from mammals. We also raise the possibility that the last common ancestor of plants and animals contained some type of exon definition and that this mechanism was replaced in some genes and lineages by intron definition, possibly as a result of intron loss and/or intron shortening.
近期研究表明,许多内含子以及去除它们的复杂剪接体机制在真核生物进化早期就已存在。本研究从全基因组注释中考察内含子和外显子特征,以探究内含子识别机制。外显子定义以外显子作为识别单位,对外显子长度加以限制,而对内含子则无此限制(使其长度范围更大)。相比之下,内含子定义以内含子自身作为识别单位,从而消除了因使用外显子定义机制而对外显子内部长度产生的限制。因此,基因组内的内含子和外显子长度能够反映其剪接所施加的限制。本研究表明,首先有可能从基因组注释中恢复有效的内含子和外显子信息。接着,我们比较了一系列真核生物基因组的内部内含子和外显子信息,研究内含子和外显子可能存在的进化长度限制以及它们如何影响内含子识别机制。结果表明,基于外显子定义的机制在脊椎动物中可能占主导地位,尽管鱼类的确切系统预计会与特征更明确的哺乳动物系统存在一些差异。我们还提出一种可能性,即动植物的最后共同祖先包含某种类型的外显子定义,并且这种机制在某些基因和谱系中被内含子定义所取代,这可能是内含子丢失和/或内含子缩短的结果。