Strelets V B, Lim H A
Supercomputer Computations Research Institute, Florida State University, Tallahassee 32306-4052, USA.
Biosystems. 1995;36(1):37-41. doi: 10.1016/0303-2647(95)01525-p.
Splice junction shadows (ancient exon-exon junctions) presumably reflect the existence of amino acid primary blocks which were used in the course of evolution for the construction of new proteins. The lengths of such blocks (i.e. regions between splice junctions), as the lengths of corresponding inserted or duplicated ancient exons, should be divisible by three in order to store the preexisting coding frame in the course of evolution. In this paper, we will test the hypothesis of intron-mediated recombination in a model of block molecular evolution (exon shuffling) by revealing corresponding blocks in existing database-contained coding sequences. For this purpose, we use a weight matrix prediction of ancient splice junction shadows in coding regions of the nucleotide sequences in current databases. The usage of splice junction shadows allows us to test the block evolution hypothesis in better detail in comparison with previous methods which were based only on currently existing recent exons. Our result of block length distribution at the nucleotide level shows a clear tendency to be divisible by three. At the protein level, several unexpected favorable block lengths, which are six, nine, 12 and 15 amino acids in length, were observed. Further refinements in our method for revealing splice junction shadows (structural block boundaries) might reveal peptides which probably maintain stable folds in different structures. The latter can in turn be used for protein structure prediction.
剪接连接阴影(古老的外显子-外显子连接)大概反映了氨基酸初级模块的存在,这些模块在进化过程中被用于构建新蛋白质。这些模块的长度(即剪接连接之间的区域),如同相应插入或重复的古老外显子的长度一样,应该能被三整除,以便在进化过程中保存预先存在的编码框架。在本文中,我们将通过揭示现有数据库中编码序列中的相应模块,在模块分子进化(外显子洗牌)模型中检验内含子介导的重组假说。为此,我们使用权重矩阵预测当前数据库中核苷酸序列编码区域的古老剪接连接阴影。与仅基于当前存在的近期外显子的先前方法相比,剪接连接阴影的使用使我们能够更详细地检验模块进化假说。我们在核苷酸水平上的模块长度分布结果显示出明显的被三整除的趋势。在蛋白质水平上,观察到了几种意想不到的有利模块长度,即长度为六个、九个、十二个和十五个氨基酸。我们揭示剪接连接阴影(结构模块边界)的方法的进一步改进可能会揭示出可能在不同结构中保持稳定折叠的肽段。后者反过来可用于蛋白质结构预测。