Gazave Elodie, Marqués-Bonet Tomàs, Fernando Olga, Charlesworth Brian, Navarro Arcadi
Unitat de Biologia Evolutiva, Departament de Ciències Experimentals i de la Salut, Universitat Pompeu Fabra, Carrer Dr Aiguader 88, 08003 Barcelona, Catalonia, Spain.
Genome Biol. 2007;8(2):R21. doi: 10.1186/gb-2007-8-2-r21.
Introns, which constitute the largest fraction of eukaryotic genes and which had been considered to be neutral sequences, are increasingly acknowledged as having important functions. Several studies have investigated levels of evolutionary constraint along introns and across classes of introns of different length and location within genes. However, thus far these studies have yielded contradictory results.
We present the first analysis of human-chimpanzee intron divergence, in which differences in the number of substitutions per intronic site (Ki) can be interpreted as the footprint of different intensities and directions of the pressures of natural selection. Our main findings are as follows: there was a strong positive correlation between intron length and divergence; there was a strong negative correlation between intron length and GC content; and divergence rates vary along introns and depending on their ordinal position within genes (for instance, first introns are more GC rich, longer and more divergent, and divergence is lower at the 3' and 5' ends of all types of introns).
We show that the higher divergence of first introns is related to their larger size. Also, the lower divergence of short introns suggests that they may harbor a relatively greater proportion of regulatory elements than long introns. Moreover, our results are consistent with the presence of functionally relevant sequences near the 5' and 3' ends of introns. Finally, our findings suggest that other parts of introns may also be under selective constraints.
内含子构成了真核基因的最大部分,曾被认为是中性序列,但现在越来越被认为具有重要功能。多项研究调查了内含子之间以及基因内不同长度和位置的内含子类别之间的进化约束水平。然而,迄今为止这些研究得出了相互矛盾的结果。
我们首次对人类与黑猩猩的内含子差异进行了分析,其中每个内含子位点的替换数(Ki)差异可被解释为自然选择压力不同强度和方向的印记。我们的主要发现如下:内含子长度与差异之间存在强烈的正相关;内含子长度与GC含量之间存在强烈的负相关;差异率沿内含子变化,并取决于它们在基因中的顺序位置(例如,第一个内含子富含GC、更长且差异更大,并且所有类型内含子的3'和5'末端差异较低)。
我们表明第一个内含子的较高差异与其较大的尺寸有关。此外,短内含子的较低差异表明它们可能比长内含子含有相对更大比例的调控元件。而且,我们的结果与内含子5'和3'末端附近存在功能相关序列一致。最后,我们的发现表明内含子的其他部分也可能受到选择约束。