Chiaromonte F, Yang S, Elnitski L, Yap V B, Miller W, Hardison R C
Department of Statistics, Pennsylvania State University, University Park, PA 16802, USA.
Proc Natl Acad Sci U S A. 2001 Dec 4;98(25):14503-8. doi: 10.1073/pnas.251423898. Epub 2001 Nov 20.
The amount of noncoding genomic DNA sequence that aligns between human and mouse varies substantially in different regions of their genomes, and the amount of repetitive DNA also varies. In this report, we show that divergence in noncoding nonrepetitive DNA is strongly correlated with the amount of repetitive DNA in a region. We investigated aligned DNA in four large genomic regions with finished human sequence and almost or completely finished mouse sequence. These regions, totaling 5.89 Mb of DNA, are on different chromosomes and vary in their base composition. An analysis based on sliding windows of 10 kb shows that the fraction of aligned noncoding nonrepetitive DNA and the fraction of repetitive DNA are negatively correlated, both at the level of an entire region and locally within it. This conclusion is strongly supported by a randomization study, in which repetitive elements are removed and randomly relocated along the sequences. Thus, regions of noncoding genomic DNA that accumulated fewer point mutations since the primate-rodent divergence also suffered fewer retrotransposition events. These results indicate that some regions of the genome are more "flexible" over the time scale of mammalian evolution, being able to accommodate many point mutations and insertions, whereas other regions are more "rigid" and accumulate fewer changes. Stronger conservation is generally interpreted as indicating more extensive or more important function. The evidence presented here of correlated variation in the rates of different evolutionary processes across noncoding DNA must be considered in assessing such conservation for evidence of selection.
人类和小鼠基因组之间可比对的非编码基因组DNA序列数量在其基因组的不同区域有很大差异,重复DNA的数量也各不相同。在本报告中,我们表明非编码非重复DNA的差异与一个区域内重复DNA的数量密切相关。我们研究了人类基因组序列已完成、小鼠基因组序列几乎或完全完成的四个大型基因组区域中的比对DNA。这些区域的DNA总量为5.89 Mb,位于不同的染色体上,碱基组成也各不相同。基于10 kb滑动窗口的分析表明,在整个区域及其局部水平上,比对的非编码非重复DNA的比例与重复DNA的比例呈负相关。一项随机化研究有力地支持了这一结论,在该研究中,重复元件被去除并沿序列随机重新定位。因此,自灵长类动物与啮齿动物分化以来积累较少点突变的非编码基因组DNA区域,反转录转座事件也较少。这些结果表明,在哺乳动物进化的时间尺度上,基因组的某些区域更“灵活”,能够容纳许多点突变和插入,而其他区域则更“刚性”,积累的变化较少。更强的保守性通常被解释为表明功能更广泛或更重要。在评估这种保守性以寻找选择证据时,必须考虑这里提出的确凿证据,即不同进化过程在非编码DNA中的速率存在相关变化。