Center for Comparative Genomics and Bioinformatics, The Pennsylvania State University, University Park, PA 16802, USA.
Genome Biol. 2008 Apr 30;9(4):R76. doi: 10.1186/gb-2008-9-4-r76.
The evolutionary distance between human and macaque is particularly attractive for investigating local variation in neutral substitution rates, because substitutions can be inferred more reliably than in comparisons with rodents and are less influenced by the effects of current and ancient diversity than in comparisons with closer primates. Here we investigate the human-macaque neutral substitution rate as a function of a number of genomic parameters.
Using regression analyses we find that male mutation bias, male (but not female) recombination rate, distance to telomeres and substitution rates computed from orthologous regions in mouse-rat and dog-cow comparisons are prominent predictors of the neutral rate. Additionally, we demonstrate that the previously observed biphasic relationship between neutral rate and GC content can be accounted for by properly combining rates at CpG and non-CpG sites. Finally, we find the neutral rate to be negatively correlated with the densities of several classes of computationally predicted functional elements, and less so with the densities of certain classes of experimentally verified functional elements.
Our results suggest that while female recombination may be mainly responsible for driving evolution in GC content, male recombination may be mutagenic, and that other mutagenic mechanisms acting near telomeres, and mechanisms whose effects are shared across mammalian genomes, play significant roles. We also have evidence that the nonlinear increase in rates at high GC levels may be largely due to hyper-mutability of CpG dinucleotides. Finally, our results suggest that the performance of conservation-based prediction methods can be improved by accounting for neutral rates.
人类和猕猴之间的进化距离特别适合研究中性替换率的局部变化,因为与啮齿动物相比,可以更可靠地推断出替换,并且与更接近的灵长类动物相比,受当前和古代多样性影响的程度更小。在这里,我们研究了人类与猕猴之间的中性替换率与许多基因组参数的关系。
通过回归分析,我们发现雄性突变偏向、雄性(而非雌性)重组率、与端粒的距离以及从鼠-鼠和犬-牛比较中同源区计算得出的替换率是中性率的主要预测因子。此外,我们证明了先前观察到的中性率与 GC 含量之间的双相关系可以通过正确组合 CpG 和非 CpG 位点的速率来解释。最后,我们发现中性率与几类计算预测的功能元件的密度呈负相关,而与某些类别的实验验证的功能元件的密度相关性较低。
我们的结果表明,虽然雌性重组可能主要负责驱动 GC 含量的进化,但雄性重组可能具有诱变作用,而在端粒附近起作用的其他诱变机制以及在哺乳动物基因组中共享作用的机制则起着重要作用。我们还有证据表明,在高 GC 水平下,速率的非线性增加可能主要是由于 CpG 二核苷酸的超突变性。最后,我们的结果表明,通过考虑中性速率,可以提高基于保守性的预测方法的性能。