Mugal Carina F, von Grünberg Hans-Hennig, Peifer Martin
Institute of Chemistry, Karl-Franzens University Graz, Graz, Austria.
Mol Biol Evol. 2009 Jan;26(1):131-42. doi: 10.1093/molbev/msn245. Epub 2008 Oct 29.
If substitution rates are not the same on the two complementary DNA strands, a substitution is considered strand asymmetric. Such substitutional strand asymmetries are determined here for the three most frequent types of substitution on the human genome (C --> T, A --> G, and G --> T). Substitution rate differences between both strands are estimated for 4,590 human genes by aligning all repeats occurring within the introns with their ancestral consensus sequences. For 1,630 of these genes, both coding strand and noncoding strand rates could be compared with rates in gene-flanking regions. All three rates considered are found to be on average higher on the coding strand and lower on the transcribed strand in comparison to their values in the gene-flanking regions. This finding points to the simultaneous action of rate-increasing effects on the coding strand--such as increased adenine and cytosine deamination--and transcription-coupled repair as a rate-reducing effect on the transcribed strand. The common behavior of the three rates leads to strong correlations of the rate asymmetries: Whenever one rate is strand biased, the other two rates are likely to show the same bias. Furthermore, we determine all three rate asymmetries as a function of time: the A --> G and G --> T rate asymmetries are both found to be constant in time, whereas the C --> T rate asymmetry shows a pronounced time dependence, an observation that explains the difference between our results and those of an earlier work by Green et al. (2003. Transcription-associated mutational asymmetry in mammalian evolution. Nat Genet. 33:514-517.). Finally, we show that in addition to transcription also the replication process biases the substitution rates in genes.
如果两条互补DNA链上的替换率不同,则该替换被认为是链不对称的。本文确定了人类基因组中三种最常见的替换类型(C→T、A→G和G→T)的这种替换链不对称性。通过将内含子中出现的所有重复序列与其祖先共有序列进行比对,估计了4590个人类基因两条链之间的替换率差异。对于其中1630个基因,编码链和非编码链的速率都可以与基因侧翼区域的速率进行比较。与基因侧翼区域的值相比,发现所有考虑的三种速率在编码链上平均较高,而在转录链上较低。这一发现表明,编码链上存在速率增加效应(如腺嘌呤和胞嘧啶脱氨增加)的同时作用,以及转录偶联修复作为转录链上的速率降低效应。这三种速率的共同行为导致速率不对称性之间存在强相关性:每当一种速率存在链偏好时,其他两种速率可能会表现出相同的偏好。此外,我们确定了所有三种速率不对称性随时间的变化:发现A→G和G→T速率不对称性在时间上都是恒定的,而C→T速率不对称性表现出明显的时间依赖性,这一观察结果解释了我们的结果与Green等人(2003年。哺乳动物进化中与转录相关的突变不对称性。《自然遗传学》。33:514 - 517)早期工作结果之间的差异。最后,我们表明,除了转录之外,复制过程也会使基因中的替换率产生偏差。