Lobry Jean R, Sueoka Noboru
Laboratoire BBE CNRS UMR 5558, Université Claude Bernard, 43 Bd du 11 Novembre 1918, F-69622 Villeurbanne cedex, France.
Genome Biol. 2002 Sep 26;3(10):RESEARCH0058. doi: 10.1186/gb-2002-3-10-research0058.
When there are no strand-specific biases in mutation and selection rates (that is, in the substitution rates) between the two strands of DNA, the average nucleotide composition is theoretically expected to be A = T and G = C within each strand. Deviations from these equalities are therefore evidence for an asymmetry in selection and/or mutation between the two strands. By focusing on weakly selected regions that could be oriented with respect to replication in 43 out of 51 completely sequenced bacterial chromosomes, we have been able to detect asymmetric directional mutation pressures.
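As an illustration of how deviations from A = T and G = C within one strand can be quantified, the following minimal sketch computes two normalized skew indices from the base counts of a single strand. The function names, the toy sequence, and the particular normalization, (G − C)/(G + C) and (A − T)/(A + T), are assumptions made for this example and are not necessarily the indices or sign conventions used by the authors.

```python
from collections import Counter

def skews(seq):
    """Return (GC skew, AT skew) for one strand read 5'->3'.

    Both values are 0 when G = C and A = T hold exactly within the strand.
    The normalization is an assumption of this sketch, not the paper's method.
    """
    counts = Counter(seq.upper())
    a, t, g, c = (counts[base] for base in "ATGC")
    gc_skew = (g - c) / (g + c) if g + c else 0.0
    at_skew = (a - t) / (a + t) if a + t else 0.0
    return gc_skew, at_skew

def reverse_complement(seq):
    """Return the complementary strand, also read 5'->3'."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]

# Hypothetical toy sequence, invented for illustration only (not real data).
leading = "GGTGTTGCGTTAGGTCTG"
print(skews(leading))                      # G > C and T > A on this strand
print(skews(reverse_complement(leading)))  # same magnitudes, opposite signs
```

Because complementation exchanges the counts of G with C and of A with T, the skews of the complementary strand are the exact negatives of those computed here; a nonzero value on one strand therefore implies an equal and opposite deviation on the other.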
Most of the 43 chromosomes were found to be relatively enriched in G over C and T over A, and slightly depleted in G+C, in their weakly selected positions (intergenic regions and third codon positions) in the leading strand compared with the lagging strand. Deviations from A = T and G = C were highly correlated between third codon positions and intergenic regions, with a lower degree of deviation in intergenic regions, and were not correlated with overall genomic G+C content.
During the course of bacterial chromosome evolution, the effects of asymmetric directional mutation pressures are commonly observed in weakly selected positions. The degree of deviation from equality is highly variable among species, and within species is higher in third codon positions than in intergenic regions. The orientation of these effects is almost universal and is compatible in most cases with the hypothesis of an excess of cytosine deamination in the single-stranded state during DNA replication. However, the variation in G+C content between species is influenced by factors other than asymmetric mutation pressure.