Urbina Daniel, Tang Bin, Higgs Paul G
Department of Physics and Astronomy, McMaster University, Hamilton, Ontario, Canada.
J Mol Evol. 2006 Mar;62(3):340-61. doi: 10.1007/s00239-005-0051-1. Epub 2006 Feb 13.
The frequencies of A, C, G, and T in mitochondrial DNA vary among species due to unequal rates of mutation between the bases. The frequencies of bases at fourfold degenerate sites respond directly to mutation pressure. At first and second positions, selection reduces the degree of frequency variation. Using a simple evolutionary model, we show that first position sites are less constrained by selection than second position sites and, therefore, that the frequencies of bases at first position are more responsive to mutation pressure than those at second position. We define a measure of distance between amino acids that is dependent on eight measured physical properties and a similarity measure that is the inverse of this distance. Columns 1, 2, 3, and 4 of the genetic code correspond to codons with U, C, A, and G in their second position, respectively. The similarity of amino acids in the four columns decreases systematically from column 1 to column 2 to column 3 to column 4. We then show that the responsiveness of first position bases to mutation pressure is dependent on the second position base and follows the same decreasing trend through the four columns. Again, this shows the correlation between physical properties and responsiveness. We determine a proximity measure for each amino acid, which is the average similarity between an amino acid and all others that are accessible via single point mutations in the mitochondrial genetic code structure. We also define a responsiveness for each amino acid, which measures how rapidly an amino acid frequency changes as a result of mutation pressure acting on the base frequencies. We show that there is a strong correlation between responsiveness and proximity, and that both these quantities are also correlated with the mutability of amino acids estimated from the mtREV substitution rate matrix. We also consider the variation of base frequencies between strands and between genes on a strand. These trends are consistent with the patterns expected from analysis of the variation among genomes.
线粒体DNA中A、C、G和T的频率在不同物种间存在差异,这是由于碱基间的突变率不均等所致。四倍简并位点的碱基频率直接响应突变压力。在第一和第二位点,选择会降低频率变化的程度。通过一个简单的进化模型,我们表明第一位点受选择的限制比第二位点小,因此,第一位点的碱基频率比第二位点的碱基频率对突变压力更敏感。我们定义了一种依赖于八种测量物理性质的氨基酸间距离度量以及一种作为该距离倒数的相似性度量。遗传密码的第1、2、3和4列分别对应于第二位为U、C、A和G的密码子。这四列中氨基酸的相似性从第1列到第2列再到第3列最后到第4列呈系统性降低。然后我们表明第一位碱基对突变压力的响应性取决于第二位碱基,并且在这四列中遵循相同的递减趋势。同样,这显示了物理性质与响应性之间的相关性。我们确定了每个氨基酸的接近度度量,它是一个氨基酸与线粒体遗传密码结构中通过单点突变可及的所有其他氨基酸之间的平均相似性。我们还为每个氨基酸定义了响应性,它衡量氨基酸频率因作用于碱基频率的突变压力而变化的速度。我们表明响应性与接近度之间存在很强的相关性,并且这两个量也与根据mtREV替换率矩阵估计的氨基酸可变性相关。我们还考虑了链间以及链上基因间碱基频率的变化。这些趋势与基因组间变异分析所预期的模式一致。