Tajima F, Nei M
J Mol Evol. 1982;18(2):115-20. doi: 10.1007/BF01810830.
A mathematical formula for the relationship between the average number of nucleotide substitutions per site and the proportion of shared restriction sites between two homologous nucleons is developed by taking into account the unequal rates of substitution among different pairs of nucleotides. Using this formula, the possible amount of bias of the estimate of the number of nucleotide substitutions obtained by the Upholt-Nei-Li formula for restriction site data is investigated. The results obtained indicate that the bias depends upon the nucleotides in the recognition sequence of the restriction enzyme used, the unequal rates of substitution among different nucleotides, and the unequal nucleotide frequencies, but the primary factor is the unequal rates of nucleotide substitution. The amount of bias is generally larger for four-base enzymes than for six-base enzymes. However, when many restriction enzymes are used for the study of DNA divergence, the bias is unlikely to be very large unless the rate of substitution greatly varies from nucleotide to nucleotide.
通过考虑不同核苷酸对之间替换速率的不均等,推导了一个关于每个位点核苷酸替换平均数与两个同源核酸之间共享限制位点比例关系的数学公式。利用这个公式,研究了通过Upholt-Nei-Li公式从限制位点数据获得的核苷酸替换数估计值可能存在的偏差量。所得结果表明,偏差取决于所用限制酶识别序列中的核苷酸、不同核苷酸之间替换速率的不均等以及核苷酸频率的不均等,但主要因素是核苷酸替换速率的不均等。对于四碱基酶,偏差量通常比六碱基酶更大。然而,当使用多种限制酶研究DNA分歧时,除非核苷酸之间的替换速率差异极大,否则偏差不太可能非常大。