Tajima F
Genetics. 1983 Oct;105(2):437-60. doi: 10.1093/genetics/105.2.437.
To aid the analysis and interpretation of DNA polymorphism data obtained by DNA sequencing or restriction enzyme techniques, a mathematical theory of the expected evolutionary relationship among sampled DNA sequences (nucleons) is developed under the assumption that the evolutionary change of nucleons is determined solely by mutation and random genetic drift. Using this theory, the statistical properties of the number of nucleotide differences between randomly chosen nucleons and of heterozygosity or nucleon diversity are investigated. These studies indicate that the estimates of the average number of nucleotide differences and of nucleon diversity have a large variance, and that a large part of this variance is due to stochastic factors. Therefore, increasing the sample size does not reduce the variance significantly. The distribution of sample allele (nucleomorph) frequencies is also studied, and it is shown that a small sample is sufficient to reveal the distribution pattern.
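As an illustration of the two sample statistics discussed in the abstract, the following Python sketch computes the average number of pairwise nucleotide differences and the nucleon diversity for a small set of aligned sequences. It is a minimal example and not code from the paper: the function names and the toy sequences are hypothetical, and the diversity estimate assumes the commonly used sample estimator n/(n-1) * (1 - sum of squared nucleomorph frequencies).

```python
from collections import Counter
from itertools import combinations


def pairwise_differences(a, b):
    """Count the sites at which two aligned sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b))


def mean_pairwise_differences(seqs):
    """Average number of nucleotide differences over all pairs in the sample."""
    pairs = list(combinations(seqs, 2))
    if not pairs:
        raise ValueError("need at least two sequences")
    return sum(pairwise_differences(a, b) for a, b in pairs) / len(pairs)


def nucleon_diversity(seqs):
    """Sample nucleon diversity.

    Assumption: uses the estimator n/(n-1) * (1 - sum(x_i^2)), where x_i is
    the sample frequency of the i-th distinct sequence (nucleomorph).
    """
    n = len(seqs)
    if n < 2:
        raise ValueError("need at least two sequences")
    counts = Counter(seqs)
    homozygosity = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1 - homozygosity)


# Hypothetical toy sample of four aligned nucleons (illustrative data only).
sample = ["ACGTACGT", "ACGTACGA", "ACGTACGT", "TCGTACGA"]

print("mean pairwise differences:", mean_pairwise_differences(sample))
print("nucleon diversity:", nucleon_diversity(sample))
```

Both functions return point estimates only. The abstract's main point is that such estimates carry a large variance, much of it stochastic and not removed by sampling more sequences, so a single value from one sample should be interpreted with caution.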