Tajima F
Department of Biology, Kyushu University, Fukuoka, Japan.
Genetics. 1989 Nov;123(3):585-95. doi: 10.1093/genetics/123.3.585.
The relationship between two estimates of genetic variation at the DNA level, namely the number of segregating sites and the average number of nucleotide differences estimated from pairwise comparison, is investigated. The correlation between these two estimates is found to be large when the sample size is small, and to decrease slowly as the sample size increases. Using this relationship, a statistical method for testing the neutral mutation hypothesis is developed. The method requires only DNA polymorphism data, namely the genetic variation within a population at the DNA level. A simple computer simulation method, used to obtain the distribution of the new statistic, is also presented. Applying the statistical method to five regions of DNA sequence in Drosophila melanogaster, it is found that large insertions/deletions (greater than 100 bp) are deleterious. It is suggested that natural selection against large insertions/deletions is so weak that a large amount of variation is maintained in a population.
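The abstract does not give the formula, so the following is only a minimal sketch of the statistic as it is commonly stated (Tajima's D): the difference between the average number of pairwise nucleotide differences and the number of segregating sites scaled by a sample-size constant, divided by an estimate of its standard deviation. The function name `tajimas_d`, the input format (a list of equal-length aligned strings) and the handling of edge cases are assumptions made for illustration; gaps and missing data are not treated specially.

```python
from collections import Counter
from math import sqrt


def tajimas_d(seqs):
    """Return (S, pi, D) for a list of aligned, equal-length sequences.

    S  : number of segregating sites
    pi : average number of pairwise nucleotide differences
    D  : Tajima's D, or None when it is undefined (no segregating sites)
    Illustrative sketch: every character is treated as an ordinary allele.
    """
    n = len(seqs)
    if n < 4:
        # For n = 3 the variance constants are zero, so D is undefined.
        raise ValueError("need at least 4 sequences")
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("sequences must be aligned to equal length")

    seg_sites = 0
    total_diffs = 0  # differences summed over all pairs of sequences
    for column in zip(*seqs):
        counts = Counter(column)
        if len(counts) > 1:
            seg_sites += 1
            # Number of differing pairs at this site = all pairs - identical pairs.
            same = sum(c * (c - 1) // 2 for c in counts.values())
            total_diffs += n * (n - 1) // 2 - same

    pi = total_diffs / (n * (n - 1) / 2)
    if seg_sites == 0:
        return seg_sites, pi, None

    # Sample-size constants in the commonly cited form of the statistic.
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i**2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1**2
    e1 = c1 / a1
    e2 = c2 / (a1**2 + a2)

    variance = e1 * seg_sites + e2 * seg_sites * (seg_sites - 1)
    if variance <= 0:
        return seg_sites, pi, None
    d = (pi - seg_sites / a1) / sqrt(variance)
    return seg_sites, pi, d


if __name__ == "__main__":
    # Toy alignment, invented for illustration only.
    sample = ["AAAA", "AAAT", "AATT", "ATTT"]
    s, pi, d = tajimas_d(sample)
    print(f"S={s}, pi={pi:.4f}, D={d:.4f}")
```

Under this form of the statistic, an excess of rare variants (for example, many singletons) makes D negative, while an excess of intermediate-frequency variants makes it positive. Judging significance still requires the distribution of the statistic, which the paper obtains by computer simulation.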