McDonald J H
Department of Biology, University of Delaware, Newark, USA.
Mol Biol Evol. 1998 Apr;15(4):377-84. doi: 10.1093/oxfordjournals.molbev.a025934.
The neutral theory of molecular evolution predicts that the ratio of polymorphisms to fixed differences should be fairly uniform across a region of DNA sequence. Significant heterogeneity in this ratio can indicate the effects of balancing selection, selective sweeps, mildly deleterious mutations, or background selection. Comparing an observed heterogeneity statistic with simulations of the heterogeneity resulting from random phylogenetic and sampling variation provides a test of the statistical significance of the observed pattern. When simulated data sets containing heterogeneity in the polymorphism-to-divergence ratio are examined, different statistics are most powerful for detecting different patterns of heterogeneity. The number of runs is most powerful for detecting patterns containing several peaks of polymorphism; the Kolmogorov-Smirnov statistic is most powerful for detecting patterns in which one end of the gene has high polymorphism and the other end has low polymorphism; and a newly developed statistic, the mean sliding G statistic, is most powerful for detecting patterns containing one or two peaks of polymorphism with reduced polymorphism on either side. Nine out of 27 genes from the Drosophila melanogaster subgroup exhibit heterogeneity that is significant under at least one of these three tests, with five of the nine remaining significant after a correction for multiple comparisons, suggesting that detectable evidence for the effects of some kind of selection is fairly common.
分子进化的中性理论预测,多态性与固定差异的比率在一段DNA序列区域内应相当均匀。该比率的显著异质性可能表明平衡选择、选择性清除、轻度有害突变或背景选择的影响。将观察到的异质性统计量与由随机系统发育和抽样变异产生的异质性模拟进行比较,可对观察到的模式的统计显著性进行检验。当检查包含多态性与分化率异质性的模拟数据集时,不同的统计量对检测不同的异质性模式最为有效。游程数对检测包含多个多态性峰值的模式最为有效;柯尔莫哥洛夫-斯米尔诺夫统计量对检测基因一端具有高多态性而另一端具有低多态性的模式最为有效;一种新开发的统计量,即平均滑动G统计量,对检测包含一两个多态性峰值且两侧多态性降低的模式最为有效。黑腹果蝇亚组的27个基因中有9个表现出异质性,在这三种测试中的至少一种下具有显著性,其中9个中的5个在进行多重比较校正后仍具有显著性,这表明某种选择效应的可检测证据相当普遍。