Centre for the Study of Evolution, School of Life Sciences, University of Sussex, Brighton, United Kingdom.
Mol Biol Evol. 2011 Jan;28(1):63-70. doi: 10.1093/molbev/msq249. Epub 2010 Sep 13.
The McDonald-Kreitman (MK) test is a simple and widely used test of selection in which the numbers of nonsilent and silent substitutions (D(n) and D(s)) are compared with the numbers of nonsilent and silent polymorphisms (P(n) and P(s)). The neutrality index (NI = D(s)P(n)/D(n)P(s)), the odds ratio (OR) of the MK table, measures the direction and degree of departure from neutral evolution. The mean of NI values across genes is often taken to summarize patterns of selection in a species. Here, we show that this leads to statistical bias in both simulated and real data to the extent that species, which show a pattern of adaptive evolution, can apparently be subject to weak purifying selection and vice versa. We show that this bias can be removed by using a variant of the Cochran-Mantel-Haenszel procedure for estimating a weighted average OR. We also show that several point estimators of NI are statistically biased even when cutoff values are employed. We therefore suggest that a new statistic be used to study patterns of selection when data are sparse, the direction of selection: DoS = D(n)/(D(n) + D(s)) - P(n)/(P(n) + P(s)).
麦当劳-克里坦(MK)检验是一种简单而广泛使用的选择检验方法,用于比较非沉默和沉默替换(D(n)和 D(s))的数量与非沉默和沉默多态性(P(n)和 P(s))的数量。中性指数(NI = D(s)P(n)/D(n)P(s))和 MK 表的比值(OR)衡量了偏离中性进化的方向和程度。跨基因的 NI 值平均值通常用于总结物种中的选择模式。在这里,我们表明,这会导致模拟和真实数据中的统计偏差,以至于表现出适应性进化模式的物种可能明显受到弱净化选择的影响,反之亦然。我们表明,可以通过使用 Cochran-Mantel-Haenszel 程序的变体来估计加权平均 OR,从而消除这种偏差。我们还表明,即使使用截止值,几个 NI 的点估计值也存在统计偏差。因此,当数据稀疏时,我们建议使用新的统计量来研究选择模式,即选择的方向:DoS = D(n)/(D(n) + D(s)) - P(n)/(P(n) + P(s))。