Hudson R R
National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina 27709.
Theor Popul Biol. 1989 Aug;36(1):23-33. doi: 10.1016/0040-5809(89)90021-x.
An approximate expression is obtained for the probability that a restriction site, which is polymorphic in a random sample, is a site at which two or more mutations have occurred in the descent to the sample from the most recent common ancestor of the sample. The analysis is based on the assumption that the population from which the sample is obtained is at equilibrium under a selectively neutral Wright-Fisher model. Monte Carlo simulations show that the approximation is quite accurate. For commonly observed levels of genetic variation in humans and in natural populations of Drosophila, it is found that multiple mutations would occur at 5 to 10 percent of polymorphic restriction sites assuming that six-cutter enzymes are used on samples of size 50 to 100. Simulations are also used to investigate the bias and mean square error of four estimators of 4Nu, where N is the population size and u is the neutral mutation rate per nucleotide site. Two of the estimators are biased by approximately 20 percent when levels of variation are similar to those which have been observed in natural populations of Drosophila.
得到了一个近似表达式,用于计算在随机样本中具有多态性的限制酶切位点,是在从样本的最近共同祖先到样本的世系中发生了两个或更多突变的位点的概率。该分析基于这样的假设:所获取样本的群体在选择性中性的赖特 - 费希尔模型下处于平衡状态。蒙特卡罗模拟表明该近似相当准确。对于人类和果蝇自然种群中常见的遗传变异水平,发现假设对大小为50至100的样本使用六碱基切割酶,那么在5%至10%的多态性限制酶切位点会发生多个突变。模拟还用于研究4Nu的四个估计量的偏差和均方误差,其中N是种群大小,u是每个核苷酸位点的中性突变率。当变异水平与在果蝇自然种群中观察到的水平相似时,其中两个估计量存在约20%的偏差。