Whittaker John C, Harbord Roger M, Boxall Nicola, Mackay Ian, Dawson Gary, Sibly Richard M
School of Applied Statistics, University of Reading, United Kingdom.
Genetics. 2003 Jun;164(2):781-7. doi: 10.1093/genetics/164.2.781.
Microsatellites are widely used in genetic analyses, many of which require reliable estimates of microsatellite mutation rates, yet the factors determining mutation rates are uncertain. The most straightforward and conclusive method by which to study mutation is direct observation of allele transmissions in parent-child pairs, and studies of this type suggest a positive, possibly exponential, relationship between mutation rate and allele size, together with a bias toward length increase. Except for microsatellites on the Y chromosome, however, previous analyses have not made full use of available data and may have introduced bias: mutations have been identified only where child genotypes could not be generated by transmission from parents' genotypes, so that the probability that a mutation is detected depends on the distribution of allele lengths and varies with allele length. We introduce a likelihood-based approach that has two key advantages over existing methods. First, we can make formal comparisons between competing models of microsatellite evolution; second, we obtain asymptotically unbiased and efficient parameter estimates. Application to data composed of 118,866 parent-offspring transmissions of AC microsatellites supports the hypothesis that mutation rate increases exponentially with microsatellite length, with a suggestion that contractions become more likely than expansions as length increases. This would lead to a stationary distribution for allele length maintained by mutational balance. There is no evidence that contractions and expansions differ in their step size distributions.
微卫星广泛应用于遗传分析,其中许多分析需要对微卫星突变率进行可靠估计,但决定突变率的因素尚不确定。研究突变最直接、最具决定性的方法是直接观察亲子对中的等位基因传递,这类研究表明突变率与等位基因大小之间呈正相关,可能是指数关系,同时存在长度增加的偏向性。然而,除了Y染色体上的微卫星外,以往的分析并未充分利用现有数据,可能还引入了偏差:仅在无法通过父母基因型传递产生子代基因型的情况下才识别出突变,因此检测到突变的概率取决于等位基因长度的分布,并随等位基因长度而变化。我们引入了一种基于似然性的方法,与现有方法相比有两个关键优势。第一,我们可以对微卫星进化的竞争模型进行正式比较;第二,我们获得渐近无偏且有效的参数估计。应用于由118,866个AC微卫星亲子传递组成的数据,支持了突变率随微卫星长度呈指数增加的假设,同时表明随着长度增加,收缩比扩张更有可能发生。这将导致由突变平衡维持的等位基因长度的稳定分布。没有证据表明收缩和扩张在步长分布上存在差异。