CSIRO Mathematics, Informatics and Statistics, St Lucia, QLD, Australia.
Theor Appl Genet. 2010 May;120(8):1653-72. doi: 10.1007/s00122-010-1283-z. Epub 2010 Feb 25.
In sugarcane or other autopolyploids, after generating the data, the first step in constructing molecular marker maps is to determine marker dosage. Improved methods for correctly allocating marker dosage will result in more accurate maps and increased efficiency of QTL linkage detection. When employing dominant markers like AFLPs, single-dose markers represent alleles present as one copy in one parent and null in the other parent, double-dose markers are those present as two copies in one parent and null in the other parent and so on. Observed segregation ratios in the offspring are employed to infer marker dosage in the parent from which the marker was inherited. Commonly, for each marker, a chi (2) test is used to assign dosage. Such an approach does not address important practical considerations such as multiple testing and departures from theoretical assumptions. In particular, extra-binomial variation or overdispersion has been observed in sugarcane studies and standard methods may result in fewer correct dosage allocations than the data warrant. To address these shortcomings, a Bayesian mixture model is proposed where all markers are considered simultaneously. Since analytic solutions are not available, Markov chain Monte Carlo methods are employed. Marker dosage allocation for each individual marker employs the estimated posterior probability of each dosage. For a sugarcane study these methods resulted in more markers being allocated a dosage than by standard approaches. Simulation studies demonstrated that, in general, not only are more markers classified but that more markers are also correctly classified, particularly if overdispersion is present.
在甘蔗或其他同源多倍体中,在生成数据后,构建分子标记图谱的第一步是确定标记剂量。改进正确分配标记剂量的方法将导致更准确的图谱和增加 QTL 连锁检测的效率。当使用显性标记如 AFLP 时,单剂量标记表示在一个亲本中存在一个拷贝,而在另一个亲本中为 null,双剂量标记表示在一个亲本中存在两个拷贝,而在另一个亲本中为 null 等。后代中观察到的分离比用于推断从标记被继承的亲本中的标记剂量。通常,对于每个标记,使用卡方检验 (chi (2)) 来分配剂量。这种方法没有解决重要的实际问题,例如多重检验和偏离理论假设。特别是,在甘蔗研究中已经观察到超二项式变异或过度分散,标准方法可能会导致比数据要求更少的正确剂量分配。为了解决这些缺点,提出了一种贝叶斯混合模型,其中所有标记同时被考虑。由于没有解析解,因此采用马尔可夫链蒙特卡罗方法。对于每个个体标记的标记剂量分配,使用每个剂量的估计后验概率。对于甘蔗研究,这些方法导致比标准方法更多的标记被分配剂量。模拟研究表明,一般来说,不仅有更多的标记被分类,而且有更多的标记被正确分类,特别是如果存在过度分散。