Berg Arthur, He Qiuling, Shen Ye, Chen Ying, Huang Minren, Wu Rongling
Pennsylvania State University and Beijing Forestry University.
Stat Appl Genet Mol Biol. 2010;9:Article 16. doi: 10.2202/1544-6115.1528. Epub 2010 Feb 10.
Multiallelic markers, such as microsatellites, provide a powerful tool for studying the genetic structure and organization of an outcrossing population. However, statistical methods for analyzing multiallelic markers in the current literature are limited in scope because of the complexity introduced by multiple alleles. We present a closed-form EM algorithm framework to estimate the trigenic linkage disequilibrium coefficients of three multiallelic markers, together with joint and separate statistical hypothesis tests of the different linkage disequilibria. Linkage disequilibrium analysis with three multiallelic markers is shown to be considerably more powerful than either a two-marker analysis or a three-marker analysis that treats the multiallelic markers as biallelic. A model with three multiallelic markers was used to analyze marker data from Lycoris longituba, a tulip-like ornamental plant in China, in which each marker had two to four distinct alleles. This algorithm will be useful for studying the pattern of genetic variation in outcrossing populations.
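To make the quantities named in the abstract concrete, the following is a minimal sketch of how digenic and trigenic linkage disequilibrium coefficients can be computed for three multiallelic markers once haplotype frequencies are available. It is not the authors' EM algorithm: it assumes the three-marker haplotype frequencies are already known (for example, as the output of such an estimation step), and it uses the standard three-locus decomposition D_ABC = p_ABC - p_A D_BC - p_B D_AC - p_C D_AB - p_A p_B p_C, which may differ from the parameterization used in the paper. The function name `ld_coefficients` and the nested-list input layout are hypothetical choices made for illustration.

```python
from itertools import product


def ld_coefficients(hap):
    """Digenic and trigenic LD coefficients for three multiallelic markers.

    hap[i][j][k] is the (assumed known) frequency of haplotype A_i B_j C_k;
    all entries must sum to 1. Returns (d_ab, d_ac, d_bc, d_abc).
    """
    na, nb, nc = len(hap), len(hap[0]), len(hap[0][0])
    cells = list(product(range(na), range(nb), range(nc)))
    total = sum(hap[i][j][k] for i, j, k in cells)
    if abs(total - 1.0) > 1e-9:
        raise ValueError("haplotype frequencies must sum to 1")

    # Allele frequencies: marginal sums over the other two markers.
    p_a = [sum(hap[i][j][k] for j in range(nb) for k in range(nc)) for i in range(na)]
    p_b = [sum(hap[i][j][k] for i in range(na) for k in range(nc)) for j in range(nb)]
    p_c = [sum(hap[i][j][k] for i in range(na) for j in range(nb)) for k in range(nc)]

    # Digenic coefficients: two-marker haplotype frequency minus the
    # product of the corresponding allele frequencies.
    d_ab = [[sum(hap[i][j][k] for k in range(nc)) - p_a[i] * p_b[j]
             for j in range(nb)] for i in range(na)]
    d_ac = [[sum(hap[i][j][k] for j in range(nb)) - p_a[i] * p_c[k]
             for k in range(nc)] for i in range(na)]
    d_bc = [[sum(hap[i][j][k] for i in range(na)) - p_b[j] * p_c[k]
             for k in range(nc)] for j in range(nb)]

    # Trigenic coefficient: what remains of the three-marker haplotype
    # frequency after removing independence and all digenic contributions.
    d_abc = [[[hap[i][j][k]
               - p_a[i] * d_bc[j][k]
               - p_b[j] * d_ac[i][k]
               - p_c[k] * d_ab[i][j]
               - p_a[i] * p_b[j] * p_c[k]
               for k in range(nc)] for j in range(nb)] for i in range(na)]
    return d_ab, d_ac, d_bc, d_abc
```

Under this decomposition, markers that are mutually independent give coefficients of zero for every allele combination, so any nonzero entry indicates association at the digenic or trigenic level. Because each marker keeps all of its alleles as separate indices, no alleles need to be pooled into a biallelic coding before the coefficients are computed.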