Li Yanchun, Li Yang, Wu Song, Han Kun, Wang Zhengjia, Hou Wei, Zeng Yanru, Wu Rongling
School of Forestry and Biotechnology, Zhejiang Forestry University, Lin'an, Zhejiang, People's Republic of China.
Genetics. 2007 Jul;176(3):1811-21. doi: 10.1534/genetics.106.068890. Epub 2007 Jun 11.
Analysis of population structure and organization with DNA-based markers can provide important information regarding the history and evolution of a species. Linkage disequilibrium (LD) analysis based on allelic associations between different loci is emerging as a viable tool to unravel the genetic basis of population differentiation. In this article, we derive the EM algorithm to obtain the maximum-likelihood estimates of the linkage disequilibria between dominant markers, to study the patterns of genetic diversity for a diploid species. The algorithm was expanded to estimate and test linkage disequilibria of different orders among three dominant markers and can be technically extended to manipulate an arbitrary number of dominant markers. The feasibility of the proposed algorithm is validated by an example of population genetic studies of hickory trees, native to southeastern China, using dominant random amplified polymorphic DNA markers. Extensive simulation studies were performed to investigate the statistical properties of this algorithm. The precision of the estimates of linkage disequilibrium between dominant markers was compared with that between codominant markers. Results from simulation studies suggest that three-locus LD analysis displays increased power of LD detection relative to two-locus LD analysis. This algorithm is useful for studying the pattern and amount of genetic variation within and among populations.
利用基于DNA的标记分析种群结构和组织,可以提供有关物种历史和进化的重要信息。基于不同位点间等位基因关联的连锁不平衡(LD)分析正成为揭示种群分化遗传基础的一种可行工具。在本文中,我们推导了期望最大化(EM)算法,以获得显性标记间连锁不平衡的最大似然估计,从而研究二倍体物种的遗传多样性模式。该算法被扩展用于估计和检验三个显性标记间不同阶数的连锁不平衡,并且在技术上可以扩展以处理任意数量的显性标记。通过对原产于中国东南部的山核桃树进行种群遗传学研究的实例,使用显性随机扩增多态性DNA标记,验证了所提算法的可行性。进行了广泛的模拟研究以考察该算法的统计特性。将显性标记间连锁不平衡估计的精度与共显性标记间的精度进行了比较。模拟研究结果表明,相对于两位点LD分析,三位点LD分析显示出更高的LD检测效能。该算法对于研究种群内和种群间遗传变异的模式及数量很有用。