Unit of Animal Genomics, GIGA-Research and Department of Animal Production, Faculty of Veterinary Medicine, University of Liège, Belgium.
Genetics. 2010 Mar;184(3):789-98. doi: 10.1534/genetics.109.108431. Epub 2009 Dec 14.
Faithful reconstruction of haplotypes from diploid marker data (phasing) is important for many kinds of genetic analyses, including mapping of trait loci, prediction of genomic breeding values, and identification of signatures of selection. In human genetics, phasing most often exploits population information (linkage disequilibrium), while in animal genetics the primary source of information is familial (Mendelian segregation and linkage). We herein develop and evaluate a method that simultaneously exploits both sources of information. It builds on hidden Markov models that were initially developed to exploit population information only. We demonstrate that the approach improves the accuracy of allele phasing as well as imputation of missing genotypes. Reconstructed haplotypes are assigned to hidden states that are shown to correspond to clusters of genealogically related chromosomes. We show that these cluster states can directly be used to fine map QTL. The method is computationally effective at handling large data sets based on high-density SNP panels.
从二倍体标记数据中准确重建单倍型(相位)对于许多遗传分析非常重要,包括性状位点的定位、基因组育种值的预测和选择信号的识别。在人类遗传学中,相位通常利用群体信息(连锁不平衡),而在动物遗传学中,信息的主要来源是家族性的(孟德尔分离和连锁)。我们在此开发并评估了一种同时利用这两种信息源的方法。它基于最初仅用于利用群体信息的隐马尔可夫模型。我们证明,该方法可以提高等位基因相位和缺失基因型推断的准确性。重建的单倍型被分配到隐藏状态,这些状态被证明与谱系相关的染色体簇相对应。我们表明,这些簇状态可以直接用于精细定位 QTL。该方法在处理基于高密度 SNP 面板的大型数据集时具有计算效率。