Liu Jianfeng, Zhang Yuan, Zhang Qin, Wang Lixian, Zhang Jigang
College of Animal Science and Technology, China Agricultural University, Beijing 100094, China.
Sci China C Life Sci. 2006 Dec;49(6):552-9. doi: 10.1007/s11427-006-2024-z.
It is a challenging issue to map Quantitative Trait Loci (QTL) underlying complex discrete traits, which usually show discontinuous distribution and less information, using conventional statistical methods. Bayesian-Markov chain Monte Carlo (Bayesian-MCMC) approach is the key procedure in mapping QTL for complex binary traits, which provides a complete posterior distribution for QTL parameters using all prior information. As a consequence, Bayesian estimates of all interested variables can be obtained straightforwardly basing on their posterior samples simulated by the MCMC algorithm. In our study, utilities of Bayesian-MCMC are demonstrated using simulated several animal outbred full-sib families with different family structures for a complex binary trait underlied by both a QTL and polygene. Under the Identity-by-Descent-Based variance component random model, three samplers basing on MCMC, including Gibbs sampling, Metropolis algorithm and reversible jump MCMC, were implemented to generate the joint posterior distribution of all unknowns so that the QTL parameters were obtained by Bayesian statistical inferring. The results showed that Bayesian-MCMC approach could work well and robust under different family structures and QTL effects. As family size increases and the number of family decreases, the accuracy of the parameter estimates will be improved. When the true QTL has a small effect, using outbred population experiment design with large family size is the optimal mapping strategy.
利用传统统计方法来定位复杂离散性状背后的数量性状基因座(QTL)是一个具有挑战性的问题,这类性状通常呈现不连续分布且信息较少。贝叶斯 - 马尔可夫链蒙特卡罗(Bayesian - MCMC)方法是定位复杂二元性状QTL的关键步骤,它利用所有先验信息为QTL参数提供完整的后验分布。因此,基于通过MCMC算法模拟的后验样本,可以直接获得所有感兴趣变量的贝叶斯估计值。在我们的研究中,通过模拟具有不同家系结构的几个动物远交全同胞家系,来展示Bayesian - MCMC在一个由QTL和多基因控制的复杂二元性状中的应用。在基于血缘一致性的方差分量随机模型下,实现了基于MCMC的三种采样器,包括吉布斯采样、梅特罗波利斯算法和可逆跳跃MCMC,以生成所有未知量的联合后验分布,从而通过贝叶斯统计推断获得QTL参数。结果表明,Bayesian - MCMC方法在不同家系结构和QTL效应下都能良好且稳健地工作。随着家系规模的增加和家系数目的减少,参数估计的准确性将会提高。当真实QTL效应较小时,采用具有大型家系规模的远交群体实验设计是最优的定位策略。