Sillanpää Mikko J, Hoti Fabian
Department of Mathematics and Statistics, University of Helsinki, Finland.
Genetics. 2007 Dec;177(4):2361-77. doi: 10.1534/genetics.107.081299.
A new effective Bayesian quantitative trait locus (QTL) mapping approach for the analysis of single-tail selected samples of the phenotype distribution is presented. The approach extends the affected-only tests to single-tail sampling with quantitative traits such as the log-normal survival time or censored/selected traits. A great benefit of the approach is that it enables the utilization of multiple-QTL models, is easy to incorporate into different data designs (experimental and outbred populations), and can potentially be extended to epistatic models. In inbred lines, the method exploits the fact that the parental mating type and the linkage phases (haplotypes) are known by definition. In outbred populations, two-generation data are needed, for example, selected offspring and one of the parents (the sires) in breeding material. The idea is to statistically (computationally) generate a fully complementary, maximally dissimilar, observation for each offspring in the sample. Bayesian data augmentation is then used to sample the space of possible trait values for the pseudoobservations. The benefits of the approach are illustrated using simulated data sets and a real data set on the survival of F(2) mice following infection with Listeria monocytogenes.
本文提出了一种新的有效贝叶斯数量性状基因座(QTL)定位方法,用于分析表型分布的单尾选择样本。该方法将仅针对受影响个体的检验扩展到具有定量性状(如对数正态生存时间或删失/选择性状)的单尾抽样。该方法的一个巨大优势在于,它能够利用多QTL模型,易于纳入不同的数据设计(实验群体和远交群体),并且有可能扩展到上位性模型。在近交系中,该方法利用了这样一个事实,即根据定义,亲本交配类型和连锁相(单倍型)是已知的。在远交群体中,则需要两代数据,例如育种材料中的选择后代和其中一个亲本(父本)。其思路是通过统计(计算)为样本中的每个后代生成一个完全互补、差异最大的观测值。然后使用贝叶斯数据扩充对伪观测值的可能性状值空间进行抽样。使用模拟数据集和关于单核细胞增生李斯特菌感染后F(2)小鼠存活情况的真实数据集说明了该方法的优势。