Faculty of Agricultural Sciences, Department of Genetics and Biotechnology, Research Centre Foulum, Aarhus University, Tjele, Denmark.
Genet Epidemiol. 2010 Jul;34(5):455-62. doi: 10.1002/gepi.20499.
Association mapping methods were compared using a simulation with a complex pedigree structure. The pedigree was simulated while keeping the present Danish Holstein population pedigree in view. A total of 15 quantitative trait loci (QTL) with varying effect sizes (10%, 5% and 2% of total genetic variance) were simulated. We compared the single-marker test, haplotype-based analysis, mixed model approach, and Bayesian analysis. The methods were compared for power, precision of location estimates, and type I error rates. Results found the best performance in a Bayesian method that included genetic background effects and simultaneously fitted all single-nucleotide polymorphisms (SNPs) with a variable selection method. A mixed model analysis that fitted genetic background effects and tested one SNP at a time performed nearly as well as the Bayesian method. For the Bayesian method, it proved necessary to collect SNP signals in intervals, to avoid the scattering of a QTL signal over multiple neighboring SNPs. Methods not accounting for genetic background (full pedigree information) performed worse, and methods using haplotypes were considerably worse with a high false-positive rate, probably due to the presence of low-frequency haplotypes. It was necessary to account for full relationships among individuals to avoid excess false discovery. Although the methods were tested on a cattle pedigree, the results are applicable to any population with a complex pedigree structure.
使用具有复杂家系结构的模拟来比较关联映射方法。在模拟时,考虑到当前丹麦荷斯坦牛种群的系谱,模拟了总共 15 个具有不同效应大小(总遗传方差的 10%、5%和 2%)的数量性状基因座(QTL)。我们比较了单标记检验、基于单倍型的分析、混合模型方法和贝叶斯分析。这些方法的性能比较包括功效、位置估计的精度和 I 型错误率。结果发现,在贝叶斯方法中表现最佳,该方法包括遗传背景效应,并同时使用可变选择方法拟合所有单核苷酸多态性(SNP)。拟合遗传背景效应并一次测试一个 SNP 的混合模型分析与贝叶斯方法几乎一样好。对于贝叶斯方法,证明有必要在区间中收集 SNP 信号,以避免 QTL 信号在多个相邻 SNP 上分散。不考虑遗传背景(完整系谱信息)的方法性能较差,而使用单倍型的方法假阳性率相当高,可能是由于存在低频单倍型。为了避免过度的假发现,有必要考虑个体之间的完整关系。尽管这些方法是在牛系谱上进行测试的,但结果适用于任何具有复杂系谱结构的群体。