Statistical Genomics Lab, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China.
State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing 210095, China.
Sci Rep. 2016 Jul 20;6:29951. doi: 10.1038/srep29951.
Composite interval mapping (CIM) is the most widely-used method in linkage analysis. Its main feature is the ability to control genomic background effects via inclusion of co-factors in its genetic model. However, the result often depends on how the co-factors are selected, especially for small-effect and linked quantitative trait loci (QTL). To address this issue, here we proposed a new method under the framework of genome-wide association studies (GWAS). First, a single-locus random-SNP-effect mixed linear model method for GWAS was used to scan each putative QTL on the genome in backcross or doubled haploid populations. Here, controlling background via selecting markers in the CIM was replaced by estimating polygenic variance. Then, all the peaks in the negative logarithm P-value curve were selected as the positions of multiple putative QTL to be included in a multi-locus genetic model, and true QTL were automatically identified by empirical Bayes. This called genome-wide CIM (GCIM). A series of simulated and real datasets was used to validate the new method. As a result, the new method had higher power in QTL detection, greater accuracy in QTL effect estimation, and stronger robustness under various backgrounds as compared with the CIM and empirical Bayes methods.
复合区间作图(CIM)是连锁分析中最广泛使用的方法。其主要特点是能够通过在遗传模型中包含协变量来控制基因组背景效应。然而,结果往往取决于协变量的选择方式,尤其是对于小效应和连锁的数量性状位点(QTL)。为了解决这个问题,我们在这里提出了一种在全基因组关联研究(GWAS)框架下的新方法。首先,使用单基因座随机 SNP 效应混合线性模型方法在回交或双单倍体群体中扫描基因组上的每个假定 QTL。在这里,通过在 CIM 中选择标记来控制背景,被估计多基因方差所取代。然后,将负对数 P 值曲线中的所有峰作为包含在多基因座遗传模型中的多个假定 QTL 的位置,并且通过经验贝叶斯自动识别真正的 QTL。这被称为全基因组 CIM(GCIM)。一系列模拟和真实数据集被用来验证新方法。结果表明,与 CIM 和经验贝叶斯方法相比,新方法在 QTL 检测中具有更高的功效,在 QTL 效应估计中具有更高的准确性,并且在各种背景下具有更强的稳健性。