Iwata Hiroyoshi, Ebana Kaworu, Fukuoka Shuichi, Jannink Jean-Luc, Hayashi Takeshi
Data Mining and Grid Research Team, National Agricultural Research Center, National Agriculture and Food Research Organization, 3-1-1 Kannondai, Tsukuba, Ibaraki, 305-8666, Japan.
Theor Appl Genet. 2009 Mar;118(5):865-80. doi: 10.1007/s00122-008-0945-6. Epub 2009 Jan 9.
Association mapping can be a powerful tool for detecting quantitative trait loci (QTLs) without requiring line-crossing experiments. We previously proposed a Bayesian approach for simultaneously mapping multiple QTLs by a regression method that directly incorporates estimates of the population structure. In the present study, we extended our method to analyze ordinal and censored traits, since both types of traits are common in the evaluation of germplasm collections. Ordinal-probit and tobit models were employed to analyze ordinal and censored traits, respectively. In both models, we postulated the existence of a latent continuous variable associated with the observable data, and we used a Markov-chain Monte Carlo algorithm to sample the latent variable and determine the model parameters. We evaluated the efficiency of our approach by using simulated- and real-trait analyses of a rice germplasm collection. Simulation analyses based on real marker data showed that our models could reduce both false-positive and false-negative rates in detecting QTLs to reasonable levels. Simulation analyses based on highly polymorphic marker data, which were generated by coalescent simulations, showed that our models could be applied to genotype data based on highly polymorphic marker systems, like simple sequence repeats. For the real traits, we analyzed heading date as a censored trait and amylose content and the shape of milled rice grains as ordinal traits. We found significant markers that may be linked to previously reported QTLs. Our approach will be useful for whole-genome association mapping of ordinal and censored traits in rice germplasm collections.
关联作图可以成为检测数量性状基因座(QTL)的强大工具,而无需进行品系杂交实验。我们之前提出了一种贝叶斯方法,通过一种直接纳入群体结构估计值的回归方法来同时定位多个QTL。在本研究中,我们扩展了我们的方法以分析有序和删失性状,因为这两种性状在种质资源收集评估中都很常见。分别采用有序概率模型和托比特模型来分析有序和删失性状。在这两种模型中,我们假定存在一个与可观测数据相关的潜在连续变量,并使用马尔可夫链蒙特卡罗算法对潜在变量进行采样并确定模型参数。我们通过对水稻种质资源收集进行模拟和实际性状分析来评估我们方法的效率。基于真实标记数据的模拟分析表明,我们的模型可以将检测QTL时的假阳性和假阴性率降低到合理水平。基于通过合并模拟生成的高度多态性标记数据的模拟分析表明,我们的模型可以应用于基于高度多态性标记系统(如简单序列重复)的基因型数据。对于实际性状,我们将抽穗期作为删失性状进行分析,并将直链淀粉含量和糙米籽粒形状作为有序性状进行分析。我们发现了可能与先前报道的QTL连锁的显著标记。我们的方法将有助于对水稻种质资源收集的有序和删失性状进行全基因组关联作图。