Pong-Wong Ricardo, Hadjipavlou Georgia
The Roslin Institute and R(D)SVS, University of Edinburgh, Roslin, Midlothian, EH25 9PS, UK.
BMC Proc. 2010 Mar 31;4 Suppl 1(Suppl 1):S4. doi: 10.1186/1753-6561-4-S1-S4.
We used the Gompertz growth curve to model a simulated longitudinal dataset provided by the QTLMAS2009 workshop and applied genomic evaluation to the derived model parameters and to a model-predicted trait value.
Prediction of phenotypic information from the Gompertz curve allowed us to obtain genomic breeding value estimates for a time point with no phenotypic records. Despite that the true model used to simulate the data was the logistic growth model, the Gompertz model provided a good fit of the data. Genomic breeding values calculated from predicted phenotypes were highly correlated with the breeding values obtained by directly using the respective observed phenotypes. The accuracies between the true and estimated breeding value at time 600 were above 0.93, even though t600 was outside the time range used when fitting the data. The analysis of the parameters of the Gompertz curve successfully discriminated regions with QTL affecting the asymptotic final value, but it was less successful in finding QTL affecting the other parameters of the logistic growth curve. In this study we estimated the proportion of SNPs affecting a given trait, in contrast with previously reported implementations of genomic selection in which this parameter was assumed to be known without error.
The two-step approach used to combine curve fitting and genomic selection on longitudinal data provided a simple way for combining these two complex tasks without any detrimental effect on breeding value estimation.
我们使用Gompertz生长曲线对QTLMAS2009研讨会提供的模拟纵向数据集进行建模,并将基因组评估应用于导出的模型参数和模型预测的性状值。
从Gompertz曲线预测表型信息使我们能够获得无表型记录时间点的基因组育种值估计。尽管用于模拟数据的真实模型是逻辑生长模型,但Gompertz模型对数据拟合良好。根据预测表型计算的基因组育种值与直接使用相应观察表型获得的育种值高度相关。即使t600超出拟合数据时使用的时间范围,在时间600时真实育种值与估计育种值之间的准确性仍高于0.93。对Gompertz曲线参数的分析成功地鉴别出影响渐近终值的QTL区域,但在寻找影响逻辑生长曲线其他参数的QTL方面不太成功。在本研究中,我们估计了影响给定性状的SNP比例,这与之前报道的基因组选择实施情况相反,在之前的实施中,该参数被假定为无误差已知。
用于结合纵向数据曲线拟合和基因组选择的两步法提供了一种简单的方法来结合这两项复杂任务,且对育种值估计没有任何不利影响。