Suppr超能文献

玉米单交种杂种优势预测:支持向量机回归与最佳线性预测比较。

Prediction of maize single-cross hybrid performance: support vector machine regression versus best linear prediction.

机构信息

Department of Biosciences and Landscape Architecture, University College Ghent, Voskenslaan 270, 9000 Gent, Belgium.

出版信息

Theor Appl Genet. 2010 Jan;120(2):415-27. doi: 10.1007/s00122-009-1200-5. Epub 2009 Nov 11.

Abstract

Accurate prediction of the phenotypic performance of a hybrid plant based on the molecular fingerprints of its parents should lead to a more cost-effective breeding programme as it allows to reduce the number of expensive field evaluations. The construction of a reliable prediction model requires a representative sample of hybrids for which both molecular and phenotypic information are accessible. This phenotypic information is usually readily available as typical breeding programmes test numerous new hybrids in multi-location field trials on a yearly basis. Earlier studies indicated that a linear mixed model analysis of this typically unbalanced phenotypic data allows to construct epsilon-insensitive support vector machine regression and best linear prediction models for predicting the performance of single-cross maize hybrids. We compare these prediction methods using different subsets of the phenotypic and marker data of a commercial maize breeding programme and evaluate the resulting prediction accuracies by means of a specifically designed field experiment. This balanced field trial allows to assess the reliability of the cross-validation prediction accuracies reported here and in earlier studies. The limits of the predictive capabilities of both prediction methods are further examined by reducing the number of training hybrids and the size of the molecular fingerprints. The results indicate a considerable discrepancy between prediction accuracies obtained by cross-validation procedures and those obtained by correlating the predictions with the results of a validation field trial. The prediction accuracy of best linear prediction was less sensitive to a reduction of the number of training examples compared with that of support vector machine regression. The latter was, however, better at predicting hybrid performance when the size of the molecular fingerprints was reduced, especially if the initial set of markers had a low information content.

摘要

基于杂交植物双亲的分子指纹准确预测其表型表现,应该会导致更具成本效益的育种计划,因为这可以减少昂贵的田间评估数量。构建可靠的预测模型需要具有代表性的杂交样本,这些样本可同时获得分子和表型信息。这种表型信息通常很容易获得,因为典型的育种计划每年都会在多个地点的田间试验中测试许多新的杂交品种。早期的研究表明,对这种典型的非平衡表型数据进行线性混合模型分析,可以构建用于预测单交玉米杂交种性能的ε不敏感支持向量机回归和最佳线性预测模型。我们使用商业玉米育种计划的表型和标记数据的不同子集来比较这些预测方法,并通过专门设计的田间试验评估由此产生的预测准确性。该平衡田间试验允许评估这里和早期研究中报告的交叉验证预测准确性的可靠性。通过减少训练杂交种的数量和分子指纹的大小,进一步检查了这两种预测方法的预测能力的局限性。结果表明,交叉验证程序获得的预测准确性与验证田间试验结果相关联获得的预测准确性之间存在相当大的差异。与支持向量机回归相比,最佳线性预测的预测准确性对训练样本数量的减少不太敏感。然而,当分子指纹的大小减少时,支持向量机回归更能预测杂种的表现,尤其是当初始标记集信息量较低时。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验