通过利用17年的历史数据和系谱信息优化国际水稻研究所水稻抗旱育种计划中的预测。

Optimizing predictions in IRRI's rice drought breeding program by leveraging 17 years of historical data and pedigree information.

作者信息

Khanna Apurva, Anumalla Mahender, Catolos Margaret, Bhosale Sankalp, Jarquin Diego, Hussain Waseem

机构信息

Rice Breeding Innovation Platform, International Rice Research Institute (IRRI), Los Baños, Laguna, Philippines.

Agronomy Department, University of Florida, Gainesville, FL, United States.

出版信息

Front Plant Sci. 2022 Sep 20;13:983818. doi: 10.3389/fpls.2022.983818. eCollection 2022.

DOI:10.3389/fpls.2022.983818

PMID:36204059

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9530897/

Abstract

Prediction models based on pedigree and/or molecular marker information are now an inextricable part of the crop breeding programs and have led to increased genetic gains in many crops. Optimization of IRRI's rice drought breeding program is crucial for better implementation of selections based on predictions. Historical datasets with precise and robust pedigree information have been a great resource to help optimize the prediction models in the breeding programs. Here, we leveraged 17 years of historical drought data along with the pedigree information to predict the new lines or environments and dissect the G × E interactions. Seven models ranging from basic to proposed higher advanced models incorporating interactions, and genotypic specific effects were used. These models were tested with three cross-validation schemes (CV1, CV2, and CV0) to assess the predictive ability of tested and untested lines in already observed environments and tested lines in novel or new environments. In general, the highest prediction abilities were obtained when the model accounting interactions between pedigrees (additive) and environment were included. The CV0 scheme (predicting unobserved or novel environments) reveals very low predictive abilities among the three schemes. CV1 and CV2 schemes that borrow information from the target and correlated environments have much higher predictive abilities. Further, predictive ability was lower when predicting lines in non-stress conditions using drought data as training set and/or . When predicting the lines using the data sets under the same conditions (stress or non-stress data sets), much better prediction accuracy was obtained. These results provide conclusive evidence that modeling G × E interactions are important in predictions. Thus, considering G × E interactions would help to build enhanced genomic or pedigree-based prediction models in the rice breeding program. Further, it is crucial to borrow the correlated information from other environments to improve prediction accuracy.

摘要

基于系谱和/或分子标记信息的预测模型如今已成为作物育种计划中不可或缺的一部分，并在许多作物中带来了更高的遗传增益。优化国际水稻研究所（IRRI）的水稻干旱育种计划对于更好地基于预测进行选择至关重要。具有精确且可靠系谱信息的历史数据集一直是帮助优化育种计划中预测模型的重要资源。在此，我们利用了17年的历史干旱数据以及系谱信息来预测新的品系或环境，并剖析基因型与环境互作（G×E）。使用了七种模型，从基本模型到纳入互作和基因型特定效应的更高阶模型。这些模型通过三种交叉验证方案（CV1、CV2和CV0）进行测试，以评估已观测环境中测试和未测试品系以及新环境中测试品系的预测能力。总体而言，当包含考虑系谱（加性）与环境之间互作的模型时，获得了最高的预测能力。CV0方案（预测未观测或新环境）在这三种方案中显示出非常低的预测能力。从目标环境和相关环境借用信息的CV1和CV2方案具有更高的预测能力。此外，当使用干旱数据作为训练集在非胁迫条件下预测品系时，预测能力较低。当使用相同条件下的数据集（胁迫或非胁迫数据集）预测品系时，获得了更好的预测准确性。这些结果提供了确凿的证据，表明模拟G×E互作在预测中很重要。因此，考虑G×E互作将有助于在水稻育种计划中构建增强的基于基因组或系谱的预测模型。此外，从其他环境借用相关信息以提高预测准确性至关重要。