ICAR-National Rice Research Institute, Cuttack, 753006, India.
University of Agricultural Sciences, Bangalore, 560065, India.
Heredity (Edinb). 2023 May;130(5):335-345. doi: 10.1038/s41437-023-00599-5. Epub 2023 Feb 15.
It is hypothesized that the genome-wide genic markers may increase the prediction accuracy of genomic selection for quantitative traits. To test this hypothesis, a set of candidate gene-based markers for yield and grain traits-related genes cloned across the rice genome were custom-designed. A multi-model, multi-locus genome-wide association study (GWAS) was performed using new genic markers developed to test their effectiveness for gene discovery. Two multi-locus models, FarmCPU and mrMLM, along with a single-locus mixed linear model (MLM), identified 28 significant marker-trait associations. These associations revealed novel causative alleles for grain weight and pleiotropic associations with other traits. For instance, the marker YD91 derived from the gene OsAAP3 on chromosome 1 was consistently associated with grain weight, while the gene has a significant effect on grain yield. Furthermore, nine genomic selection methods, including regression-based and machine learning-based models, were used to predict grain weight using a leave-one-out five-fold cross-validation approach to optimize the genomic selection model with genic markers. Among nine prediction models, Kernel Hilbert Space Regression (RKHS) is the best among regression-based models, and Random Forest Regression (RFR) is the best among machine learning-based models. Genomic prediction accuracies with and without GWAS significant markers were compared to assess the effectiveness of markers. The rapid decreases in prediction accuracy upon dropping GWAS significant markers indicate the effectiveness of new genic markers in genomic selection. Apart from that, the candidate gene-based markers were found to be more effective in genomic selection programs for better accuracy.
人们假设全基因组基因标记可以提高数量性状基因组选择的预测准确性。为了验证这一假设,针对克隆在水稻基因组中的与产量和粒形性状相关的基因,设计了一套候选基因标记。利用新开发的基因标记进行了多模型、多基因座全基因组关联研究(GWAS),以测试其在基因发现方面的有效性。两种多基因座模型FarmCPU 和 mrMLM 以及一种单基因座混合线性模型(MLM),共鉴定出 28 个与标记-性状显著相关的关联。这些关联揭示了与粒重相关的新的因果等位基因和与其他性状的多效性关联。例如,标记 YD91 来源于第 1 号染色体上的基因 OsAAP3,它与粒重一直显著相关,而该基因对粒产量有显著影响。此外,使用了 9 种基因组选择方法,包括基于回归和基于机器学习的模型,使用留一五折交叉验证方法,通过基因标记优化基因组选择模型,预测粒重。在 9 种预测模型中,基于核希尔伯特空间回归(RKHS)的模型在基于回归的模型中表现最好,而基于随机森林回归(RFR)的模型在基于机器学习的模型中表现最好。比较了有和没有 GWAS 显著标记的基因组预测准确性,以评估标记的有效性。在去除 GWAS 显著标记后,预测准确性迅速下降,这表明新基因标记在基因组选择中是有效的。此外,候选基因标记在基因组选择计划中被发现更有效,可以提高准确性。