He Sang, Liang Shanshan, Meng Lijun, Cao Liyong, Ye Guoyou
Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518124, China.
CAAS-IRRI Joint Laboratory for Genomics-Assisted Germplasm Enhancement, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518124, China.
Rice (N Y). 2023 Jun 7;16(1):27. doi: 10.1186/s12284-023-00643-2.
The multi-environment genomic selection enables plant breeders to select varieties resilient to diverse environments or particularly adapted to specific environments, which holds a great potential to be used in rice breeding. To realize the multi-environment genomic selection, a robust training set with multi-environment phenotypic data is of necessity. Considering the huge potential of genomic prediction enhanced sparse phenotyping on the cost saving of multi-environment trials (MET), the establishment of a multi-environment training set could also benefit from it. Optimizing the genomic prediction methods is also crucial to enhance the multi-environment genomic selection. Using haplotype-based genomic prediction models is able to capture local epistatic effects which could be conserved and accumulated across generations much like additive effects thereby benefitting breeding. However, previous studies often used fixed length haplotypes composed by a few adjacent molecular markers disregarding the linkage disequilibrium (LD) which is of essential role in determining the haplotype length. In our study, based on three rice populations with different sizes and compositions, we investigated the usefulness and effectiveness of multi-environment training sets with varying phenotyping intensities and different haplotype-based genomic prediction models based on LD-derived haplotype blocks for two agronomic traits, i.e., days to heading (DTH) and plant height (PH). Results showed that phenotyping merely 30% records in multi-environment training set is able to provide a comparable prediction accuracy to high phenotyping intensities; the local epistatic effects are much likely existent in DTH; dividing the LD-derived haplotype blocks into small segments with two or three single nucleotide polymorphisms (SNPs) helps to maintain the predictive ability of haplotype-based models in large populations; modelling the covariances between environments improves genomic prediction accuracy. Our study provides means to improve the efficiency of multi-environment genomic selection in rice.
多环境基因组选择使植物育种者能够选择对多种环境具有抗性或特别适应特定环境的品种,这在水稻育种中具有巨大的应用潜力。为了实现多环境基因组选择,拥有一个包含多环境表型数据的强大训练集是必要的。考虑到基因组预测增强稀疏表型在节省多环境试验(MET)成本方面的巨大潜力,多环境训练集的建立也可以从中受益。优化基因组预测方法对于增强多环境基因组选择也至关重要。使用基于单倍型的基因组预测模型能够捕获局部上位性效应,这种效应可以像加性效应一样在世代间保守和积累,从而有利于育种。然而,以前的研究通常使用由少数相邻分子标记组成的固定长度单倍型,而忽略了在确定单倍型长度中起关键作用的连锁不平衡(LD)。在我们的研究中,基于三个不同大小和组成的水稻群体,我们研究了具有不同表型强度的多环境训练集以及基于LD衍生单倍型块的不同基于单倍型的基因组预测模型对两个农艺性状(即抽穗天数(DTH)和株高(PH))的有效性和实用性。结果表明,在多环境训练集中仅对30%的记录进行表型分析就能提供与高表型强度相当的预测准确性;DTH中很可能存在局部上位性效应;将LD衍生的单倍型块分成包含两到三个单核苷酸多态性(SNP)的小片段有助于在大群体中保持基于单倍型模型的预测能力;对环境间的协方差进行建模可提高基因组预测准确性。我们的研究为提高水稻多环境基因组选择的效率提供了方法。