Zhang Fan, Kang Junmei, Long Ruicai, Li Mingna, Sun Yan, He Fei, Jiang Xueqian, Yang Changfu, Yang Xijiang, Kong Jie, Wang Yiwen, Wang Zhen, Zhang Zhiwu, Yang Qingchuan
Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing, China, 100193.
Department of Crop and Soil Sciences, Washington State University, Pullman, WA, USA, 99163.
Hortic Res. 2022 Oct 7;10(1):uhac225. doi: 10.1093/hr/uhac225. eCollection 2023.
Fall dormancy (FD) is an essential trait to overcome winter damage and for alfalfa () cultivar selection. The plant regrowth height after autumn clipping is an indirect way to evaluate FD. Transcriptomics, proteomics, and quantitative trait locus mapping have revealed crucial genes correlated with FD; however, these genes cannot predict alfalfa FD very well. Here, we conducted genomic prediction of FD using whole-genome SNP markers based on machine learning-related methods, including support vector machine (SVM) regression, and regularization-related methods, such as Lasso and ridge regression. The results showed that using SVM regression with linear kernel and the top 3000 genome-wide association study (GWAS)-associated markers achieved the highest prediction accuracy for FD of 64.1%. For plant regrowth height, the prediction accuracy was 59.0% using the 3000 GWAS-associated markers and the SVM linear model. This was better than the results using whole-genome markers (25.0%). Therefore, the method we explored for alfalfa FD prediction outperformed the other models, such as Lasso and ElasticNet. The study suggests the feasibility of using machine learning to predict FD with GWAS-associated markers, and the GWAS-associated markers combined with machine learning would benefit FD-related traits as well. Application of the methodology may provide potential targets for FD selection, which would accelerate genetic research and molecular breeding of alfalfa with optimized FD.
秋眠性(FD)是苜蓿克服冬季冻害及品种选择的重要性状。秋季刈割后植株的再生高度是评估秋眠性的一种间接方法。转录组学、蛋白质组学和数量性状基因座定位已揭示了与秋眠性相关的关键基因;然而,这些基因并不能很好地预测苜蓿的秋眠性。在此,我们基于机器学习相关方法,包括支持向量机(SVM)回归以及正则化相关方法,如套索回归和岭回归,使用全基因组单核苷酸多态性(SNP)标记对秋眠性进行了基因组预测。结果表明,使用线性核的支持向量机回归以及前3000个全基因组关联研究(GWAS)相关标记,秋眠性预测准确率最高可达64.1%。对于植株再生高度,使用3000个GWAS相关标记和支持向量机线性模型时,预测准确率为59.0%。这比使用全基因组标记的结果(25.0%)要好。因此,我们探索的苜蓿秋眠性预测方法优于其他模型,如套索回归和弹性网络。该研究表明利用机器学习结合GWAS相关标记预测秋眠性具有可行性,且GWAS相关标记与机器学习相结合也将有利于秋眠性相关性状的研究。该方法的应用可为秋眠性选择提供潜在靶点,从而加速具有优化秋眠性的苜蓿的遗传研究和分子育种。