Disentangling soybean GxE effects in an integrated genomic prediction and machine learning-GWAS workflow.

Authors

Verbrigghe Niel, Muylle Hilde, Pegard Marie, Rietman Hendrik, Đorđević Vuk, Ćeran Marina, Roldán-Ruiz Isabel

Affiliations

Plant Sciences Unit, Flanders Research Institute for Agriculture, Fisheries and Food (ILVO), Melle, Belgium.

Unité de Recherche Pluridisciplinaire Prairies Et Plantes Fourragères (P3F), INRAE, Lusignan, France.

Publication information

Plant Methods. 2025 Aug 25;21(1):119. doi: 10.1186/s13007-025-01434-0.

DOI:10.1186/s13007-025-01434-0

PMID:40855500

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12376716/

Abstract

Integrating genotype-by-environment (GxE) interactions into genomic prediction models has been shown to improve prediction accuracy for crops exposed to unfavourable environmental conditions. However, despite the increasing complexity of machine learning models in genomic prediction, no model or approach has proven superior overall to a classical genomic best linear unbiased prediction (GBLUP) model. In this paper, we compared two GBLUP models (a linear mixed effects model and a Bayesian GBLUP) with two machine learning models (random forest and extreme gradient boosting) on the EUCLEG soybean genotype set phenotyped in Belgium and Serbia. We found similar performance for the Bayesian GBLUP and the two machine learning methods. However, using a workflow that decomposed the environment-specific BLUPs into a main genetic effect and a GxE interaction effect, we found increased predictive ability for the interaction component compared to a single-component approach. Furthermore, conducting a machine learning genome-wide association study (ML-GWAS) on both components allowed us to identify important markers for the main genetic effect, as well as environment-specific markers. These could then be associated with correlated markers in other environments. Using only 50 uncorrelated, important markers, we built a small random forest model whose predictive ability was similar across all scenarios to that of the large models including all markers. The results demonstrate a new, integrated genomic prediction and ML-GWAS approach, aimed at high predictive ability and coupled marker detection in the soybean genome for traits phenotyped in different environments.
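The abstract does not say how the decomposition or the marker selection were computed, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes the main genetic effect is each genotype's mean environment-centred BLUP and the GxE effect is the remaining deviation, and it assumes a greedy filter that keeps the most important markers whose pairwise absolute correlation stays below a threshold. The function names, the `max_abs_r` threshold, and the toy data are hypothetical; the importance scores would come from a fitted model such as a random forest.

```python
import numpy as np

def decompose_blups(blups):
    """Split a genotypes x environments BLUP matrix into main and GxE parts.

    Assumption (not from the paper): main effect = mean of each genotype's
    environment-centred BLUPs; GxE effect = what is left over.
    """
    blups = np.asarray(blups, dtype=float)
    env_mean = blups.mean(axis=0, keepdims=True)   # one mean per environment
    centred = blups - env_mean                     # remove environment means
    main = centred.mean(axis=1)                    # one value per genotype
    gxe = centred - main[:, None]                  # genotype x environment matrix
    return env_mean, main, gxe

def select_uncorrelated(markers, importance, n_keep=50, max_abs_r=0.8):
    """Greedily keep the most important markers with pairwise |r| < max_abs_r.

    `markers` is a genotypes x markers matrix; `importance` holds one score
    per marker (hypothetically, random forest feature importances).
    """
    markers = np.asarray(markers, dtype=float)
    corr = np.abs(np.corrcoef(markers, rowvar=False))
    chosen = []
    for j in np.argsort(importance)[::-1]:         # most important first
        if all(corr[j, k] < max_abs_r for k in chosen):
            chosen.append(int(j))
        if len(chosen) == n_keep:
            break
    return chosen

# Toy data: 6 genotypes in 2 environments, 4 markers (marker 1 duplicates marker 0).
toy_blups = np.array([[1.0, 2.0], [0.5, 1.0], [2.0, 2.5],
                      [1.5, 3.0], [0.0, 1.5], [1.0, 2.0]])
env_mean, main, gxe = decompose_blups(toy_blups)

toy_markers = np.array([[0, 0, 0, 2], [1, 1, 0, 0], [2, 2, 1, 1],
                        [0, 0, 1, 1], [1, 1, 2, 0], [2, 2, 2, 2]])
toy_importance = np.array([0.9, 0.8, 0.5, 0.1])
kept = select_uncorrelated(toy_markers, toy_importance, n_keep=2)
```

In this sketch the three parts add back up to the original BLUPs, so the main and GxE components can each be used as a separate prediction target, and the duplicated marker is skipped in favour of the next most important uncorrelated one.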

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c138/12376716/5799119d978e/13007_2025_1434_Fig1_HTML.jpg
