用于植物基因组预测的生物先验知识嵌入深度神经网络

Biological Prior Knowledge-Embedded Deep Neural Network for Plant Genomic Prediction.

作者信息

Ye Chonghang, Li Kai, Sun Weicheng, Jiang Yiwei, Zhang Weihan, Zhang Ping, Hu Yi-Juan, Han Yuepeng, Li Li

机构信息

Agricultural Bioinformatics Key Laboratory of Hubei Province, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.

Hubei Hongshan Laboratory, Wuhan 430070, China.

出版信息

Genes (Basel). 2025 Mar 31;16(4):411. doi: 10.3390/genes16040411.

DOI:10.3390/genes16040411

PMID:40282370

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12027452/

Abstract

Genomic prediction is a powerful approach that predicts phenotypic traits from genotypic information, enabling the acceleration of trait improvement in plant breeding. Traditional genomic prediction methods have primarily relied on linear mixed models, such as Genomic Best Linear Unbiased Prediction (GBLUP), and conventional machine learning methods like Support Vector Regression (SVR). Traditional methods are limited in handling high-dimensional data and nonlinear relationships. Thus, deep learning methods have also been applied to genomic prediction in recent years. We proposed iADEP, Integrated Additive, Dominant, and Epistatic Prediction model based on deep learning. Specifically, single nucleotide polymorphism (SNP) data integrating latent genetic interactions and genome-wide association study results as biological prior knowledge are fused to an SNP embedding block, which is then input to a local encoder. The local encoder is fused with an omic-data-incorporated global decoder through a multi-head attention mechanism, followed by multilayer perceptrons. : Firstly, we demonstrated through experiments on four datasets that iADEP outperforms existing methods in genotype-to-phenotype prediction. Secondly, we validated the effectiveness of SNP embedding through ablation experiments. Third, we provided an available module for combining other omics data in iADEP and propose a novel method for fusing them. Fourthly, we explored the impact of feature selection on iADEP performance and conclude that utilizing the full set of SNPs generally provides optimal results. Finally, by altering the partition of training and testing sets, we investigated the differences between transductive learning and inductive learning. iADEP provides a new approach for AI breeding, a promising method that integrates biological prior knowledge and enables combination with other omics data.

摘要

基因组预测是一种强大的方法，它能根据基因型信息预测表型性状，从而加速植物育种中的性状改良。传统的基因组预测方法主要依赖线性混合模型，如基因组最佳线性无偏预测（GBLUP），以及传统机器学习方法，如支持向量回归（SVR）。传统方法在处理高维数据和非线性关系方面存在局限性。因此，近年来深度学习方法也被应用于基因组预测。我们提出了iADEP，即基于深度学习的整合加性、显性和上位性预测模型。具体而言，将整合潜在遗传相互作用和全基因组关联研究结果作为生物学先验知识的单核苷酸多态性（SNP）数据融合到一个SNP嵌入模块中，然后将其输入到一个局部编码器。局部编码器通过多头注意力机制与一个整合了组学数据的全局解码器融合，随后接多层感知器。首先，我们通过在四个数据集上的实验证明，iADEP在基因型到表型的预测方面优于现有方法。其次，我们通过消融实验验证了SNP嵌入的有效性。第三，我们在iADEP中提供了一个用于组合其他组学数据的可用模块，并提出了一种融合它们的新方法。第四，我们探讨了特征选择对iADEP性能的影响，并得出结论，使用全套SNP通常能提供最佳结果。最后，通过改变训练集和测试集的划分，我们研究了转导学习和归纳学习之间的差异。iADEP为人工智能育种提供了一种新方法，这是一种有前景的方法，它整合了生物学先验知识，并能够与其他组学数据相结合。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a7e/12027452/84379aea52fc/genes-16-00411-g001.jpg

相似文献

Biological Prior Knowledge-Embedded Deep Neural Network for Plant Genomic Prediction.

Genes (Basel). 2025 Mar 31;16(4):411. doi: 10.3390/genes16040411.

AutoGP: An intelligent breeding platform for enhancing maize genomic selection.

Plant Commun. 2025 Apr 14;6(4):101240. doi: 10.1016/j.xplc.2025.101240. Epub 2025 Jan 8.

DNNGP, a deep neural network-based method for genomic prediction using multi-omics data in plants.

Mol Plant. 2023 Jan 2;16(1):279-293. doi: 10.1016/j.molp.2022.11.004. Epub 2022 Nov 10.

Genomic prediction with NetGP based on gene network and multi-omics data in plants.

Plant Biotechnol J. 2025 Apr;23(4):1190-1201. doi: 10.1111/pbi.14577. Epub 2025 Feb 14.

Genomic prediction using information across years with epistatic models and dimension reduction via haplotype blocks.

PLoS One. 2023 Mar 31;18(3):e0282288. doi: 10.1371/journal.pone.0282288. eCollection 2023.

TrG2P: A transfer-learning-based tool integrating multi-trait data for accurate prediction of crop yield.

Plant Commun. 2024 Jul 8;5(7):100975. doi: 10.1016/j.xplc.2024.100975. Epub 2024 May 15.

GEFormer: A genotype-environment interaction-based genomic prediction method that integrates the gating multilayer perceptron and linear attention mechanisms.

Mol Plant. 2025 Mar 3;18(3):527-549. doi: 10.1016/j.molp.2025.01.020. Epub 2025 Jan 28.

A transformer-based genomic prediction method fused with knowledge-guided module.

Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad438.

Deep learning versus parametric and ensemble methods for genomic prediction of complex phenotypes.

Genet Sel Evol. 2020 Feb 24;52(1):12. doi: 10.1186/s12711-020-00531-z.

Sub-sampling graph neural networks for genomic prediction of quantitative phenotypes.

G3 (Bethesda). 2024 Nov 6;14(11). doi: 10.1093/g3journal/jkae216.

本文引用的文献

Identifying latent genetic interactions in genome-wide association studies using multiple traits.

Genome Med. 2024 Apr 25;16(1):62. doi: 10.1186/s13073-024-01329-0.

Predicting emerging drug interactions using GNNs.

Nat Comput Sci. 2023 Dec;3(12):1007-1008. doi: 10.1038/s43588-023-00555-7.

A transformer-based genomic prediction method fused with knowledge-guided module.

Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad438.

CropGS-Hub: a comprehensive database of genotype and phenotype resources for genomic prediction in major crops.

Nucleic Acids Res. 2024 Jan 5;52(D1):D1519-D1529. doi: 10.1093/nar/gkad1062.

Exploring the potential of incremental feature selection to improve genomic prediction accuracy.

Genet Sel Evol. 2023 Nov 9;55(1):78. doi: 10.1186/s12711-023-00853-8.

Metabolomic-genomic prediction can improve prediction accuracy of breeding values for malting quality traits in barley.

Genet Sel Evol. 2023 Sep 5;55(1):61. doi: 10.1186/s12711-023-00835-w.

deepGBLUP: joint deep learning networks and GBLUP framework for accurate genomic prediction of complex traits in Korean native cattle.

Genet Sel Evol. 2023 Jul 31;55(1):56. doi: 10.1186/s12711-023-00825-y.

Application of machine learning to explore the genomic prediction accuracy of fall dormancy in autotetraploid alfalfa.

Hortic Res. 2022 Oct 7;10(1):uhac225. doi: 10.1093/hr/uhac225. eCollection 2023.

Improved Prediction Model of Protein and Peptide Toxicity by Integrating Channel Attention into a Convolutional Neural Network and Gated Recurrent Units.

ACS Omega. 2022 Oct 27;7(44):40569-40577. doi: 10.1021/acsomega.2c05881. eCollection 2022 Nov 8.

DNNGP, a deep neural network-based method for genomic prediction using multi-omics data in plants.

Mol Plant. 2023 Jan 2;16(1):279-293. doi: 10.1016/j.molp.2022.11.004. Epub 2022 Nov 10.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于植物基因组预测的生物先验知识嵌入深度神经网络

Biological Prior Knowledge-Embedded Deep Neural Network for Plant Genomic Prediction.

作者信息

Ye Chonghang, Li Kai, Sun Weicheng, Jiang Yiwei, Zhang Weihan, Zhang Ping, Hu Yi-Juan, Han Yuepeng, Li Li

机构信息

Agricultural Bioinformatics Key Laboratory of Hubei Province, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.

Hubei Hongshan Laboratory, Wuhan 430070, China.

出版信息

Genes (Basel). 2025 Mar 31;16(4):411. doi: 10.3390/genes16040411.

DOI:10.3390/genes16040411

PMID:40282370

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12027452/

Abstract

摘要

用于植物基因组预测的生物先验知识嵌入深度神经网络

Biological Prior Knowledge-Embedded Deep Neural Network for Plant Genomic Prediction.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

用于植物基因组预测的生物先验知识嵌入深度神经网络

Biological Prior Knowledge-Embedded Deep Neural Network for Plant Genomic Prediction.

作者信息

机构信息

出版信息

相似文献

本文引用的文献