• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于植物基因组预测的生物先验知识嵌入深度神经网络

Biological Prior Knowledge-Embedded Deep Neural Network for Plant Genomic Prediction.

作者信息

Ye Chonghang, Li Kai, Sun Weicheng, Jiang Yiwei, Zhang Weihan, Zhang Ping, Hu Yi-Juan, Han Yuepeng, Li Li

机构信息

Agricultural Bioinformatics Key Laboratory of Hubei Province, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.

Hubei Hongshan Laboratory, Wuhan 430070, China.

出版信息

Genes (Basel). 2025 Mar 31;16(4):411. doi: 10.3390/genes16040411.

DOI:10.3390/genes16040411
PMID:40282370
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12027452/
Abstract

Genomic prediction is a powerful approach that predicts phenotypic traits from genotypic information, enabling the acceleration of trait improvement in plant breeding. Traditional genomic prediction methods have primarily relied on linear mixed models, such as Genomic Best Linear Unbiased Prediction (GBLUP), and conventional machine learning methods like Support Vector Regression (SVR). Traditional methods are limited in handling high-dimensional data and nonlinear relationships. Thus, deep learning methods have also been applied to genomic prediction in recent years. We proposed iADEP, Integrated Additive, Dominant, and Epistatic Prediction model based on deep learning. Specifically, single nucleotide polymorphism (SNP) data integrating latent genetic interactions and genome-wide association study results as biological prior knowledge are fused to an SNP embedding block, which is then input to a local encoder. The local encoder is fused with an omic-data-incorporated global decoder through a multi-head attention mechanism, followed by multilayer perceptrons. : Firstly, we demonstrated through experiments on four datasets that iADEP outperforms existing methods in genotype-to-phenotype prediction. Secondly, we validated the effectiveness of SNP embedding through ablation experiments. Third, we provided an available module for combining other omics data in iADEP and propose a novel method for fusing them. Fourthly, we explored the impact of feature selection on iADEP performance and conclude that utilizing the full set of SNPs generally provides optimal results. Finally, by altering the partition of training and testing sets, we investigated the differences between transductive learning and inductive learning. iADEP provides a new approach for AI breeding, a promising method that integrates biological prior knowledge and enables combination with other omics data.

摘要

基因组预测是一种强大的方法,它能根据基因型信息预测表型性状,从而加速植物育种中的性状改良。传统的基因组预测方法主要依赖线性混合模型,如基因组最佳线性无偏预测(GBLUP),以及传统机器学习方法,如支持向量回归(SVR)。传统方法在处理高维数据和非线性关系方面存在局限性。因此,近年来深度学习方法也被应用于基因组预测。我们提出了iADEP,即基于深度学习的整合加性、显性和上位性预测模型。具体而言,将整合潜在遗传相互作用和全基因组关联研究结果作为生物学先验知识的单核苷酸多态性(SNP)数据融合到一个SNP嵌入模块中,然后将其输入到一个局部编码器。局部编码器通过多头注意力机制与一个整合了组学数据的全局解码器融合,随后接多层感知器。首先,我们通过在四个数据集上的实验证明,iADEP在基因型到表型的预测方面优于现有方法。其次,我们通过消融实验验证了SNP嵌入的有效性。第三,我们在iADEP中提供了一个用于组合其他组学数据的可用模块,并提出了一种融合它们的新方法。第四,我们探讨了特征选择对iADEP性能的影响,并得出结论,使用全套SNP通常能提供最佳结果。最后,通过改变训练集和测试集的划分,我们研究了转导学习和归纳学习之间的差异。iADEP为人工智能育种提供了一种新方法,这是一种有前景的方法,它整合了生物学先验知识,并能够与其他组学数据相结合。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a7e/12027452/4f7b9e175d2d/genes-16-00411-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a7e/12027452/84379aea52fc/genes-16-00411-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a7e/12027452/a03977ed955a/genes-16-00411-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a7e/12027452/fa524b9925e6/genes-16-00411-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a7e/12027452/5bc6c82fc9f2/genes-16-00411-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a7e/12027452/f3e858cc907d/genes-16-00411-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a7e/12027452/4f7b9e175d2d/genes-16-00411-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a7e/12027452/84379aea52fc/genes-16-00411-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a7e/12027452/a03977ed955a/genes-16-00411-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a7e/12027452/fa524b9925e6/genes-16-00411-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a7e/12027452/5bc6c82fc9f2/genes-16-00411-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a7e/12027452/f3e858cc907d/genes-16-00411-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a7e/12027452/4f7b9e175d2d/genes-16-00411-g006.jpg

相似文献

1
Biological Prior Knowledge-Embedded Deep Neural Network for Plant Genomic Prediction.用于植物基因组预测的生物先验知识嵌入深度神经网络
Genes (Basel). 2025 Mar 31;16(4):411. doi: 10.3390/genes16040411.
2
AutoGP: An intelligent breeding platform for enhancing maize genomic selection.AutoGP:一个用于加强玉米基因组选择的智能育种平台。
Plant Commun. 2025 Apr 14;6(4):101240. doi: 10.1016/j.xplc.2025.101240. Epub 2025 Jan 8.
3
DNNGP, a deep neural network-based method for genomic prediction using multi-omics data in plants.DNNGP,一种基于深度神经网络的方法,用于利用植物中的多组学数据进行基因组预测。
Mol Plant. 2023 Jan 2;16(1):279-293. doi: 10.1016/j.molp.2022.11.004. Epub 2022 Nov 10.
4
Genomic prediction with NetGP based on gene network and multi-omics data in plants.基于植物基因网络和多组学数据的NetGP基因组预测
Plant Biotechnol J. 2025 Apr;23(4):1190-1201. doi: 10.1111/pbi.14577. Epub 2025 Feb 14.
5
Genomic prediction using information across years with epistatic models and dimension reduction via haplotype blocks.利用上位性模型和单倍型块进行降维,在多年信息上进行基因组预测。
PLoS One. 2023 Mar 31;18(3):e0282288. doi: 10.1371/journal.pone.0282288. eCollection 2023.
6
TrG2P: A transfer-learning-based tool integrating multi-trait data for accurate prediction of crop yield.TrG2P:一种基于迁移学习的工具,集成多性状数据,用于准确预测作物产量。
Plant Commun. 2024 Jul 8;5(7):100975. doi: 10.1016/j.xplc.2024.100975. Epub 2024 May 15.
7
GEFormer: A genotype-environment interaction-based genomic prediction method that integrates the gating multilayer perceptron and linear attention mechanisms.GEFormer:一种基于基因型-环境相互作用的基因组预测方法,该方法整合了门控多层感知器和线性注意力机制。
Mol Plant. 2025 Mar 3;18(3):527-549. doi: 10.1016/j.molp.2025.01.020. Epub 2025 Jan 28.
8
A transformer-based genomic prediction method fused with knowledge-guided module.基于变压器的融合知识引导模块的基因组预测方法。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad438.
9
Deep learning versus parametric and ensemble methods for genomic prediction of complex phenotypes.深度学习与参数化和集成方法在复杂表型基因组预测中的比较。
Genet Sel Evol. 2020 Feb 24;52(1):12. doi: 10.1186/s12711-020-00531-z.
10
Sub-sampling graph neural networks for genomic prediction of quantitative phenotypes.基于子采样图神经网络的数量性状基因组预测。
G3 (Bethesda). 2024 Nov 6;14(11). doi: 10.1093/g3journal/jkae216.

本文引用的文献

1
Identifying latent genetic interactions in genome-wide association studies using multiple traits.利用多种性状在全基因组关联研究中识别潜在的遗传相互作用。
Genome Med. 2024 Apr 25;16(1):62. doi: 10.1186/s13073-024-01329-0.
2
Predicting emerging drug interactions using GNNs.使用图神经网络预测新出现的药物相互作用。
Nat Comput Sci. 2023 Dec;3(12):1007-1008. doi: 10.1038/s43588-023-00555-7.
3
A transformer-based genomic prediction method fused with knowledge-guided module.基于变压器的融合知识引导模块的基因组预测方法。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad438.
4
CropGS-Hub: a comprehensive database of genotype and phenotype resources for genomic prediction in major crops.作物 GS-Hub:主要作物基因组预测中基因型和表型资源的综合数据库。
Nucleic Acids Res. 2024 Jan 5;52(D1):D1519-D1529. doi: 10.1093/nar/gkad1062.
5
Exploring the potential of incremental feature selection to improve genomic prediction accuracy.探索增量特征选择提高基因组预测准确性的潜力。
Genet Sel Evol. 2023 Nov 9;55(1):78. doi: 10.1186/s12711-023-00853-8.
6
Metabolomic-genomic prediction can improve prediction accuracy of breeding values for malting quality traits in barley.代谢组学-基因组预测可以提高大麦酿造品质性状的育种值预测准确性。
Genet Sel Evol. 2023 Sep 5;55(1):61. doi: 10.1186/s12711-023-00835-w.
7
deepGBLUP: joint deep learning networks and GBLUP framework for accurate genomic prediction of complex traits in Korean native cattle.深度 GBLUP:联合深度学习网络和 GBLUP 框架,用于准确预测韩国本土牛复杂性状的基因组。
Genet Sel Evol. 2023 Jul 31;55(1):56. doi: 10.1186/s12711-023-00825-y.
8
Application of machine learning to explore the genomic prediction accuracy of fall dormancy in autotetraploid alfalfa.应用机器学习探索同源四倍体苜蓿秋季休眠的基因组预测准确性。
Hortic Res. 2022 Oct 7;10(1):uhac225. doi: 10.1093/hr/uhac225. eCollection 2023.
9
Improved Prediction Model of Protein and Peptide Toxicity by Integrating Channel Attention into a Convolutional Neural Network and Gated Recurrent Units.通过将通道注意力机制集成到卷积神经网络和门控循环单元中改进蛋白质和肽毒性的预测模型
ACS Omega. 2022 Oct 27;7(44):40569-40577. doi: 10.1021/acsomega.2c05881. eCollection 2022 Nov 8.
10
DNNGP, a deep neural network-based method for genomic prediction using multi-omics data in plants.DNNGP,一种基于深度神经网络的方法,用于利用植物中的多组学数据进行基因组预测。
Mol Plant. 2023 Jan 2;16(1):279-293. doi: 10.1016/j.molp.2022.11.004. Epub 2022 Nov 10.