• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

G2PDeep:一个基于网络的深度学习框架,用于定量表型预测和基因组标记的发现。

G2PDeep: a web-based deep-learning framework for quantitative phenotype prediction and discovery of genomic markers.

机构信息

Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, USA.

Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, USA.

出版信息

Nucleic Acids Res. 2021 Jul 2;49(W1):W228-W236. doi: 10.1093/nar/gkab407.

DOI:10.1093/nar/gkab407
PMID:34037802
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8262736/
Abstract

G2PDeep is an open-access web server, which provides a deep-learning framework for quantitative phenotype prediction and discovery of genomics markers. It uses zygosity or single nucleotide polymorphism (SNP) information from plants and animals as the input to predict quantitative phenotype of interest and genomic markers associated with phenotype. It provides a one-stop-shop platform for researchers to create deep-learning models through an interactive web interface and train these models with uploaded data, using high-performance computing resources plugged at the backend. G2PDeep also provides a series of informative interfaces to monitor the training process and compare the performance among the trained models. The trained models can then be deployed automatically. The quantitative phenotype and genomic markers are predicted using a user-selected trained model and the results are visualized. Our state-of-the-art model has been benchmarked and demonstrated competitive performance in quantitative phenotype predictions by other researchers. In addition, the server integrates the soybean nested association mapping (SoyNAM) dataset with five phenotypes, including grain yield, height, moisture, oil, and protein. A publicly available dataset for seed protein and oil content has also been integrated into the server. The G2PDeep server is publicly available at http://g2pdeep.org. The Python-based deep-learning model is available at https://github.com/shuaizengMU/G2PDeep_model.

摘要

G2PDeep 是一个开放获取的网络服务器,它为定量表型预测和基因组标记物的发现提供了一个深度学习框架。它使用植物和动物的同卵性或单核苷酸多态性(SNP)信息作为输入,来预测感兴趣的定量表型和与表型相关的基因组标记物。它为研究人员提供了一个一站式平台,通过交互式网页界面创建深度学习模型,并使用后端插入的高性能计算资源来训练这些模型。G2PDeep 还提供了一系列有用的接口,用于监控训练过程并比较训练模型之间的性能。然后可以自动部署训练好的模型。使用用户选择的训练模型预测定量表型和基因组标记物,并可视化结果。我们的最先进的模型已经经过基准测试,并在其他研究人员的定量表型预测中展示了有竞争力的性能。此外,该服务器还集成了大豆嵌套关联作图(SoyNAM)数据集,其中包含五个表型,包括籽粒产量、株高、水分、油分和蛋白质。还集成了一个公开可用的种子蛋白质和油分含量数据集。G2PDeep 服务器可在 http://g2pdeep.org 上访问。基于 Python 的深度学习模型可在 https://github.com/shuaizengMU/G2PDeep_model 上获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c597/8262736/b7b080c13f22/gkab407fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c597/8262736/54491dfea550/gkab407gra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c597/8262736/8677a61cfe6f/gkab407fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c597/8262736/07392ffd60ef/gkab407fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c597/8262736/3abe3bb674e9/gkab407fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c597/8262736/dae1e8bdbf6b/gkab407fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c597/8262736/b7b080c13f22/gkab407fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c597/8262736/54491dfea550/gkab407gra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c597/8262736/8677a61cfe6f/gkab407fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c597/8262736/07392ffd60ef/gkab407fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c597/8262736/3abe3bb674e9/gkab407fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c597/8262736/dae1e8bdbf6b/gkab407fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c597/8262736/b7b080c13f22/gkab407fig5.jpg

相似文献

1
G2PDeep: a web-based deep-learning framework for quantitative phenotype prediction and discovery of genomic markers.G2PDeep:一个基于网络的深度学习框架,用于定量表型预测和基因组标记的发现。
Nucleic Acids Res. 2021 Jul 2;49(W1):W228-W236. doi: 10.1093/nar/gkab407.
2
G2PDeep-v2: a web-based deep-learning framework for phenotype prediction and biomarker discovery using multi-omics data.G2PDeep-v2:一个基于网络的深度学习框架,用于使用多组学数据进行表型预测和生物标志物发现。
bioRxiv. 2024 Sep 13:2024.09.10.612292. doi: 10.1101/2024.09.10.612292.
3
SoyDNGP: a web-accessible deep learning framework for genomic prediction in soybean breeding.SoyDNGP:一个用于大豆育种中基因组预测的可访问网络的深度学习框架。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad349.
4
GMStool: GWAS-based marker selection tool for genomic prediction from genomic data.GMStool:基于 GWAS 的基因组预测标记选择工具,用于从基因组数据中进行基因组预测。
Sci Rep. 2020 Nov 12;10(1):19653. doi: 10.1038/s41598-020-76759-y.
5
PGen: large-scale genomic variations analysis workflow and browser in SoyKB.PGen:大豆知识库中的大规模基因组变异分析工作流程与浏览器
BMC Bioinformatics. 2016 Oct 6;17(Suppl 13):337. doi: 10.1186/s12859-016-1227-y.
6
MusiteDeep: a deep-learning based webserver for protein post-translational modification site prediction and visualization.MusiteDeep:一个基于深度学习的蛋白质翻译后修饰位点预测和可视化的网络服务器。
Nucleic Acids Res. 2020 Jul 2;48(W1):W140-W146. doi: 10.1093/nar/gkaa275.
7
Operon Finder: A Deep Learning-based Web Server for Accurate Prediction of Prokaryotic Operons.操纵子预测器:一个基于深度学习的用于准确预测原核生物操纵子的网络服务器。
J Mol Biol. 2023 Jul 15;435(14):167921. doi: 10.1016/j.jmb.2022.167921. Epub 2022 Dec 14.
8
NRIMD, a Web Server for Analyzing Protein Allosteric Interactions Based on Molecular Dynamics Simulation.NRIMD,一个基于分子动力学模拟分析蛋白质别构相互作用的网络服务器。
J Chem Inf Model. 2024 Oct 14;64(19):7176-7183. doi: 10.1021/acs.jcim.4c00783. Epub 2024 Jul 11.
9
Impact of imputation methods on the amount of genetic variation captured by a single-nucleotide polymorphism panel in soybeans.插补方法对大豆单核苷酸多态性面板捕获的遗传变异量的影响。
BMC Bioinformatics. 2016 Feb 2;17:55. doi: 10.1186/s12859-016-0899-7.
10
Phylesystem: a git-based data store for community-curated phylogenetic estimates.系统发育体系:一个基于Git的用于社区策划系统发育估计的数据存储库。
Bioinformatics. 2015 Sep 1;31(17):2794-800. doi: 10.1093/bioinformatics/btv276. Epub 2015 May 4.

引用本文的文献

1
Inferring genotype-phenotype maps using attention models.使用注意力模型推断基因型-表型图谱。
ArXiv. 2025 Apr 14:arXiv:2504.10388v1.
2
DPImpute: A Genotype Imputation Framework for Ultra-Low Coverage Whole-Genome Sequencing and its Application in Genomic Selection.DPImpute:一种用于超低覆盖度全基因组测序的基因型填充框架及其在基因组选择中的应用。
Adv Sci (Weinh). 2025 Apr;12(16):e2412482. doi: 10.1002/advs.202412482. Epub 2025 Feb 27.
3
G2PDeep-v2: a web-based deep-learning framework for phenotype prediction and biomarker discovery for all organisms using multi-omics data.

本文引用的文献

1
A Population Structure and Genome-Wide Association Analysis on the USDA Soybean Germplasm Collection.美国农业部大豆种质资源库的群体结构与全基因组关联分析
Plant Genome. 2015 Nov;8(3):eplantgenome2015.04.0024. doi: 10.3835/plantgenome2015.04.0024.
2
KEGG: integrating viruses and cellular organisms.KEGG:整合病毒和细胞生物。
Nucleic Acids Res. 2021 Jan 8;49(D1):D545-D551. doi: 10.1093/nar/gkaa970.
3
Pfam: The protein families database in 2021.Pfam:2021 年的蛋白质家族数据库。
G2PDeep-v2:一个基于网络的深度学习框架,用于利用多组学数据对所有生物体进行表型预测和生物标志物发现。
Res Sq. 2025 Jan 9:rs.3.rs-5776937. doi: 10.21203/rs.3.rs-5776937/v1.
4
Soybean genomics research community strategic plan: A vision for 2024-2028.大豆基因组学研究共同体战略计划:2024 - 2028年愿景
Plant Genome. 2024 Dec;17(4):e20516. doi: 10.1002/tpg2.20516. Epub 2024 Nov 21.
5
Population-wide DNA methylation polymorphisms at single-nucleotide resolution in 207 cotton accessions reveal epigenomic contributions to complex traits.207份棉花种质资源在单核苷酸分辨率下的全群体DNA甲基化多态性揭示了表观基因组对复杂性状的贡献。
Cell Res. 2024 Dec;34(12):859-872. doi: 10.1038/s41422-024-01027-x. Epub 2024 Oct 17.
6
G2PDeep-v2: a web-based deep-learning framework for phenotype prediction and biomarker discovery using multi-omics data.G2PDeep-v2:一个基于网络的深度学习框架,用于使用多组学数据进行表型预测和生物标志物发现。
bioRxiv. 2024 Sep 13:2024.09.10.612292. doi: 10.1101/2024.09.10.612292.
7
An overview of recent technological developments in bovine genomics.牛基因组学近期技术发展概述。
Vet Anim Sci. 2024 Jul 23;25:100382. doi: 10.1016/j.vas.2024.100382. eCollection 2024 Sep.
8
TrG2P: A transfer-learning-based tool integrating multi-trait data for accurate prediction of crop yield.TrG2P:一种基于迁移学习的工具,集成多性状数据,用于准确预测作物产量。
Plant Commun. 2024 Jul 8;5(7):100975. doi: 10.1016/j.xplc.2024.100975. Epub 2024 May 15.
9
A review of machine learning models applied to genomic prediction in animal breeding.应用于动物育种基因组预测的机器学习模型综述。
Front Genet. 2023 Sep 6;14:1150596. doi: 10.3389/fgene.2023.1150596. eCollection 2023.
10
Predicting phenotypes from novel genomic markers using deep learning.利用深度学习从新型基因组标记预测表型。
Bioinform Adv. 2023 Mar 9;3(1):vbad028. doi: 10.1093/bioadv/vbad028. eCollection 2023.
Nucleic Acids Res. 2021 Jan 8;49(D1):D412-D419. doi: 10.1093/nar/gkaa913.
4
Phenotype Prediction and Genome-Wide Association Study Using Deep Convolutional Neural Network of Soybean.基于深度卷积神经网络的大豆表型预测与全基因组关联研究
Front Genet. 2019 Nov 22;10:1091. doi: 10.3389/fgene.2019.01091. eCollection 2019.
5
A deep convolutional neural network approach for predicting phenotypes from genotypes.一种基于深度卷积神经网络的基因型到表型预测方法。
Planta. 2018 Nov;248(5):1307-1318. doi: 10.1007/s00425-018-2976-9. Epub 2018 Aug 12.
6
Genome-Wide Analysis of Grain Yield Stability and Environmental Interactions in a Multiparental Soybean Population.多亲本大豆群体中籽粒产量稳定性及与环境互作的全基因组分析
G3 (Bethesda). 2018 Feb 2;8(2):519-529. doi: 10.1534/g3.117.300300.
7
Genetic Characterization of the Soybean Nested Association Mapping Population.大豆嵌套关联作图群体的遗传特征分析。
Plant Genome. 2017 Jul;10(2). doi: 10.3835/plantgenome2016.10.0109.
8
The iPlant Collaborative: Cyberinfrastructure for Enabling Data to Discovery for the Life Sciences.iPlant协作组织:用于推动生命科学领域从数据到发现的网络基础设施。
PLoS Biol. 2016 Jan 11;14(1):e1002342. doi: 10.1371/journal.pbio.1002342. eCollection 2016 Jan.
9
Genomic selection and association mapping in rice (Oryza sativa): effect of trait genetic architecture, training population composition, marker number and statistical model on accuracy of rice genomic selection in elite, tropical rice breeding lines.水稻(Oryza sativa)的基因组选择与关联图谱分析:性状遗传结构、训练群体组成、标记数量及统计模型对优质热带水稻育种系基因组选择准确性的影响
PLoS Genet. 2015 Feb 17;11(2):e1004982. doi: 10.1371/journal.pgen.1004982. eCollection 2015 Feb.
10
The iPlant Collaborative: Cyberinfrastructure for Plant Biology.i 植物协作组:植物生物学的网络基础设施。
Front Plant Sci. 2011 Jul 25;2:34. doi: 10.3389/fpls.2011.00034. eCollection 2011.