• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DeepCGP:一种用于压缩全基因组多态性以预测水稻表型的深度学习方法。

DeepCGP: A Deep Learning Method to Compress Genome-Wide Polymorphisms for Predicting Phenotype of Rice.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2023 May-Jun;20(3):2078-2088. doi: 10.1109/TCBB.2022.3231466. Epub 2023 Jun 5.

DOI:10.1109/TCBB.2022.3231466
PMID:37018338
Abstract

Genomic selection (GS) is expected to accelerate plant and animal breeding. During the last decade, genome-wide polymorphism data have increased, which has raised concerns about storage cost and computational time. Several individual studies have attempted to compress the genome data and predict phenotypes. However, compression models lack adequate quality of data after compression, and prediction models are time consuming and use original data to predict the phenotype. Therefore, a combined application of compression and genomic prediction modeling using deep learning could resolve these limitations. A Deep Learning Compression-based Genomic Prediction (DeepCGP) model that can compress genome-wide polymorphism data and predict phenotypes of a target trait from compressed information was proposed. The DeepCGP model contained two parts: (i) an autoencoder model based on deep neural networks to compress genome-wide polymorphism data, and (ii) regression models based on random forests (RF), genomic best linear unbiased prediction (GBLUP), and Bayesian variable selection (BayesB) to predict phenotypes from compressed information. Two datasets with genome-wide marker genotypes and target trait phenotypes in rice were applied. The DeepCGP model obtained up to 99% prediction accuracy to the maximum for a trait after 98% compression. BayesB required extensive computational time among the three methods, and showed the highest accuracy; however, BayesB could only be used with compressed data. Overall, DeepCGP outperformed state-of-the-art methods in terms of both compression and prediction. Our code and data are available at https://github.com/tanzilamohita/DeepCGP.

摘要

基因组选择(GS)有望加速植物和动物的育种。在过去的十年中,全基因组多态性数据有所增加,这引起了人们对存储成本和计算时间的关注。已有几项研究试图压缩基因组数据并预测表型。然而,压缩模型在压缩后缺乏足够的数据质量,预测模型耗时且使用原始数据来预测表型。因此,深度学习结合压缩和基因组预测建模可以解决这些限制。提出了一种基于深度学习的压缩基因组预测(DeepCGP)模型,该模型可以压缩全基因组多态性数据,并从压缩信息中预测目标性状的表型。DeepCGP 模型包含两部分:(i)基于深度神经网络的自动编码器模型,用于压缩全基因组多态性数据,以及(ii)基于随机森林(RF)、基因组最佳线性无偏预测(GBLUP)和贝叶斯变量选择(BayesB)的回归模型,用于从压缩信息中预测表型。应用了两个具有全基因组标记基因型和水稻目标性状表型的数据集。DeepCGP 模型在 98%的压缩率下,最大可达 99%的预测精度。在这三种方法中,BayesB 所需的计算时间最长,准确性最高;但是,BayesB 只能用于压缩数据。总体而言,DeepCGP 在压缩和预测方面均优于最先进的方法。我们的代码和数据可在 https://github.com/tanzilamohita/DeepCGP 上获得。

相似文献

1
DeepCGP: A Deep Learning Method to Compress Genome-Wide Polymorphisms for Predicting Phenotype of Rice.DeepCGP:一种用于压缩全基因组多态性以预测水稻表型的深度学习方法。
IEEE/ACM Trans Comput Biol Bioinform. 2023 May-Jun;20(3):2078-2088. doi: 10.1109/TCBB.2022.3231466. Epub 2023 Jun 5.
2
deepGBLUP: joint deep learning networks and GBLUP framework for accurate genomic prediction of complex traits in Korean native cattle.深度 GBLUP:联合深度学习网络和 GBLUP 框架,用于准确预测韩国本土牛复杂性状的基因组。
Genet Sel Evol. 2023 Jul 31;55(1):56. doi: 10.1186/s12711-023-00825-y.
3
Deep learning versus parametric and ensemble methods for genomic prediction of complex phenotypes.深度学习与参数化和集成方法在复杂表型基因组预测中的比较。
Genet Sel Evol. 2020 Feb 24;52(1):12. doi: 10.1186/s12711-020-00531-z.
4
Genomic predictions can accelerate selection for resistance against Piscirickettsia salmonis in Atlantic salmon (Salmo salar).基因组预测可以加速大西洋鲑(Salmo salar)对鲑鱼立克次氏体抗性的选育。
BMC Genomics. 2017 Jan 31;18(1):121. doi: 10.1186/s12864-017-3487-y.
5
Accuracy of genomic selection for a sib-evaluated trait using identity-by-state and identity-by-descent relationships.利用状态一致性和系谱一致性关系对同胞评估性状进行基因组选择的准确性。
Genet Sel Evol. 2015 Feb 25;47(1):9. doi: 10.1186/s12711-014-0084-2.
6
Interpretable artificial neural networks incorporating Bayesian alphabet models for genome-wide prediction and association studies.基于贝叶斯字母模型的可解释人工神经网络在全基因组预测和关联研究中的应用。
G3 (Bethesda). 2021 Sep 27;11(10). doi: 10.1093/g3journal/jkab228.
7
Accuracy of predicting genomic breeding values for residual feed intake in Angus and Charolais beef cattle.预测 Angus 和夏洛莱肉牛剩余采食量的基因组育种值的准确性。
J Anim Sci. 2013 Oct;91(10):4669-78. doi: 10.2527/jas.2013-5715.
8
Genome-wide prediction using Bayesian additive regression trees.使用贝叶斯加法回归树进行全基因组预测。
Genet Sel Evol. 2016 Jun 10;48(1):42. doi: 10.1186/s12711-016-0219-8.
9
Comparison of genomic predictions using genomic relationship matrices built with different weighting factors to account for locus-specific variances.使用基于不同加权因子构建的基因组关系矩阵来考虑位点特异性方差的基因组预测比较。
J Dairy Sci. 2014 Oct;97(10):6547-59. doi: 10.3168/jds.2014-8210. Epub 2014 Aug 14.
10
Genome-enabled prediction of meat and carcass traits using Bayesian regression, single-step genomic best linear unbiased prediction and blending methods in Nelore cattle.利用贝叶斯回归、单步基因组最佳线性无偏预测和混合方法,基于基因组预测牛肉和胴体性状在尼洛拉牛中的应用。
Animal. 2021 Jan;15(1):100006. doi: 10.1016/j.animal.2020.100006. Epub 2020 Dec 10.

引用本文的文献

1
Opportunities and Challenges in Applying AI to Evolutionary Morphology.将人工智能应用于进化形态学的机遇与挑战。
Integr Org Biol. 2024 Sep 23;6(1):obae036. doi: 10.1093/iob/obae036. eCollection 2024.