A Guide for Using Deep Learning for Complex Trait Genomic Prediction.

Affiliations

Catalan Institution for Research and Advanced Studies (ICREA), Passeig de Lluís Companys 23, 08010 Barcelona, Spain.

Centre for Research in Agricultural Genomics (CRAG), CSIC-IRTA-UAB-UB, 08193 Bellaterra, Barcelona, Spain.

Publication information

Genes (Basel). 2019 Jul 20;10(7):553. doi: 10.3390/genes10070553.

DOI:10.3390/genes10070553
PMID:31330861
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6678200/

Abstract

Deep learning (DL) has emerged as a powerful tool to make accurate predictions from complex data such as image, text, or video. However, its ability to predict phenotypic values from molecular data is less well studied. Here, we describe the theoretical foundations of DL and provide a generic code that can be easily modified to suit specific needs. DL comprises a wide variety of algorithms which depend on numerous hyperparameters. Careful optimization of hyperparameter values is critical to avoid overfitting. Among the DL architectures currently tested in genomic prediction, convolutional neural networks (CNNs) seem more promising than multilayer perceptrons (MLPs). A limitation of DL is in interpreting the results. This may not be relevant for genomic prediction in plant or animal breeding but can be critical when deciding the genetic risk to a disease. Although DL technologies are not "plug-and-play", they are easily implemented using Keras and TensorFlow public software. To illustrate the principles described here, we implemented a Keras-based code in GitHub.
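The abstract states that CNNs seem more promising than MLPs for genomic prediction but does not say why. One structural difference that is often pointed to is weight sharing: a fully connected first layer needs one weight per marker per unit, whereas a 1D convolutional layer reuses the same small kernel at every position. The sketch below only counts trainable parameters for the first layer under that reading. The marker count, layer width, and kernel size are illustrative assumptions, not values taken from the paper or its GitHub code.

```python
def dense_params(n_inputs: int, n_units: int) -> int:
    """Trainable parameters of a fully connected layer: weights plus biases."""
    return n_inputs * n_units + n_units


def conv1d_params(kernel_size: int, in_channels: int, n_filters: int) -> int:
    """Trainable parameters of a 1D convolutional layer.

    The kernel is shared across all positions, so the count does not
    depend on the length of the input sequence.
    """
    return kernel_size * in_channels * n_filters + n_filters


def conv1d_output_length(n_inputs: int, kernel_size: int, stride: int = 1) -> int:
    """Output length of a 1D convolution with no padding."""
    return (n_inputs - kernel_size) // stride + 1


if __name__ == "__main__":
    n_snps = 10_000   # assumed number of markers per individual
    n_units = 64      # assumed width of the first MLP layer
    kernel_size = 3   # assumed convolution window, in markers
    n_filters = 64    # assumed number of convolution filters

    print("MLP first layer:", dense_params(n_snps, n_units))
    print("CNN first layer:", conv1d_params(kernel_size, 1, n_filters))
    print("CNN output length:", conv1d_output_length(n_snps, kernel_size))
```

The convolutional count stays fixed as the number of markers grows, while the dense count grows linearly with it. Note that this covers the first layer only: layers placed after the convolution (for example a dense layer on the flattened output) can still be large, so the total depends on the full architecture and its hyperparameters.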

Figures:
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9dc/6678200/5c033bc908c9/genes-10-00553-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9dc/6678200/7c645dc8221f/genes-10-00553-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9dc/6678200/9e3b2173ee4b/genes-10-00553-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9dc/6678200/cf813a88db4c/genes-10-00553-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9dc/6678200/577e7e15f1b0/genes-10-00553-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9dc/6678200/7772f036852d/genes-10-00553-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e9dc/6678200/c0c8833335a7/genes-10-00553-g007.jpg
