• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

数据增强提高了基于植物基因组的预测能力。

Data Augmentation Enhances Plant-Genomic-Enabled Predictions.

机构信息

Facultad de Telemática, Universidad de Colima, Colima 28040, Colima, Mexico.

International Maize and Wheat Improvement Center (CIMMYT), Km 45, Carretera Mexico-Veracruz, Texcoco 52640, Edo. de México, Mexico.

出版信息

Genes (Basel). 2024 Feb 24;15(3):286. doi: 10.3390/genes15030286.

DOI:10.3390/genes15030286
PMID:38540344
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10969940/
Abstract

Genomic selection (GS) is revolutionizing plant breeding. However, its practical implementation is still challenging, since there are many factors that affect its accuracy. For this reason, this research explores data augmentation with the goal of improving its accuracy. Deep neural networks with data augmentation (DA) generate synthetic data from the original training set to increase the training set and to improve the prediction performance of any statistical or machine learning algorithm. There is much empirical evidence of their success in many computer vision applications. Due to this, DA was explored in the context of GS using 14 real datasets. We found empirical evidence that DA is a powerful tool to improve the prediction accuracy, since we improved the prediction accuracy of the top lines in the 14 datasets under study. On average, across datasets and traits, the gain in prediction performance of the DA approach regarding the Conventional method in the top 20% of lines in the testing set was 108.4% in terms of the NRMSE and 107.4% in terms of the MAAPE, but a worse performance was observed on the whole testing set. We encourage more empirical evaluations to support our findings.

摘要

基因组选择 (GS) 正在彻底改变植物育种。然而,其实际实施仍然具有挑战性,因为有许多因素会影响其准确性。出于这个原因,本研究探讨了数据扩充,以提高其准确性。具有数据扩充 (DA) 的深度神经网络从原始训练集中生成合成数据,以增加训练集并提高任何统计或机器学习算法的预测性能。有很多经验证据表明它们在许多计算机视觉应用中取得了成功。因此,在 GS 背景下使用 14 个真实数据集探索了 DA。我们发现经验证据表明,DA 是提高预测准确性的有力工具,因为我们提高了所研究的 14 个数据集的前 10 行的预测准确性。平均而言,在数据集和性状方面,与传统方法相比,在测试集中前 20%的线中,DA 方法在 NRMSE 方面的预测性能提高了 108.4%,在 MAAPE 方面提高了 107.4%,但在整个测试集中观察到的性能更差。我们鼓励进行更多的实证评估来支持我们的发现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b82/10969940/e092d18a9e01/genes-15-00286-g006a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b82/10969940/ea91315ae88a/genes-15-00286-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b82/10969940/27d70834643a/genes-15-00286-g002a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b82/10969940/87977547b7ef/genes-15-00286-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b82/10969940/a77b748771fb/genes-15-00286-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b82/10969940/4e10769d0edd/genes-15-00286-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b82/10969940/e092d18a9e01/genes-15-00286-g006a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b82/10969940/ea91315ae88a/genes-15-00286-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b82/10969940/27d70834643a/genes-15-00286-g002a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b82/10969940/87977547b7ef/genes-15-00286-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b82/10969940/a77b748771fb/genes-15-00286-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b82/10969940/4e10769d0edd/genes-15-00286-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b82/10969940/e092d18a9e01/genes-15-00286-g006a.jpg

相似文献

1
Data Augmentation Enhances Plant-Genomic-Enabled Predictions.数据增强提高了基于植物基因组的预测能力。
Genes (Basel). 2024 Feb 24;15(3):286. doi: 10.3390/genes15030286.
2
Two simple methods to improve the accuracy of the genomic selection methodology.两种提高基因组选择方法准确性的简单方法。
BMC Genomics. 2023 Apr 26;24(1):220. doi: 10.1186/s12864-023-09294-5.
3
New Deep Learning Genomic-Based Prediction Model for Multiple Traits with Binary, Ordinal, and Continuous Phenotypes.基于深度学习的基因组的新型预测模型,用于具有二元、有序和连续表型的多个特征。
G3 (Bethesda). 2019 May 7;9(5):1545-1556. doi: 10.1534/g3.119.300585.
4
Using machine learning to realize genetic site screening and genomic prediction of productive traits in pigs.利用机器学习实现猪生产性状的遗传位点筛选和基因组预测。
FASEB J. 2023 Jun;37(6):e22961. doi: 10.1096/fj.202300245R.
5
Multi-trait, Multi-environment Deep Learning Modeling for Genomic-Enabled Prediction of Plant Traits.用于基于基因组的植物性状预测的多性状、多环境深度学习建模
G3 (Bethesda). 2018 Dec 10;8(12):3829-3840. doi: 10.1534/g3.118.200728.
6
Efficacy of plant breeding using genomic information.利用基因组信息进行植物育种的功效。
Plant Genome. 2023 Jun;16(2):e20346. doi: 10.1002/tpg2.20346. Epub 2023 May 4.
7
A Multi-Trait Gaussian Kernel Genomic Prediction Model under Three Tunning Strategies.基于三种调优策略的多性状高斯核基因组预测模型。
Genes (Basel). 2022 Dec 3;13(12):2279. doi: 10.3390/genes13122279.
8
DNNGP, a deep neural network-based method for genomic prediction using multi-omics data in plants.DNNGP,一种基于深度神经网络的方法,用于利用植物中的多组学数据进行基因组预测。
Mol Plant. 2023 Jan 2;16(1):279-293. doi: 10.1016/j.molp.2022.11.004. Epub 2022 Nov 10.
9
Accounting for Correlation Between Traits in Genomic Prediction.基因组预测中性状间相关性的考量
Methods Mol Biol. 2022;2467:285-327. doi: 10.1007/978-1-0716-2205-6_10.
10
Comparing gradient boosting machine and Bayesian threshold BLUP for genome-based prediction of categorical traits in wheat breeding.梯度提升机和贝叶斯阈值 BLUP 用于小麦育种中基于基因组的分类性状预测的比较。
Plant Genome. 2022 Sep;15(3):e20214. doi: 10.1002/tpg2.20214. Epub 2022 May 10.

引用本文的文献

1
Innovative data augmentation strategy for deep learning on biological datasets with limited gene representations focused on chloroplast genomes.针对具有有限基因表征的生物数据集(聚焦于叶绿体基因组)的深度学习创新数据增强策略。
Sci Rep. 2025 Jul 25;15(1):27079. doi: 10.1038/s41598-025-12796-9.
2
Lentil plant disease and quality assessment: A detailed dataset of high-resolution images for deep learning research.小扁豆植物病害与质量评估:用于深度学习研究的高分辨率图像详细数据集。
Data Brief. 2024 Dec 12;58:111224. doi: 10.1016/j.dib.2024.111224. eCollection 2025 Feb.

本文引用的文献

1
Genomics combined with UAS data enhances prediction of grain yield in winter wheat.基因组学与无人机航空遥感数据相结合可提高冬小麦产量预测能力。
Front Genet. 2023 Mar 29;14:1124218. doi: 10.3389/fgene.2023.1124218. eCollection 2023.
2
Genomic Selection: A Tool for Accelerating the Efficiency of Molecular Breeding for Development of Climate-Resilient Crops.基因组选择:加速培育抗逆作物分子育种效率的工具。
Front Genet. 2022 Feb 9;13:832153. doi: 10.3389/fgene.2022.832153. eCollection 2022.
3
Comparison of Different Image Data Augmentation Approaches.
不同图像数据增强方法的比较
J Imaging. 2021 Nov 27;7(12):254. doi: 10.3390/jimaging7120254.
4
Genome-enabled prediction for sparse testing in multi-environmental wheat trials.基于基因组的预测在多环境小麦试验中的稀疏测试。
Plant Genome. 2021 Nov;14(3):e20151. doi: 10.1002/tpg2.20151. Epub 2021 Sep 12.
5
Text Data Augmentation for Deep Learning.用于深度学习的文本数据增强
J Big Data. 2021;8(1):101. doi: 10.1186/s40537-021-00492-0. Epub 2021 Jul 19.
6
Correcting the predictive validity of a selection test for the effect of indirect range restriction.纠正间接范围限制对选拔测验预测效度的影响的预测效度。
BMC Med Educ. 2017 Dec 11;17(1):246. doi: 10.1186/s12909-017-1070-5.
7
Genomic Selection in Plant Breeding: Methods, Models, and Perspectives.基因组选择在植物育种中的应用:方法、模型与展望。
Trends Plant Sci. 2017 Nov;22(11):961-975. doi: 10.1016/j.tplants.2017.08.011. Epub 2017 Sep 28.
8
Genomic prediction unifies animal and plant breeding programs to form platforms for biological discovery.基因组预测将动物和植物育种计划统一起来,形成生物学发现的平台。
Nat Genet. 2017 Aug 30;49(9):1297-1303. doi: 10.1038/ng.3920.
9
Genomic Selection in the of Next Generation Sequencing for Complex Traits in Plant Breeding.植物育种中复杂性状的下一代测序背景下的基因组选择
Front Genet. 2016 Dec 27;7:221. doi: 10.3389/fgene.2016.00221. eCollection 2016.
10
Genome-wide regression and prediction with the BGLR statistical package.使用BGLR统计软件包进行全基因组回归与预测。
Genetics. 2014 Oct;198(2):483-95. doi: 10.1534/genetics.114.164442. Epub 2014 Jul 9.