• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估基因组预测中的降维方法。

Evaluating dimensionality reduction for genomic prediction.

作者信息

Manthena Vamsi, Jarquín Diego, Varshney Rajeev K, Roorkiwal Manish, Dixit Girish Prasad, Bharadwaj Chellapilla, Howard Reka

机构信息

Department of Statistics, University of Nebraska-Lincoln, Lincoln, NE, United States.

Agronomy Department, University of Florida, Gainesville, FL, United States.

出版信息

Front Genet. 2022 Oct 14;13:958780. doi: 10.3389/fgene.2022.958780. eCollection 2022.

DOI:10.3389/fgene.2022.958780
PMID:36313472
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9614092/
Abstract

The development of genomic selection (GS) methods has allowed plant breeding programs to select favorable lines using genomic data before performing field trials. Improvements in genotyping technology have yielded high-dimensional genomic marker data which can be difficult to incorporate into statistical models. In this paper, we investigated the utility of applying dimensionality reduction (DR) methods as a pre-processing step for GS methods. We compared five DR methods and studied the trend in the prediction accuracies of each method as a function of the number of features retained. The effect of DR methods was studied using three models that involved the main effects of line, environment, marker, and the genotype by environment interactions. The methods were applied on a real data set containing 315 lines phenotyped in nine environments with 26,817 markers each. Regardless of the DR method and prediction model used, only a fraction of features was sufficient to achieve maximum correlation. Our results underline the usefulness of DR methods as a key pre-processing step in GS models to improve computational efficiency in the face of ever-increasing size of genomic data.

摘要

基因组选择(GS)方法的发展使植物育种计划能够在进行田间试验之前利用基因组数据选择优良品系。基因分型技术的改进产生了高维基因组标记数据,这些数据难以纳入统计模型。在本文中,我们研究了应用降维(DR)方法作为GS方法预处理步骤的效用。我们比较了五种DR方法,并研究了每种方法的预测准确性随保留特征数量的变化趋势。使用涉及品系、环境、标记的主效应以及基因型与环境互作的三种模型研究了DR方法的效果。这些方法应用于一个真实数据集,该数据集包含在九个环境中进行表型分析的315个品系,每个品系有26,817个标记。无论使用何种DR方法和预测模型,只有一小部分特征足以实现最大相关性。我们的结果强调了DR方法作为GS模型中关键预处理步骤的有用性,以在面对不断增加的基因组数据量时提高计算效率。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bae/9614092/475f426be114/fgene-13-958780-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bae/9614092/f788f7d012ac/fgene-13-958780-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bae/9614092/66e2f73ab390/fgene-13-958780-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bae/9614092/f9d371fb2f52/fgene-13-958780-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bae/9614092/b587405ffea8/fgene-13-958780-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bae/9614092/475f426be114/fgene-13-958780-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bae/9614092/f788f7d012ac/fgene-13-958780-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bae/9614092/66e2f73ab390/fgene-13-958780-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bae/9614092/f9d371fb2f52/fgene-13-958780-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bae/9614092/b587405ffea8/fgene-13-958780-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5bae/9614092/475f426be114/fgene-13-958780-g005.jpg

相似文献

1
Evaluating dimensionality reduction for genomic prediction.评估基因组预测中的降维方法。
Front Genet. 2022 Oct 14;13:958780. doi: 10.3389/fgene.2022.958780. eCollection 2022.
2
Genomic-enabled prediction models using multi-environment trials to estimate the effect of genotype × environment interaction on prediction accuracy in chickpea.利用多环境试验进行基因组预测模型,估计基因型与环境互作对鹰嘴豆预测准确性的影响。
Sci Rep. 2018 Aug 3;8(1):11701. doi: 10.1038/s41598-018-30027-2.
3
Genomic Selection in Rubber Tree Breeding: A Comparison of Models and Methods for Managing G×E Interactions.橡胶树育种中的基因组选择:管理基因型与环境互作的模型和方法比较
Front Plant Sci. 2019 Oct 25;10:1353. doi: 10.3389/fpls.2019.01353. eCollection 2019.
4
Genomic prediction of agronomic traits in wheat using different models and cross-validation designs.利用不同模型和交叉验证设计对小麦农艺性状进行基因组预测。
Theor Appl Genet. 2021 Jan;134(1):381-398. doi: 10.1007/s00122-020-03703-z. Epub 2020 Nov 1.
5
Improving accuracies of genomic predictions for drought tolerance in maize by joint modeling of additive and dominance effects in multi-environment trials.通过联合建模多环境试验中的加性和显性效应来提高玉米抗旱性的基因组预测准确性。
Heredity (Edinb). 2018 Jul;121(1):24-37. doi: 10.1038/s41437-018-0053-6. Epub 2018 Feb 23.
6
Increased prediction accuracy in wheat breeding trials using a marker × environment interaction genomic selection model.使用标记×环境互作基因组选择模型提高小麦育种试验中的预测准确性。
G3 (Bethesda). 2015 Feb 6;5(4):569-82. doi: 10.1534/g3.114.016097.
7
Genome-Enabled Prediction Models for Yield Related Traits in Chickpea.鹰嘴豆产量相关性状的基因组预测模型
Front Plant Sci. 2016 Nov 22;7:1666. doi: 10.3389/fpls.2016.01666. eCollection 2016.
8
Genomic selection for lentil breeding: Empirical evidence.菜豆基因组选择育种:实证证据。
Plant Genome. 2020 Mar;13(1):e20002. doi: 10.1002/tpg2.20002. Epub 2020 Mar 27.
9
Early Selection Enabled by the Implementation of Genomic Selection in Breeding.通过在育种中实施基因组选择实现早期选择。
Front Plant Sci. 2019 Jan 8;9:1934. doi: 10.3389/fpls.2018.01934. eCollection 2018.
10
The accuracy of prediction of genomic selection in elite hybrid rye populations surpasses the accuracy of marker-assisted selection and is equally augmented by multiple field evaluation locations and test years.在优良杂交黑麦群体中,基因组选择的预测准确性超过了标记辅助选择,并且通过多个田间评估地点和试验年份同样得到了提高。
BMC Genomics. 2014 Jul 4;15(1):556. doi: 10.1186/1471-2164-15-556.

引用本文的文献

1
Toward a general framework for AI-enabled prediction in crop improvement.迈向作物改良中基于人工智能预测的通用框架。
Theor Appl Genet. 2025 Jun 12;138(7):151. doi: 10.1007/s00122-025-04928-6.
2
Enhancing Genomic Prediction Accuracy of Reproduction Traits in Rongchang Pigs Through Machine Learning.通过机器学习提高荣昌猪繁殖性状的基因组预测准确性
Animals (Basel). 2025 Feb 12;15(4):525. doi: 10.3390/ani15040525.
3
A tied-weight autoencoder for the linear dimensionality reduction of sample data.一种用于样本数据线性降维的带权重自编码器。

本文引用的文献

1
On the Use of the Pearson Correlation Coefficient for Model Evaluation in Genome-Wide Prediction.关于Pearson相关系数在全基因组预测模型评估中的应用
Front Genet. 2019 Sep 26;10:899. doi: 10.3389/fgene.2019.00899. eCollection 2019.
2
Genomic-enabled prediction models using multi-environment trials to estimate the effect of genotype × environment interaction on prediction accuracy in chickpea.利用多环境试验进行基因组预测模型,估计基因型与环境互作对鹰嘴豆预测准确性的影响。
Sci Rep. 2018 Aug 3;8(1):11701. doi: 10.1038/s41598-018-30027-2.
3
Increasing Genomic-Enabled Prediction Accuracy by Modeling Genotype × Environment Interactions in Kansas Wheat.
Sci Rep. 2024 Nov 5;14(1):26801. doi: 10.1038/s41598-024-77080-8.
4
Modeling Chickpea Productivity with Artificial Image Objects and Convolutional Neural Network.利用人工图像对象和卷积神经网络对鹰嘴豆生产力进行建模
Plants (Basel). 2024 Sep 1;13(17):2444. doi: 10.3390/plants13172444.
5
CuPCA: a web server for pan-cancer association analysis of large-scale cuproptosis-related genes.CuPCA:一个用于大规模铜死亡相关基因泛癌关联分析的网络服务器。
Database (Oxford). 2024 Sep 3;2024. doi: 10.1093/database/baae075.
通过在堪萨斯州小麦中建模基因型×环境互作来提高基因组增强预测准确性。
Plant Genome. 2017 Jul;10(2). doi: 10.3835/plantgenome2016.12.0130.
4
Genome-Enabled Prediction Models for Yield Related Traits in Chickpea.鹰嘴豆产量相关性状的基因组预测模型
Front Plant Sci. 2016 Nov 22;7:1666. doi: 10.3389/fpls.2016.01666. eCollection 2016.
5
Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent.通过坐标下降法求解Cox比例风险模型的正则化路径
J Stat Softw. 2011 Mar;39(5):1-13. doi: 10.18637/jss.v039.i05.
6
A reaction norm model for genomic selection using high-dimensional genomic and environmental data.利用高维基因组和环境数据进行基因组选择的反应规范模型。
Theor Appl Genet. 2014 Mar;127(3):595-607. doi: 10.1007/s00122-013-2243-1. Epub 2013 Dec 12.
7
Application of support vector regression to genome-assisted prediction of quantitative traits.支持向量回归在全基因组辅助数量性状预测中的应用。
Theor Appl Genet. 2011 Nov;123(7):1065-74. doi: 10.1007/s00122-011-1648-y. Epub 2011 Jul 8.
8
Semi-parametric genomic-enabled prediction of genetic values using reproducing kernel Hilbert spaces methods.使用再生核希尔伯特空间方法对遗传值进行半参数基因组预测。
Genet Res (Camb). 2010 Aug;92(4):295-308. doi: 10.1017/S0016672310000285.
9
LASSO with cross-validation for genomic selection.用于基因组选择的带交叉验证的套索算法。
Genet Res (Camb). 2009 Dec;91(6):427-36. doi: 10.1017/S0016672309990334.
10
A comparison of five methods to predict genomic breeding values of dairy bulls from genome-wide SNP markers.比较五种方法从全基因组 SNP 标记预测奶牛公牛的基因组育种值。
Genet Sel Evol. 2009 Dec 31;41(1):56. doi: 10.1186/1297-9686-41-56.