• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过 EthAcc 对基因组预测进行训练集优化。

Training set optimization of genomic prediction by means of EthAcc.

机构信息

LIPM, Université de Toulouse, INRA, CNRS, Castanet-Tolosan, France.

GDEC, INRA, Clermont-Ferrand, France.

出版信息

PLoS One. 2019 Feb 19;14(2):e0205629. doi: 10.1371/journal.pone.0205629. eCollection 2019.

DOI:10.1371/journal.pone.0205629
PMID:30779753
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6380617/
Abstract

Genomic prediction is a useful tool for plant and animal breeding programs and is starting to be used to predict human diseases as well. A shortcoming that slows down the genomic selection deployment is that the accuracy of the prediction is not known a priori. We propose EthAcc (Estimated THeoretical ACCuracy) as a method for estimating the accuracy given a training set that is genotyped and phenotyped. EthAcc is based on a causal quantitative trait loci model estimated by a genome-wide association study. This estimated causal model is crucial; therefore, we compared different methods to find the one yielding the best EthAcc. The multilocus mixed model was found to perform the best. We compared EthAcc to accuracy estimators that can be derived via a mixed marker model. We showed that EthAcc is the only approach to correctly estimate the accuracy. Moreover, in case of a structured population, in accordance with the achieved accuracy, EthAcc showed that the biggest training set is not always better than a smaller and closer training set. We then performed training set optimization with EthAcc and compared it to CDmean. EthAcc outperformed CDmean on real datasets from sugar beet, maize, and wheat. Nonetheless, its performance was mainly due to the use of an optimal but inaccessible set as a start of the optimization algorithm. EthAcc's precision and algorithm issues prevent it from reaching a good training set with a random start. Despite this drawback, we demonstrated that a substantial gain in accuracy can be obtained by performing training set optimization.

摘要

基因组预测是植物和动物育种计划的有用工具,并且开始被用于预测人类疾病。减缓基因组选择部署的一个缺点是,预测的准确性是未知的。我们提出 EthAcc(估计理论准确性)作为一种方法,用于根据已基因分型和表型的训练集来估计准确性。EthAcc 基于通过全基因组关联研究估计的因果数量性状位点模型。这个估计的因果模型至关重要;因此,我们比较了不同的方法来找到产生最佳 EthAcc 的方法。多基因混合模型被发现表现最好。我们将 EthAcc 与可以通过混合标记模型得出的准确性估计器进行了比较。我们表明,EthAcc 是正确估计准确性的唯一方法。此外,在结构种群的情况下,根据所达到的准确性,EthAcc 表明最大的训练集并不总是比更小且更接近的训练集更好。然后,我们使用 EthAcc 进行了训练集优化,并将其与 CDmean 进行了比较。EthAcc 在甜菜、玉米和小麦的真实数据集上的表现优于 CDmean。尽管如此,它的性能主要是由于使用了最优但无法访问的集合作为优化算法的起点。EthAcc 的精度和算法问题阻止了它从随机起点获得良好的训练集。尽管存在这一缺陷,但我们证明,通过执行训练集优化可以获得显著的准确性提高。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a71/6380617/3b40e6f05655/pone.0205629.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a71/6380617/3a260b4d605d/pone.0205629.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a71/6380617/1ac1a0b93fc6/pone.0205629.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a71/6380617/e17a6219673a/pone.0205629.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a71/6380617/65491dd0e91c/pone.0205629.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a71/6380617/13512d75a6c2/pone.0205629.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a71/6380617/3b40e6f05655/pone.0205629.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a71/6380617/3a260b4d605d/pone.0205629.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a71/6380617/1ac1a0b93fc6/pone.0205629.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a71/6380617/e17a6219673a/pone.0205629.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a71/6380617/65491dd0e91c/pone.0205629.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a71/6380617/13512d75a6c2/pone.0205629.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a71/6380617/3b40e6f05655/pone.0205629.g006.jpg

相似文献

1
Training set optimization of genomic prediction by means of EthAcc.通过 EthAcc 对基因组预测进行训练集优化。
PLoS One. 2019 Feb 19;14(2):e0205629. doi: 10.1371/journal.pone.0205629. eCollection 2019.
2
Optimization of genomic selection training populations with a genetic algorithm.利用遗传算法优化基因组选择训练群体
Genet Sel Evol. 2015 May 6;47(1):38. doi: 10.1186/s12711-015-0116-6.
3
Training set determination for genomic selection.基因组选择的训练集确定。
Theor Appl Genet. 2019 Oct;132(10):2781-2792. doi: 10.1007/s00122-019-03387-0. Epub 2019 Jul 2.
4
Accuracy of genomic selection for a sib-evaluated trait using identity-by-state and identity-by-descent relationships.利用状态一致性和系谱一致性关系对同胞评估性状进行基因组选择的准确性。
Genet Sel Evol. 2015 Feb 25;47(1):9. doi: 10.1186/s12711-014-0084-2.
5
Forecasting the accuracy of genomic prediction with different selection targets in the training and prediction set as well as truncation selection.预测基因组预测的准确性,不同的选择目标在训练和预测集以及截断选择。
Theor Appl Genet. 2015 Nov;128(11):2189-201. doi: 10.1007/s00122-015-2577-y. Epub 2015 Aug 1.
6
Training set optimization under population structure in genomic selection.基因组选择中群体结构下的训练集优化
Theor Appl Genet. 2015 Jan;128(1):145-58. doi: 10.1007/s00122-014-2418-4. Epub 2014 Nov 1.
7
Genomic-enabled prediction with classification algorithms.使用分类算法的基因组预测
Heredity (Edinb). 2014 Jun;112(6):616-26. doi: 10.1038/hdy.2013.144. Epub 2014 Jan 15.
8
Optimal Designs for Genomic Selection in Hybrid Crops.杂种作物基因组选择的最优设计。
Mol Plant. 2019 Mar 4;12(3):390-401. doi: 10.1016/j.molp.2018.12.022. Epub 2019 Jan 6.
9
Genomic prediction and training set optimization in a structured Mediterranean oat population.在结构化的地中海燕麦群体中进行基因组预测和训练集优化。
Theor Appl Genet. 2021 Nov;134(11):3595-3609. doi: 10.1007/s00122-021-03916-w. Epub 2021 Aug 3.
10
Economical optimization of a breeding scheme by selective phenotyping of the calibration set in a multi-trait context: application to bread making quality.在多性状背景下通过对校准集进行选择性表型分析来实现一个选育方案的经济性优化:在烘焙品质方面的应用。
Theor Appl Genet. 2020 Jul;133(7):2197-2212. doi: 10.1007/s00122-020-03590-4. Epub 2020 Apr 17.

引用本文的文献

1
Genomic selection: Essence, applications, and prospects.基因组选择:本质、应用与前景。
Plant Genome. 2025 Jun;18(2):e70053. doi: 10.1002/tpg2.70053.
2
Portability of genomic predictions trained on sparse factorial designs across two maize silage breeding cycles.基于稀疏析因设计的基因组预测在两个玉米青贮育种周期中的可转移性。
Theor Appl Genet. 2024 Mar 7;137(3):75. doi: 10.1007/s00122-024-04566-4.
3
Optimization of Multi-Generation Multi-location Genomic Prediction Models for Recurrent Genomic Selection in an Upland Rice Population.

本文引用的文献

1
Estimation of genomic prediction accuracy from reference populations with varying degrees of relationship.从具有不同亲缘关系程度的参考群体中估计基因组预测准确性。
PLoS One. 2017 Dec 21;12(12):e0189775. doi: 10.1371/journal.pone.0189775. eCollection 2017.
2
Genomic Prediction of Sunflower Hybrids Oil Content.向日葵杂交种含油量的基因组预测
Front Plant Sci. 2017 Sep 21;8:1633. doi: 10.3389/fpls.2017.01633. eCollection 2017.
3
Efficient Estimation of Realized Kinship from Single Nucleotide Polymorphism Genotypes.基于单核苷酸多态性基因型的实现亲缘关系的有效估计
陆稻群体轮回基因组选择中多代多地点基因组预测模型的优化
Rice (N Y). 2023 Sep 27;16(1):43. doi: 10.1186/s12284-023-00661-0.
4
Genomic Prediction: Progress and Perspectives for Rice Improvement.基因组预测:水稻改良的进展与展望
Methods Mol Biol. 2022;2467:569-617. doi: 10.1007/978-1-0716-2205-6_21.
5
Building a Calibration Set for Genomic Prediction, Characteristics to Be Considered, and Optimization Approaches.构建用于基因组预测的校准集、需考虑的特征及优化方法。
Methods Mol Biol. 2022;2467:77-112. doi: 10.1007/978-1-0716-2205-6_3.
6
Sparse testing using genomic prediction improves selection for breeding targets in elite spring wheat.利用基因组预测进行稀疏测试可提高对优质春小麦育种目标的选择。
Theor Appl Genet. 2022 Jun;135(6):1939-1950. doi: 10.1007/s00122-022-04085-0. Epub 2022 Mar 28.
7
Combining genetic resources and elite material populations to improve the accuracy of genomic prediction in apple.结合遗传资源和优良材料群体提高苹果基因组预测的准确性。
G3 (Bethesda). 2022 Mar 4;12(3). doi: 10.1093/g3journal/jkab420.
8
Training Set Optimization for Sparse Phenotyping in Genomic Selection: A Conceptual Overview.基因组选择中稀疏表型分析的训练集优化:概念概述
Front Plant Sci. 2021 Sep 9;12:715910. doi: 10.3389/fpls.2021.715910. eCollection 2021.
9
Impact of early genomic prediction for recurrent selection in an upland rice synthetic population.早代基因组预测对水稻合成群体继代选择的影响。
G3 (Bethesda). 2021 Dec 8;11(12). doi: 10.1093/g3journal/jkab320.
10
TrainSel: An R Package for Selection of Training Populations.TrainSel:一个用于选择训练群体的R软件包。
Front Genet. 2021 May 7;12:655287. doi: 10.3389/fgene.2021.655287. eCollection 2021.
Genetics. 2017 Mar;205(3):1063-1078. doi: 10.1534/genetics.116.197004. Epub 2017 Jan 18.
4
Improvement of Predictive Ability by Uniform Coverage of the Target Genetic Space.通过均匀覆盖目标基因空间提高预测能力。
G3 (Bethesda). 2016 Nov 8;6(11):3733-3747. doi: 10.1534/g3.116.035410.
5
Optimum breeding strategies using genomic selection for hybrid breeding in wheat, maize, rye, barley, rice and triticale.利用基因组选择进行杂种优势育种的最佳繁殖策略,适用于小麦、玉米、黑麦、大麦、水稻和小黑麦。
Theor Appl Genet. 2016 Oct;129(10):1901-13. doi: 10.1007/s00122-016-2748-5. Epub 2016 Jul 7.
6
On the Accuracy of Genomic Selection.论基因组选择的准确性
PLoS One. 2016 Jun 20;11(6):e0156086. doi: 10.1371/journal.pone.0156086. eCollection 2016.
7
Approximated prediction of genomic selection accuracy when reference and candidate populations are related.当参考群体和候选群体相关时基因组选择准确性的近似预测。
Genet Sel Evol. 2016 Mar 3;48:18. doi: 10.1186/s12711-016-0183-3.
8
Variable-Selection Emerges on Top in Empirical Comparison of Whole-Genome Complex-Trait Prediction Methods.在全基因组复杂性状预测方法的实证比较中,变量选择脱颖而出。
PLoS One. 2015 Oct 6;10(10):e0138903. doi: 10.1371/journal.pone.0138903. eCollection 2015.
9
Optimization of genomic selection training populations with a genetic algorithm.利用遗传算法优化基因组选择训练群体
Genet Sel Evol. 2015 May 6;47(1):38. doi: 10.1186/s12711-015-0116-6.
10
A new initiative on precision medicine.一项关于精准医学的新倡议。
N Engl J Med. 2015 Feb 26;372(9):793-5. doi: 10.1056/NEJMp1500523. Epub 2015 Jan 30.