Reduction in accuracy of genomic prediction for ordered categorical data compared to continuous observations.

Authors

Kizilkaya Kadir, Fernando Rohan L, Garrick Dorian J

Affiliation

Department of Animal Science, Iowa State University, Ames IA 50011, USA.

Publication information

Genet Sel Evol. 2014 Jun 9;46(1):37. doi: 10.1186/1297-9686-46-37.

DOI:10.1186/1297-9686-46-37
PMID:24912924
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4094927/
Abstract

BACKGROUND

Accuracy of genomic prediction depends on number of records in the training population, heritability, effective population size, genetic architecture, and relatedness of training and validation populations. Many traits have ordered categories including reproductive performance and susceptibility or resistance to disease. Categorical scores are often recorded because they are easier to obtain than continuous observations. Bayesian linear regression has been extended to the threshold model for genomic prediction. The objective of this study was to quantify reductions in accuracy for ordinal categorical traits relative to continuous traits.

METHODS

Efficiency of genomic prediction was evaluated for heritabilities of 0.10, 0.25 or 0.50. Phenotypes were simulated for 2250 purebred animals using 50 QTL selected from actual 50k SNP (single nucleotide polymorphism) genotypes, giving a proportion of causal to total loci of 0.0001. A Bayes Cπ threshold model simultaneously fitted all 50k markers except those that represented QTL. Estimated SNP effects were used to predict genomic breeding values in purebred (n = 239) or multibreed (n = 924) validation populations. Correlations between true and predicted genomic merit in validation populations were used to assess predictive ability.
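
As a rough illustration of the simulation idea described above (not the authors' code), the following Python sketch generates an underlying continuous variable with a chosen heritability, cuts it into ordered categories, and compares how strongly each version correlates with the true genetic merit. Everything specific here is an assumption made for illustration: the genotypes are random draws rather than real 50k SNP data, the allele frequencies, QTL effects, three categories and thresholds of -0.5 and 0.5 are arbitrary, and the phenotype itself stands in for a predictor, so no Bayes Cπ model is fitted.

```python
import numpy as np


def simulate(n_animals=2250, n_qtl=50, h2=0.25, thresholds=(-0.5, 0.5), seed=0):
    """Simulate true genetic merit, a continuous liability and ordinal scores.

    All defaults except n_animals, n_qtl and h2 are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)

    # Hypothetical QTL genotypes coded 0/1/2 (random, not real SNP data).
    freqs = rng.uniform(0.1, 0.9, size=n_qtl)
    geno = rng.binomial(2, freqs, size=(n_animals, n_qtl))

    # Hypothetical QTL effects; genetic merit is rescaled so that its
    # variance equals h2 and the total liability variance is about 1.
    effects = rng.normal(size=n_qtl)
    g = geno @ effects
    g = (g - g.mean()) / g.std() * np.sqrt(h2)

    # Residual variance 1 - h2 gives the requested heritability.
    e = rng.normal(scale=np.sqrt(1.0 - h2), size=n_animals)
    liability = g + e

    # Ordered categories: 0 below the first threshold, 1 between, 2 above.
    score = np.digitize(liability, thresholds)
    return g, liability, score


g, liability, score = simulate()

# Correlation with true genetic merit, in the spirit of the accuracy measure.
r_continuous = np.corrcoef(g, liability)[0, 1]
r_categorical = np.corrcoef(g, score)[0, 1]
print(f"continuous: {r_continuous:.2f}, categorical: {r_categorical:.2f}")
```

In this toy setting the categorical score carries less information about genetic merit than the continuous variable it was derived from, which is the kind of loss the study set out to quantify with real genotypes and a full genomic prediction model.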

RESULTS

Accuracies of genomic estimated breeding values ranged from 0.12 to 0.66 for purebred and from 0.04 to 0.53 for multibreed validation populations based on Bayes C π linear model analysis of the simulated underlying variable. Accuracies for ordinal categorical scores analyzed by the Bayes C π threshold model were 20% to 50% lower and ranged from 0.04 to 0.55 for purebred and from 0.01 to 0.44 for multibreed validation populations. Analysis of ordinal categorical scores using a linear model resulted in further reductions in accuracy.

CONCLUSIONS

Threshold traits result in markedly lower accuracy than a linear model on the underlying variable. To achieve an accuracy equal to or greater than that for continuous phenotypes with a training population of 1000 animals, a 2.25-fold increase in training population size was required for categorical scores fitted with the threshold model. The threshold model resulted in higher accuracies than the linear model, and its advantage was greatest when training populations were smallest.
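
The sample-size statement can be checked with simple arithmetic. The sketch below reads "a 2.25 fold increase" as multiplying the baseline training size by 2.25, which is an interpretation rather than something the abstract spells out; the variable names are hypothetical.

```python
# Baseline training population size for continuous phenotypes (from the abstract).
baseline_n = 1000

# Fold increase reported for categorical scores fitted with the threshold model.
fold_increase = 2.25

# Assumed interpretation: required size = baseline size x fold increase.
required_n = int(baseline_n * fold_increase)
print(required_n)  # 2250
```

Under this reading the required training size equals the 2250 purebred animals for which phenotypes were simulated in the Methods.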
