• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于基于基因组预测数量性状遗传值的非参数混合模型。

A non-parametric mixture model for genome-enabled prediction of genetic value for a quantitative trait.

作者信息

Gianola Daniel, Wu Xiao-Lin, Manfredi Eduardo, Simianer Henner

机构信息

Department of Animal Sciences and Department of Dairy Science, University of Wisconsin-Madison, 1675 Observatory Dr, Madison, WI 53706, USA.

出版信息

Genetica. 2010 Oct;138(9-10):959-77. doi: 10.1007/s10709-010-9478-4. Epub 2010 Aug 25.

DOI:10.1007/s10709-010-9478-4
PMID:20737196
Abstract

A Bayesian nonparametric form of regression based on Dirichlet process priors is adapted to the analysis of quantitative traits possibly affected by cryptic forms of gene action, and to the context of SNP-assisted genomic selection, where the main objective is to predict a genomic signal on phenotype. The procedure clusters unknown genotypes into groups with distinct genetic values, but in a setting in which the number of clusters is unknown a priori, so that standard methods for finite mixture analysis do not work. The central assumption is that genetic effects follow an unknown distribution with some "baseline" family, which is a normal process in the cases considered here. A Bayesian analysis based on the Gibbs sampler produces estimates of the number of clusters, posterior means of genetic effects, a measure of credibility in the baseline distribution, as well as estimates of parameters of the latter. The procedure is illustrated with a simulation representing two populations. In the first one, there are 3 unknown QTL, with additive, dominance and epistatic effects; in the second, there are 10 QTL with additive, dominance and additive × additive epistatic effects. In the two populations, baseline parameters are inferred correctly. The Dirichlet process model infers the number of unique genetic values correctly in the first population, but it produces an understatement in the second one; here, the true number of clusters is over 900, and the model gives a posterior mean estimate of about 140, probably because more replication of genotypes is needed for correct inference. The impact on inferences of the prior distribution of a key parameter (M), and of the extent of replication, was examined via an analysis of mean body weight in 192 paternal half-sib families of broiler chickens, where each sire was genotyped for nearly 7,000 SNPs. In this small sample, it was found that inference about the number of clusters was affected by the prior distribution of M. For a set of combinations of parameters of a given prior distribution, the effects of the prior dissipated when the number of replicate samples per genotype was increased. Thus, the Dirichlet process model seems to be useful for gauging the number of QTLs affecting the trait: if the number of clusters inferred is small, probably just a few QTLs code for the trait. If the number of clusters inferred is large, this may imply that standard parametric models based on the baseline distribution may suffice. However, priors may be influential, especially if sample size is not large and if only a few genotypic configurations have replicate phenotypes in the sample.

摘要

一种基于狄利克雷过程先验的贝叶斯非参数回归形式被应用于分析可能受隐性基因作用形式影响的数量性状,以及单核苷酸多态性(SNP)辅助基因组选择的背景下,其主要目标是预测表型上的基因组信号。该程序将未知基因型聚类为具有不同遗传值的组,但在聚类数量先验未知的情况下,因此有限混合分析的标准方法不起作用。核心假设是遗传效应遵循某个“基线”家族的未知分布,在此处考虑的情况下这是一个正态过程。基于吉布斯采样器的贝叶斯分析产生聚类数量的估计、遗传效应的后验均值、基线分布可信度的度量以及后者参数的估计。该程序通过一个代表两个群体的模拟进行说明。在第一个群体中,有3个未知的数量性状基因座(QTL),具有加性、显性和上位性效应;在第二个群体中,有10个QTL,具有加性、显性和加性×加性上位性效应。在这两个群体中,基线参数被正确推断。狄利克雷过程模型在第一个群体中正确推断出独特遗传值的数量,但在第二个群体中低估了该数量;在这里,聚类的真实数量超过900,而该模型给出的后验均值估计约为140,可能是因为需要更多的基因型重复来进行正确推断。通过对192个肉鸡父系半同胞家系的平均体重分析,研究了一个关键参数(M)的先验分布以及重复程度对推断的影响,其中每个父本对近7000个SNP进行了基因分型。在这个小样本中,发现关于聚类数量的推断受M的先验分布影响。对于给定先验分布的一组参数组合,当每个基因型的重复样本数量增加时,先验的影响会消散。因此,狄利克雷过程模型似乎对于衡量影响该性状的QTL数量很有用:如果推断出的聚类数量少,可能只有少数QTL编码该性状。如果推断出的聚类数量多,这可能意味着基于基线分布的标准参数模型可能就足够了。然而,先验可能有影响,特别是如果样本量不大且样本中只有少数基因型配置有重复表型时。

相似文献

1
A non-parametric mixture model for genome-enabled prediction of genetic value for a quantitative trait.一种用于基于基因组预测数量性状遗传值的非参数混合模型。
Genetica. 2010 Oct;138(9-10):959-77. doi: 10.1007/s10709-010-9478-4. Epub 2010 Aug 25.
2
Accuracy of prediction of simulated polygenic phenotypes and their underlying quantitative trait loci genotypes using real or imputed whole-genome markers in cattle.利用真实或推算的全基因组标记预测牛模拟多基因表型及其潜在数量性状位点基因型的准确性。
Genet Sel Evol. 2015 Dec 23;47:99. doi: 10.1186/s12711-015-0179-4.
3
Radial basis function regression methods for predicting quantitative traits using SNP markers.使用单核苷酸多态性(SNP)标记预测数量性状的径向基函数回归方法。
Genet Res (Camb). 2010 Jun;92(3):209-25. doi: 10.1017/S0016672310000157.
4
Impact of prior specifications in a shrinkage-inducing Bayesian model for quantitative trait mapping and genomic prediction.在一个诱导收缩的贝叶斯模型中,先验规范对数量性状定位和基因组预测的影响。
Genet Sel Evol. 2013 Jul 8;45(1):24. doi: 10.1186/1297-9686-45-24.
5
Efficiency of genomic selection using Bayesian multi-marker models for traits selected to reflect a wide range of heritabilities and frequencies of detected quantitative traits loci in mice.利用贝叶斯多标记模型进行基因组选择的效率,这些模型针对反映广泛遗传力和检测到的数量性状基因座频率的性状进行选择。
BMC Genet. 2012 May 31;13:42. doi: 10.1186/1471-2156-13-42.
6
Application of a Bayesian dominance model improves power in quantitative trait genome-wide association analysis.贝叶斯优势模型的应用提高了数量性状全基因组关联分析的效能。
Genet Sel Evol. 2017 Jan 14;49(1):7. doi: 10.1186/s12711-017-0284-7.
7
Including non-additive genetic effects in Bayesian methods for the prediction of genetic values based on genome-wide markers.基于全基因组标记物,在贝叶斯方法预测遗传值时纳入非加性遗传效应。
BMC Genet. 2011 Aug 25;12:74. doi: 10.1186/1471-2156-12-74.
8
Covariance Between Genotypic Effects and its Use for Genomic Inference in Half-Sib Families.半同胞家系中基因型效应的协方差及其在基因组推断中的应用
G3 (Bethesda). 2016 Sep 8;6(9):2761-72. doi: 10.1534/g3.116.032409.
9
Multibreed genomic prediction using multitrait genomic residual maximum likelihood and multitask Bayesian variable selection.多品种基因组预测使用多性状基因组残差极大似然法和多任务贝叶斯变量选择。
J Dairy Sci. 2018 May;101(5):4279-4294. doi: 10.3168/jds.2017-13366. Epub 2018 Mar 15.
10
A multi-trait Bayesian method for mapping QTL and genomic prediction.一种用于 QTL 作图和基因组预测的多性状贝叶斯方法。
Genet Sel Evol. 2018 Mar 24;50(1):10. doi: 10.1186/s12711-018-0377-y.

引用本文的文献

1
Deep learning for genomic selection of aquatic animals.用于水生动物基因组选择的深度学习
Mar Life Sci Technol. 2024 Sep 27;6(4):631-650. doi: 10.1007/s42995-024-00252-y. eCollection 2024 Nov.
2
Compression distance can discriminate animals by genetic profile, build relationship matrices and estimate breeding values.压缩距离可以通过基因图谱区分动物、构建关系矩阵并估计育种值。
Genet Sel Evol. 2015 Oct 13;47:78. doi: 10.1186/s12711-015-0158-9.
3
Joint prediction of multiple quantitative traits using a Bayesian multivariate antedependence model.

本文引用的文献

1
Sire evaluation for ordered categorical data with a threshold model.使用阈值模型对有序分类数据进行 sire 评估。
Genet Sel Evol (1983). 1983;15(2):201-24. doi: 10.1186/1297-9686-15-2-201.
2
Additive genetic variability and the Bayesian alphabet.加性遗传变异性和贝叶斯字母表。
Genetics. 2009 Sep;183(1):347-63. doi: 10.1534/genetics.109.103952. Epub 2009 Jul 20.
3
Predicting quantitative traits with regression models for dense molecular markers and pedigree.使用针对密集分子标记和系谱的回归模型预测数量性状。
使用贝叶斯多元前相依模型对多个数量性状进行联合预测。
Heredity (Edinb). 2015 Jul;115(1):29-36. doi: 10.1038/hdy.2015.9. Epub 2015 Apr 15.
4
Meta-analysis of candidate gene effects using bayesian parametric and non-parametric approaches.使用贝叶斯参数和非参数方法对候选基因效应进行荟萃分析。
J Genomics. 2014 Jan 2;2:1-19. doi: 10.7150/jgen.5054. eCollection 2014.
5
Predicting expected progeny difference for marbling score in Angus cattle using artificial neural networks and Bayesian regression models.利用人工神经网络和贝叶斯回归模型预测安格斯牛大理石花纹评分的预期后代差异。
Genet Sel Evol. 2013 Sep 11;45(1):34. doi: 10.1186/1297-9686-45-34.
6
A Bayesian antedependence model for whole genome prediction.全基因组预测的贝叶斯反相关模型。
Genetics. 2012 Apr;190(4):1491-501. doi: 10.1534/genetics.111.131540. Epub 2011 Nov 30.
Genetics. 2009 May;182(1):375-85. doi: 10.1534/genetics.109.101501. Epub 2009 Mar 16.
4
Genome-assisted prediction of a quantitative trait measured in parents and progeny: application to food conversion rate in chickens.利用基因组辅助预测亲本和后代中测量的数量性状:在鸡的饲料转化率中的应用。
Genet Sel Evol. 2009 Jan 5;41(1):3. doi: 10.1186/1297-9686-41-3.
5
Reproducing kernel Hilbert spaces regression: a general framework for genetic evaluation.再生核希尔伯特空间回归:遗传评估的通用框架
J Anim Sci. 2009 Jun;87(6):1883-7. doi: 10.2527/jas.2008-1259. Epub 2009 Feb 11.
6
Invited review: Genomic selection in dairy cattle: progress and challenges.特邀综述:奶牛的基因组选择:进展与挑战
J Dairy Sci. 2009 Feb;92(2):433-43. doi: 10.3168/jds.2008-1646.
7
Inferring genetic values for quantitative traits non-parametrically.非参数推断数量性状的遗传值。
Genet Res (Camb). 2008 Dec;90(6):525-40. doi: 10.1017/S0016672308009890.
8
Efficient methods to compute genomic predictions.计算基因组预测的有效方法。
J Dairy Sci. 2008 Nov;91(11):4414-23. doi: 10.3168/jds.2007-0980.
9
Bayesian LASSO for quantitative trait loci mapping.用于数量性状基因座定位的贝叶斯套索法
Genetics. 2008 Jun;179(2):1045-55. doi: 10.1534/genetics.107.085589. Epub 2008 May 27.
10
Nonparametric methods for incorporating genomic information into genetic evaluations: an application to mortality in broilers.将基因组信息纳入遗传评估的非参数方法:在肉鸡死亡率中的应用
Genetics. 2008 Apr;178(4):2305-13. doi: 10.1534/genetics.107.084293.