• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

具有偏态正态随机误差的贝叶斯基因组回归模型

A Bayesian Genomic Regression Model with Skew Normal Random Errors.

作者信息

Pérez-Rodríguez Paulino, Acosta-Pech Rocío, Pérez-Elizalde Sergio, Cruz Ciro Velasco, Espinosa Javier Suárez, Crossa José

机构信息

Colegio de Postgraduados, CP 56230, Montecillos, Edo. de México.

Colegio de Postgraduados, CP 56230, Montecillos, Edo. de México

出版信息

G3 (Bethesda). 2018 May 4;8(5):1771-1785. doi: 10.1534/g3.117.300406.

DOI:10.1534/g3.117.300406
PMID:29588381
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5940167/
Abstract

Genomic selection (GS) has become a tool for selecting candidates in plant and animal breeding programs. In the case of quantitative traits, it is common to assume that the distribution of the response variable can be approximated by a normal distribution. However, it is known that the selection process leads to skewed distributions. There is vast statistical literature on skewed distributions, but the skew normal distribution is of particular interest in this research. This distribution includes a third parameter that drives the skewness, so that it generalizes the normal distribution. We propose an extension of the Bayesian whole-genome regression to skew normal distribution data in the context of GS applications, where usually the number of predictors vastly exceeds the sample size. However, it can also be applied when the number of predictors is smaller than the sample size. We used a stochastic representation of a skew normal random variable, which allows the implementation of standard Markov Chain Monte Carlo (MCMC) techniques to efficiently fit the proposed model. The predictive ability and goodness of fit of the proposed model were evaluated using simulated and real data, and the results were compared to those obtained by the Bayesian Ridge Regression model. Results indicate that the proposed model has a better fit and is as good as the conventional Bayesian Ridge Regression model for prediction, based on the DIC criterion and cross-validation, respectively. A computing program coded in the R statistical package and C programming language to fit the proposed model is available as supplementary material.

摘要

基因组选择(GS)已成为动植物育种计划中选择候选个体的一种工具。对于数量性状而言,通常假定响应变量的分布可以用正态分布来近似。然而,众所周知,选择过程会导致分布出现偏态。关于偏态分布有大量的统计文献,但在本研究中,偏态正态分布特别受关注。这种分布包含一个驱动偏度的第三个参数,从而对正态分布进行了推广。我们提出在GS应用的背景下,将贝叶斯全基因组回归扩展到偏态正态分布数据,在GS应用中通常预测变量的数量远远超过样本量。不过,当预测变量的数量小于样本量时也可以应用。我们使用了偏态正态随机变量的一种随机表示,这使得可以实施标准的马尔可夫链蒙特卡罗(MCMC)技术来有效地拟合所提出的模型。使用模拟数据和实际数据评估了所提出模型的预测能力和拟合优度,并将结果与贝叶斯岭回归模型得到的结果进行了比较。结果表明,基于DIC准则和交叉验证,所提出的模型分别具有更好的拟合度,并且在预测方面与传统的贝叶斯岭回归模型一样好。作为补充材料提供了一个用R统计软件包和C编程语言编写的用于拟合所提出模型的计算程序。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7e/5940167/b9e85d666bed/1771f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7e/5940167/b9e85d666bed/1771f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7e/5940167/c546cf613621/1771f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7e/5940167/5d2848cbdeb8/1771f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7e/5940167/353659b20358/1771f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7e/5940167/d4a344214a50/1771f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7e/5940167/b9e85d666bed/1771f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7e/5940167/b9e85d666bed/1771f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7e/5940167/c546cf613621/1771f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7e/5940167/5d2848cbdeb8/1771f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7e/5940167/353659b20358/1771f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7e/5940167/d4a344214a50/1771f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f7e/5940167/b9e85d666bed/1771f1.jpg

相似文献

1
A Bayesian Genomic Regression Model with Skew Normal Random Errors.具有偏态正态随机误差的贝叶斯基因组回归模型
G3 (Bethesda). 2018 May 4;8(5):1771-1785. doi: 10.1534/g3.117.300406.
2
Genomic-Enabled Prediction of Ordinal Data with Bayesian Logistic Ordinal Regression.基于贝叶斯逻辑序贯回归的有序数据基因组预测
G3 (Bethesda). 2015 Aug 18;5(10):2113-26. doi: 10.1534/g3.115.021154.
3
A Variational Bayes Genomic-Enabled Prediction Model with Genotype × Environment Interaction.一种具有基因型×环境相互作用的变分贝叶斯基因组预测模型。
G3 (Bethesda). 2017 Jun 7;7(6):1833-1853. doi: 10.1534/g3.117.041202.
4
A Genomic Bayesian Multi-trait and Multi-environment Model.一种基因组贝叶斯多性状多环境模型。
G3 (Bethesda). 2016 Sep 8;6(9):2725-44. doi: 10.1534/g3.116.032359.
5
Bayesian inference for finite mixtures of univariate and multivariate skew-normal and skew-t distributions.贝叶斯推断用于单变量和多变量偏斜正态和偏斜 t 分布的有限混合。
Biostatistics. 2010 Apr;11(2):317-36. doi: 10.1093/biostatistics/kxp062. Epub 2010 Jan 27.
6
A Bayesian Genomic Multi-output Regressor Stacking Model for Predicting Multi-trait Multi-environment Plant Breeding Data.一种用于预测多性状多环境植物育种数据的贝叶斯基因组多输出回归器堆叠模型。
G3 (Bethesda). 2019 Oct 7;9(10):3381-3393. doi: 10.1534/g3.119.400336.
7
Fast genomic prediction of breeding values using parallel Markov chain Monte Carlo with convergence diagnosis.利用具有收敛诊断的并行马尔可夫链蒙特卡罗方法快速预测育种值。
BMC Bioinformatics. 2018 Jan 3;19(1):3. doi: 10.1186/s12859-017-2003-3.
8
Regularized quantile regression applied to genome-enabled prediction of quantitative traits.正则化分位数回归应用于基于基因组的数量性状预测。
Genet Mol Res. 2017 Mar 22;16(1):gmr-16-01-gmr.16019538. doi: 10.4238/gmr16019538.
9
A computationally efficient algorithm for genomic prediction using a Bayesian model.一种使用贝叶斯模型进行基因组预测的计算高效算法。
Genet Sel Evol. 2015 Apr 30;47(1):34. doi: 10.1186/s12711-014-0082-4.
10
An R Package for Bayesian Analysis of Multi-environment and Multi-trait Multi-environment Data for Genome-Based Prediction.用于基于基因组预测的多环境和多性状多环境数据的贝叶斯分析的 R 包。
G3 (Bethesda). 2019 May 7;9(5):1355-1369. doi: 10.1534/g3.119.400126.

引用本文的文献

1
Maximum Threshold Genomic Prediction Model for Ordinal Traits.最大阈值基因组预测模型用于有序性状。
G3 (Bethesda). 2020 Nov 5;10(11):4083-4102. doi: 10.1534/g3.120.401733.

本文引用的文献

1
Regularized quantile regression applied to genome-enabled prediction of quantitative traits.正则化分位数回归应用于基于基因组的数量性状预测。
Genet Mol Res. 2017 Mar 22;16(1):gmr-16-01-gmr.16019538. doi: 10.4238/gmr16019538.
2
A robust multiple-locus method for quantitative trait locus analysis of non-normally distributed multiple traits.一种用于非正态分布多性状数量性状基因座分析的稳健多位点方法。
Heredity (Edinb). 2015 Dec;115(6):556-64. doi: 10.1038/hdy.2015.61. Epub 2015 Jul 15.
3
Genome-wide regression and prediction with the BGLR statistical package.
使用BGLR统计软件包进行全基因组回归与预测。
Genetics. 2014 Oct;198(2):483-95. doi: 10.1534/genetics.114.164442. Epub 2014 Jul 9.
4
Priors in whole-genome regression: the bayesian alphabet returns.全基因组回归中的先验信息:贝叶斯字母表回归。
Genetics. 2013 Jul;194(3):573-96. doi: 10.1534/genetics.113.151753. Epub 2013 May 1.
5
Comparison between linear and non-parametric regression models for genome-enabled prediction in wheat.线性和非参数回归模型在小麦基因组预测中的比较。
G3 (Bethesda). 2012 Dec;2(12):1595-605. doi: 10.1534/g3.112.003665. Epub 2012 Dec 1.
6
Whole-genome regression and prediction methods applied to plant and animal breeding.全基因组回归和预测方法在动植物育种中的应用。
Genetics. 2013 Feb;193(2):327-45. doi: 10.1534/genetics.112.143313. Epub 2012 Jun 28.
7
Genome-enabled prediction of genetic values using radial basis function neural networks.基于径向基函数神经网络的基因组预测遗传值。
Theor Appl Genet. 2012 Aug;125(4):759-71. doi: 10.1007/s00122-012-1868-9. Epub 2012 May 8.
8
Stochastic relaxation, gibbs distributions, and the bayesian restoration of images.随机松弛,吉布斯分布,以及贝叶斯图像恢复。
IEEE Trans Pattern Anal Mach Intell. 1984 Jun;6(6):721-41. doi: 10.1109/tpami.1984.4767596.
9
Normalization of high dimensional genomics data where the distribution of the altered variables is skewed.对高维基因组学数据进行归一化,其中改变变量的分布是偏态的。
PLoS One. 2011;6(11):e27942. doi: 10.1371/journal.pone.0027942. Epub 2011 Nov 22.
10
Semi-parametric genomic-enabled prediction of genetic values using reproducing kernel Hilbert spaces methods.使用再生核希尔伯特空间方法对遗传值进行半参数基因组预测。
Genet Res (Camb). 2010 Aug;92(4):295-308. doi: 10.1017/S0016672310000285.