Predicting complex traits using a diffusion kernel on genetic markers with an application to dairy cattle and wheat data.

Affiliation

Department of Animal Sciences, University of Wisconsin-Madison, Madison, WI, USA.

Publication information

Genet Sel Evol. 2013 Jun 13;45(1):17. doi: 10.1186/1297-9686-45-17.

DOI:10.1186/1297-9686-45-17

PMID:23763755

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3706293/

Abstract

BACKGROUND

Arguably, genotypes and phenotypes may be linked in functional forms that are not well addressed by the linear additive models that are standard in quantitative genetics. It is therefore important to develop statistical learning models that can capture complex genetic network architectures when predicting phenotypic values from all available molecular information. Bayesian kernel ridge regression is a non-parametric prediction model proposed for this purpose. Its essence is to create a spatial distance-based relationship matrix called a kernel. Although the set of all single nucleotide polymorphism genotype configurations on which a model is built is finite, past research has mainly used a Gaussian kernel, which is defined on a continuous input space.
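As a rough illustration of the kernel idea described above, the following minimal sketch builds a Gaussian kernel from marker genotypes and uses it in ordinary kernel ridge regression. It is not the authors' Bayesian implementation: the bandwidth `theta` and the penalty `lam` are fixed by hand rather than inferred, the genotypes and phenotypes are simulated toy data, and the function names `gaussian_kernel` and `kernel_ridge_fit` are hypothetical.

```python
import numpy as np

def gaussian_kernel(X, Z, theta):
    """Gaussian kernel K[i, j] = exp(-theta * ||X[i] - Z[j]||^2)."""
    sq_dist = (X**2).sum(axis=1)[:, None] + (Z**2).sum(axis=1)[None, :] - 2.0 * X @ Z.T
    return np.exp(-theta * np.maximum(sq_dist, 0.0))

def kernel_ridge_fit(K, y, lam):
    """Solve (K + lam * I) alpha = y for the dual coefficients alpha."""
    n = K.shape[0]
    return np.linalg.solve(K + lam * np.eye(n), y)

# Toy data (an assumption for illustration): 60 individuals, 20 markers coded 0/1/2.
rng = np.random.default_rng(0)
X = rng.integers(0, 3, size=(60, 20)).astype(float)
# Simulated phenotype with a non-additive (marker-by-marker) signal plus noise.
y = X[:, 0] * X[:, 1] + rng.normal(scale=0.1, size=60)

X_train, y_train, X_test = X[:50], y[:50], X[50:]
theta = 1.0 / X.shape[1]          # hand-picked bandwidth, not estimated
y_mean = y_train.mean()

K_train = gaussian_kernel(X_train, X_train, theta)
alpha = kernel_ridge_fit(K_train, y_train - y_mean, lam=1.0)

# Predictions for unseen individuals use their kernel values against the training set.
y_hat = gaussian_kernel(X_test, X_train, theta) @ alpha + y_mean
```

In a Bayesian treatment the penalty would correspond to a ratio of variance components with their own priors; fixing it here only keeps the sketch short.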

RESULTS

We investigated the performance of a diffusion kernel, which was specifically developed to model discrete marker inputs, using Holstein cattle and wheat data. This kernel can be viewed as a discretization of the Gaussian kernel. The diffusion kernel predicted about as well as non-spatial distance-based additive genomic relationship kernels in the Holstein data, and it outperformed them in the wheat data. However, the difference in performance between the diffusion and Gaussian kernels was negligible.
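The link between the two kernels can be sketched under simplifying assumptions. The example below uses the standard diffusion kernel on a hypercube, which is proportional to tanh(beta) raised to the Hamming distance between two binary vectors. The binary marker coding, the parameter `beta`, and the function names are assumptions for illustration; the abstract does not state the paper's exact genotype coding or kernel construction. On binary inputs the squared Euclidean distance equals the Hamming distance, so a Gaussian kernel with theta = -log(tanh(beta)) reproduces the diffusion kernel exactly.

```python
import numpy as np

def hamming_distances(B, C):
    """Pairwise number of differing positions between rows of binary arrays B and C."""
    return B @ (1 - C).T + (1 - B) @ C.T

def diffusion_kernel(B, C, beta):
    """Hypercube diffusion kernel, normalized so that K(x, x) = 1."""
    return np.tanh(beta) ** hamming_distances(B, C)

def gaussian_kernel_binary(B, C, theta):
    """Gaussian kernel on binary inputs, where squared Euclidean = Hamming distance."""
    return np.exp(-theta * hamming_distances(B, C))

# Toy binary marker matrix (an assumption): 8 individuals, 30 binary-coded positions.
rng = np.random.default_rng(1)
B = rng.integers(0, 2, size=(8, 30))

beta = 0.8                          # hand-picked diffusion parameter
K_diff = diffusion_kernel(B, B, beta)

# Reparametrize the Gaussian bandwidth so both kernels decay at the same rate.
theta = -np.log(np.tanh(beta))
K_gauss = gaussian_kernel_binary(B, B, theta)

print(np.allclose(K_diff, K_gauss))
```

With genotypes coded 0/1/2 and Euclidean distance, the two kernels no longer coincide exactly, which is the setting where an empirical comparison such as the one reported here becomes informative.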

CONCLUSIONS

We conclude that the ability of a diffusion kernel to capture the total genetic variance is not better than that of a Gaussian kernel, at least for these data. Although the diffusion kernel as a choice of basis function may have potential for use in whole-genome prediction, our results imply that embedding genetic markers into a non-Euclidean metric space has a very small impact on prediction. Our results suggest that use of the black-box Gaussian kernel is justified, given its connection to the diffusion kernel and its similar predictive performance.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/07f5/3706293/6121b96c3bd2/1297-9686-45-17-1.jpg



