• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在复杂性状基因组预测模型中考虑重叠注释。

Accounting for overlapping annotations in genomic prediction models of complex traits.

机构信息

INRAE, AgroParisTech, GABI, Université Paris-Saclay, Allée de Vilvert, 78350, Jouy-en-Josas, France.

GenPhySE, INRAE, ENVT, Université de Toulouse, 31320, Castanet Tolosan, France.

出版信息

BMC Bioinformatics. 2022 Sep 6;23(1):365. doi: 10.1186/s12859-022-04914-5.

DOI:10.1186/s12859-022-04914-5
PMID:36068513
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9446854/
Abstract

BACKGROUND

It is now widespread in livestock and plant breeding to use genotyping data to predict phenotypes with genomic prediction models. In parallel, genomic annotations related to a variety of traits are increasing in number and granularity, providing valuable insight into potentially important positions in the genome. The BayesRC model integrates this prior biological information by factorizing the genome according to disjoint annotation categories, in some cases enabling improved prediction of heritable traits. However, BayesRC is not adapted to cases where markers may have multiple annotations.

RESULTS

We propose two novel Bayesian approaches to account for multi-annotated markers through a cumulative (BayesRC+) or preferential (BayesRC[Formula: see text]) model of the contribution of multiple annotation categories. We illustrate their performance on simulated data with various genetic architectures and types of annotations. We also explore their use on data from a backcross population of growing pigs in conjunction with annotations constructed using the PigQTLdb. In both simulated and real data, we observed a modest improvement in prediction quality with our models when used with informative annotations. In addition, our results show that BayesRC+ successfully prioritizes multi-annotated markers according to their posterior variance, while BayesRC[Formula: see text] provides a useful interpretation of informative annotations for multi-annotated markers. Finally, we explore several strategies for constructing annotations from a public database, highlighting the importance of careful consideration of this step.

CONCLUSION

When used with annotations that are relevant to the trait under study, BayesRC[Formula: see text] and BayesRC+ allow for improved prediction and prioritization of multi-annotated markers, and can provide useful biological insight into the genetic architecture of traits.

摘要

背景

在畜牧业和植物育种中,利用基因分型数据通过基因组预测模型来预测表型已经非常普遍。与此同时,与各种性状相关的基因组注释数量和粒度都在增加,这为理解基因组中潜在的重要位置提供了有价值的信息。BayesRC 模型通过根据不相交的注释类别对基因组进行因子分解来整合这种先验的生物学信息,在某些情况下可以提高对可遗传性状的预测能力。然而,BayesRC 并不适应于标记可能具有多个注释的情况。

结果

我们提出了两种新的贝叶斯方法,通过对多个注释类别的累积(BayesRC+)或优先(BayesRC[Formula: see text])模型来处理多注释标记。我们通过模拟具有不同遗传结构和注释类型的数据来展示它们的性能。我们还探索了它们在结合使用 PigQTLdb 构建注释的生长猪回交群体数据上的应用。在模拟和真实数据中,当我们使用信息丰富的注释时,我们的模型在预测质量上都观察到了适度的提高。此外,我们的结果表明,BayesRC+ 可以根据其后验方差成功地对多注释标记进行优先级排序,而 BayesRC[Formula: see text]则为多注释标记的信息丰富的注释提供了有用的解释。最后,我们探索了从公共数据库构建注释的几种策略,强调了仔细考虑这一步骤的重要性。

结论

当与与研究性状相关的注释一起使用时,BayesRC[Formula: see text]和 BayesRC+可以提高多注释标记的预测和优先级排序,并为性状的遗传结构提供有用的生物学见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d07a/9446854/6b2efb9396ec/12859_2022_4914_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d07a/9446854/f55342292f29/12859_2022_4914_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d07a/9446854/b052172699cf/12859_2022_4914_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d07a/9446854/5f75422f227d/12859_2022_4914_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d07a/9446854/3dd9daa609d4/12859_2022_4914_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d07a/9446854/40a537d67f47/12859_2022_4914_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d07a/9446854/8cdb0c358715/12859_2022_4914_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d07a/9446854/6b2efb9396ec/12859_2022_4914_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d07a/9446854/f55342292f29/12859_2022_4914_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d07a/9446854/b052172699cf/12859_2022_4914_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d07a/9446854/5f75422f227d/12859_2022_4914_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d07a/9446854/3dd9daa609d4/12859_2022_4914_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d07a/9446854/40a537d67f47/12859_2022_4914_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d07a/9446854/8cdb0c358715/12859_2022_4914_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d07a/9446854/6b2efb9396ec/12859_2022_4914_Fig7_HTML.jpg

相似文献

1
Accounting for overlapping annotations in genomic prediction models of complex traits.在复杂性状基因组预测模型中考虑重叠注释。
BMC Bioinformatics. 2022 Sep 6;23(1):365. doi: 10.1186/s12859-022-04914-5.
2
Exploiting biological priors and sequence variants enhances QTL discovery and genomic prediction of complex traits.利用生物学先验知识和序列变异可增强复杂性状的数量性状基因座发现及基因组预测。
BMC Genomics. 2016 Feb 27;17:144. doi: 10.1186/s12864-016-2443-6.
3
A Multiple-Trait Bayesian Lasso for Genome-Enabled Analysis and Prediction of Complex Traits.用于基于基因组的复杂性状分析与预测的多性状贝叶斯套索法
Genetics. 2020 Feb;214(2):305-331. doi: 10.1534/genetics.119.302934. Epub 2019 Dec 26.
4
Genomic prediction based on selected variants from imputed whole-genome sequence data in Australian sheep populations.基于澳大利亚绵羊群体中推断的全基因组序列数据中选择的变体的基因组预测。
Genet Sel Evol. 2019 Dec 5;51(1):72. doi: 10.1186/s12711-019-0514-2.
5
Application of Bayesian genomic prediction methods to genome-wide association analyses.贝叶斯基因组预测方法在全基因组关联分析中的应用。
Genet Sel Evol. 2022 May 13;54(1):31. doi: 10.1186/s12711-022-00724-8.
6
Genome-enabled prediction of meat and carcass traits using Bayesian regression, single-step genomic best linear unbiased prediction and blending methods in Nelore cattle.利用贝叶斯回归、单步基因组最佳线性无偏预测和混合方法,基于基因组预测牛肉和胴体性状在尼洛拉牛中的应用。
Animal. 2021 Jan;15(1):100006. doi: 10.1016/j.animal.2020.100006. Epub 2020 Dec 10.
7
Bayesian Genomic Prediction with Genotype × Environment Interaction Kernel Models.使用基因型×环境互作核模型的贝叶斯基因组预测
G3 (Bethesda). 2017 Jan 5;7(1):41-53. doi: 10.1534/g3.116.035584.
8
Accuracy of prediction of simulated polygenic phenotypes and their underlying quantitative trait loci genotypes using real or imputed whole-genome markers in cattle.利用真实或推算的全基因组标记预测牛模拟多基因表型及其潜在数量性状位点基因型的准确性。
Genet Sel Evol. 2015 Dec 23;47:99. doi: 10.1186/s12711-015-0179-4.
9
Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits.对相邻 SNP 组的异质(共)方差进行建模可提高牛奶蛋白成分性状的基因组预测能力。
Genet Sel Evol. 2017 Dec 5;49(1):89. doi: 10.1186/s12711-017-0364-8.
10
Leveraging functional genomic annotations and genome coverage to improve polygenic prediction of complex traits within and between ancestries.利用功能基因组注释和基因组覆盖度提高在不同祖源内和之间的复杂性状的多基因预测。
Nat Genet. 2024 May;56(5):767-777. doi: 10.1038/s41588-024-01704-y. Epub 2024 Apr 30.

引用本文的文献

1
DeepAnnotation: A novel interpretable deep learning-based genomic selection model that integrates comprehensive functional annotations.深度注释:一种基于深度学习的新型可解释基因组选择模型,该模型整合了全面的功能注释。
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf083.
2
Mammary gland multi-omics data reveals new genetic insights into milk production traits in dairy cattle.乳腺多组学数据揭示了奶牛产奶性状的新遗传见解。
PLoS Genet. 2025 Apr 17;21(4):e1011675. doi: 10.1371/journal.pgen.1011675. eCollection 2025 Apr.
3
Evaluation of genomic selection models using whole genome sequence data and functional annotation in Belgian Blue cattle.

本文引用的文献

1
Bringing the Animal QTLdb and CorrDB into the future: meeting new challenges and providing updated services.将动物 QTLdb 和 CorrDB 带入未来:迎接新挑战,提供更新的服务。
Nucleic Acids Res. 2022 Jan 7;50(D1):D956-D961. doi: 10.1093/nar/gkab1116.
2
An evaluation of the predictive performance and mapping power of the BayesR model for genomic prediction.贝叶斯 R 模型在基因组预测中的预测性能和映射能力评估。
G3 (Bethesda). 2021 Oct 19;11(11). doi: 10.1093/g3journal/jkab225.
3
Leveraging Multiple Layers of Data To Predict Complex Traits.
利用全基因组序列数据和功能注释评估比利时蓝牛的基因组选择模型
Genet Sel Evol. 2025 Mar 4;57(1):10. doi: 10.1186/s12711-025-00955-5.
4
Improvement in genomic prediction of maize with prior gene ontology information depends on traits and environmental conditions.利用先前的基因本体信息改进玉米基因组预测取决于性状和环境条件。
Plant Genome. 2025 Mar;18(1):e20553. doi: 10.1002/tpg2.20553.
利用多层数据预测复杂性状。
G3 (Bethesda). 2020 Dec 3;10(12):4599-4613. doi: 10.1534/g3.120.401847.
4
Genotype by environment interactions for performance and thermoregulation responses in growing pigs1,2.生长猪生产性能和体温调节反应的基因型与环境互作研究 1,2.
J Anim Sci. 2019 Sep 3;97(9):3699-3713. doi: 10.1093/jas/skz245.
5
Integrating Gene Expression Data Into Genomic Prediction.将基因表达数据整合到基因组预测中。
Front Genet. 2019 Feb 25;10:126. doi: 10.3389/fgene.2019.00126. eCollection 2019.
6
Accelerating crop genetic gains with genomic selection.利用基因组选择加速作物遗传增益。
Theor Appl Genet. 2019 Mar;132(3):669-686. doi: 10.1007/s00122-018-3270-8. Epub 2018 Dec 19.
7
Functional Annotation of Animal Genomes (FAANG): Current Achievements and Roadmap.动物基因组功能注释(FAANG):现状与未来规划。
Annu Rev Anim Biosci. 2019 Feb 15;7:65-88. doi: 10.1146/annurev-animal-020518-114913. Epub 2018 Nov 14.
8
DNA sequencing technologies: 2006-2016.DNA 测序技术:2006-2016 年。
Nat Protoc. 2017 Feb;12(2):213-218. doi: 10.1038/nprot.2016.182. Epub 2017 Jan 5.
9
Genomic Prediction for Quantitative Traits Is Improved by Mapping Variants to Gene Ontology Categories in Drosophila melanogaster.通过将变异映射到黑腹果蝇的基因本体类别来改进数量性状的基因组预测。
Genetics. 2016 Aug;203(4):1871-83. doi: 10.1534/genetics.116.187161. Epub 2016 May 27.
10
Exploiting biological priors and sequence variants enhances QTL discovery and genomic prediction of complex traits.利用生物学先验知识和序列变异可增强复杂性状的数量性状基因座发现及基因组预测。
BMC Genomics. 2016 Feb 27;17:144. doi: 10.1186/s12864-016-2443-6.