• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MultiBLUP:基于单核苷酸多态性(SNP)的复杂性状预测方法的改进

MultiBLUP: improved SNP-based prediction for complex traits.

作者信息

Speed Doug, Balding David J

机构信息

UCL Genetics Institute, University College London, London WC1E 6BT, United Kingdom

UCL Genetics Institute, University College London, London WC1E 6BT, United Kingdom.

出版信息

Genome Res. 2014 Sep;24(9):1550-7. doi: 10.1101/gr.169375.113. Epub 2014 Jun 24.

DOI:10.1101/gr.169375.113
PMID:24963154
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4158754/
Abstract

BLUP (best linear unbiased prediction) is widely used to predict complex traits in plant and animal breeding, and increasingly in human genetics. The BLUP mathematical model, which consists of a single random effect term, was adequate when kinships were measured from pedigrees. However, when genome-wide SNPs are used to measure kinships, the BLUP model implicitly assumes that all SNPs have the same effect-size distribution, which is a severe and unnecessary limitation. We propose MultiBLUP, which extends the BLUP model to include multiple random effects, allowing greatly improved prediction when the random effects correspond to classes of SNPs with distinct effect-size variances. The SNP classes can be specified in advance, for example, based on SNP functional annotations, and we also provide an adaptive procedure for determining a suitable partition of SNPs. We apply MultiBLUP to genome-wide association data from the Wellcome Trust Case Control Consortium (seven diseases), and from much larger studies of celiac disease and inflammatory bowel disease, finding that it consistently provides better prediction than alternative methods. Moreover, MultiBLUP is computationally very efficient; for the largest data set, which includes 12,678 individuals and 1.5 M SNPs, the total analysis can be run on a single desktop PC in less than a day and can be parallelized to run even faster. Tools to perform MultiBLUP are freely available in our software LDAK.

摘要

最佳线性无偏预测(BLUP)在动植物育种中被广泛用于预测复杂性状,在人类遗传学中的应用也日益增多。当通过系谱测量亲缘关系时,由单个随机效应项组成的BLUP数学模型是适用的。然而,当使用全基因组单核苷酸多态性(SNP)来测量亲缘关系时,BLUP模型隐含地假设所有SNP具有相同的效应大小分布,这是一个严格且不必要的限制。我们提出了多BLUP(MultiBLUP),它扩展了BLUP模型以包含多个随机效应,当随机效应对应于具有不同效应大小方差的SNP类别时,能大大提高预测效果。SNP类别可以提前指定,例如基于SNP功能注释,并且我们还提供了一种自适应程序来确定合适的SNP划分。我们将多BLUP应用于来自威康信托病例对照协会(七种疾病)的全基因组关联数据,以及来自乳糜泻和炎症性肠病的更大规模研究的数据,发现它始终比其他方法提供更好的预测。此外,多BLUP在计算上非常高效;对于包含12,678个个体和150万个SNP的最大数据集,整个分析可以在一台台式电脑上不到一天的时间内运行完成,并且可以并行运行以更快完成。执行多BLUP的工具可在我们的软件LDAK中免费获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c09/4158754/5c1a4bdd67c3/1550fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c09/4158754/5c1a4bdd67c3/1550fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c09/4158754/5c1a4bdd67c3/1550fig1.jpg

相似文献

1
MultiBLUP: improved SNP-based prediction for complex traits.MultiBLUP:基于单核苷酸多态性(SNP)的复杂性状预测方法的改进
Genome Res. 2014 Sep;24(9):1550-7. doi: 10.1101/gr.169375.113. Epub 2014 Jun 24.
2
Genomic prediction of breeding values using previously estimated SNP variances.利用先前估计的单核苷酸多态性(SNP)方差进行育种值的基因组预测。
Genet Sel Evol. 2014 Sep 25;46(1):52. doi: 10.1186/s12711-014-0052-x.
3
Performances of Adaptive MultiBLUP, Bayesian regressions, and weighted-GBLUP approaches for genomic predictions in Belgian Blue beef cattle.对比利时蓝牛肉牛进行基因组预测时,自适应多 BLUP、贝叶斯回归和加权 GBLUP 方法的表现。
BMC Genomics. 2020 Aug 6;21(1):545. doi: 10.1186/s12864-020-06921-3.
4
Whole-genome sequence-based genomic prediction in laying chickens with different genomic relationship matrices to account for genetic architecture.利用不同基因组关系矩阵在蛋鸡中基于全基因组序列进行基因组预测以考虑遗传结构。
Genet Sel Evol. 2017 Jan 16;49(1):8. doi: 10.1186/s12711-016-0277-y.
5
Simultaneous fitting of genomic-BLUP and Bayes-C components in a genomic prediction model.在基因组预测模型中同时拟合基因组最佳线性无偏预测(genomic-BLUP)和贝叶斯C(Bayes-C)成分
Genet Sel Evol. 2017 Aug 24;49(1):63. doi: 10.1186/s12711-017-0339-9.
6
Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits.对相邻 SNP 组的异质(共)方差进行建模可提高牛奶蛋白成分性状的基因组预测能力。
Genet Sel Evol. 2017 Dec 5;49(1):89. doi: 10.1186/s12711-017-0364-8.
7
The Usage of an SNP-SNP Relationship Matrix for Best Linear Unbiased Prediction (BLUP) Analysis Using a Community-Based Cohort Study.基于社区队列研究的单核苷酸多态性-单核苷酸多态性关系矩阵在最佳线性无偏预测(BLUP)分析中的应用
Genomics Inform. 2014 Dec;12(4):254-60. doi: 10.5808/GI.2014.12.4.254. Epub 2014 Dec 31.
8
Genome-wide Association Study (GWAS) and Its Application for Improving the Genomic Estimated Breeding Values (GEBV) of the Berkshire Pork Quality Traits.全基因组关联研究(GWAS)及其在提高伯克夏猪品质性状基因组估计育种值(GEBV)中的应用。
Asian-Australas J Anim Sci. 2015 Nov;28(11):1551-7. doi: 10.5713/ajas.15.0287.
9
Exploring the genetic architecture and improving genomic prediction accuracy for mastitis and milk production traits in dairy cattle by mapping variants to hepatic transcriptomic regions responsive to intra-mammary infection.通过将变异映射到对乳房内感染有反应的肝脏转录组区域,探索奶牛乳腺炎和产奶性状的遗传结构并提高基因组预测准确性。
Genet Sel Evol. 2017 May 12;49(1):44. doi: 10.1186/s12711-017-0319-0.
10
Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs.利用基于插补的全基因组测序数据提高猪的组合群体基因组预测的准确性。
Genet Sel Evol. 2019 Oct 21;51(1):58. doi: 10.1186/s12711-019-0500-8.

引用本文的文献

1
MIXED MODELING APPROACH FOR CHARACTERIZING THE GENETIC EFFECTS IN A LONGITUDINAL PHENOTYPE.用于表征纵向表型遗传效应的混合建模方法
Ann Appl Stat. 2025 Sep;19(3):2070-2087. doi: 10.1214/25-aoas2033. Epub 2025 Aug 28.
2
DeepAnnotation: A novel interpretable deep learning-based genomic selection model that integrates comprehensive functional annotations.深度注释:一种基于深度学习的新型可解释基因组选择模型,该模型整合了全面的功能注释。
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf083.
3
Impact of Selection Signature on Genomic Prediction and Heritability Estimation in Livestock.

本文引用的文献

1
Accurate and robust genomic prediction of celiac disease using statistical learning.使用统计学习对乳糜泻进行准确且稳健的基因组预测。
PLoS Genet. 2014 Feb 13;10(2):e1004137. doi: 10.1371/journal.pgen.1004137. eCollection 2014 Feb.
2
Improving the efficiency of genomic selection.提高基因组选择的效率。
Stat Appl Genet Mol Biol. 2013 Aug;12(4):517-27. doi: 10.1515/sagmb-2013-0002.
3
Improving coeliac disease risk prediction by testing non-HLA variants additional to HLA variants.通过检测 HLA 变体以外的非 HLA 变体来提高乳糜泻风险预测。
选择印记对家畜基因组预测和遗传力估计的影响。
Animals (Basel). 2025 May 10;15(10):1383. doi: 10.3390/ani15101383.
4
Mammary gland multi-omics data reveals new genetic insights into milk production traits in dairy cattle.乳腺多组学数据揭示了奶牛产奶性状的新遗传见解。
PLoS Genet. 2025 Apr 17;21(4):e1011675. doi: 10.1371/journal.pgen.1011675. eCollection 2025 Apr.
5
Evaluation of genomic selection models using whole genome sequence data and functional annotation in Belgian Blue cattle.利用全基因组序列数据和功能注释评估比利时蓝牛的基因组选择模型
Genet Sel Evol. 2025 Mar 4;57(1):10. doi: 10.1186/s12711-025-00955-5.
6
Integrating Significant SNPs Identified by GWAS for Genomic Prediction of the Number of Ribs and Carcass Length in Suhuai Pigs.整合全基因组关联研究(GWAS)鉴定出的显著单核苷酸多态性(SNPs)用于苏淮猪肋骨数和胴体长度的基因组预测
Animals (Basel). 2025 Feb 2;15(3):412. doi: 10.3390/ani15030412.
7
Massively parallel variant-to-function mapping determines functional regulatory variants of non-small cell lung cancer.大规模平行变异到功能映射确定非小细胞肺癌的功能调控变异
Nat Commun. 2025 Feb 6;16(1):1391. doi: 10.1038/s41467-025-56725-w.
8
Improvement of the accuracy of breeding value prediction for egg production traits in Muscovy duck using low-coverage whole-genome sequence data.利用低覆盖度全基因组序列数据提高番鸭产蛋性状育种值预测的准确性
Poult Sci. 2025 Feb;104(2):104812. doi: 10.1016/j.psj.2025.104812. Epub 2025 Jan 15.
9
Leveraging haplotype information in heritability estimation and polygenic prediction.在遗传力估计和多基因预测中利用单倍型信息。
Nat Commun. 2025 Jan 2;16(1):126. doi: 10.1038/s41467-024-55477-3.
10
Construction and evaluation of a polygenic hazard score for prognostic assessment in localized gastric cancer.用于局部胃癌预后评估的多基因风险评分的构建与评估
Fundam Res. 2022 Nov 9;4(5):1331-1338. doi: 10.1016/j.fmre.2022.09.031. eCollection 2024 Sep.
Gut. 2014 Mar;63(3):415-22. doi: 10.1136/gutjnl-2012-304110. Epub 2013 May 23.
4
A powerful and efficient set test for genetic markers that handles confounders.一种强大而有效的遗传标记集测试方法,可处理混杂因素。
Bioinformatics. 2013 Jun 15;29(12):1526-33. doi: 10.1093/bioinformatics/btt177. Epub 2013 Apr 18.
5
Polygenic modeling with bayesian sparse linear mixed models.贝叶斯稀疏线性混合模型的多基因建模。
PLoS Genet. 2013;9(2):e1003264. doi: 10.1371/journal.pgen.1003264. Epub 2013 Feb 7.
6
Advanced complex trait analysis.高级复杂性状分析。
Bioinformatics. 2012 Dec 1;28(23):3134-6. doi: 10.1093/bioinformatics/bts571. Epub 2012 Sep 27.
7
Whole-genome regression and prediction methods applied to plant and animal breeding.全基因组回归和预测方法在动植物育种中的应用。
Genetics. 2013 Feb;193(2):327-45. doi: 10.1534/genetics.112.143313. Epub 2012 Jun 28.
8
Improved linear mixed models for genome-wide association studies.用于全基因组关联研究的改进线性混合模型。
Nat Methods. 2012 May 30;9(6):525-6. doi: 10.1038/nmeth.2037.
9
The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups.2000 个乳腺肿瘤的基因组和转录组结构揭示了新的亚群。
Nature. 2012 Apr 18;486(7403):346-52. doi: 10.1038/nature10983.
10
Genotype imputation with thousands of genomes.使用数千份基因组进行基因型推断。
G3 (Bethesda). 2011 Nov;1(6):457-70. doi: 10.1534/g3.111.001198. Epub 2011 Nov 1.