
Improved polygenic prediction by Bayesian multiple regression on summary statistics.

Affiliations

Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, 4072, QLD, Australia.

Estonian Genome Center, Institute of Genomics, University of Tartu, Riia 23b, 51010, Tartu, Estonia.

Publication information

Nat Commun. 2019 Nov 8;10(1):5086. doi: 10.1038/s41467-019-12653-0.

DOI:10.1038/s41467-019-12653-0
PMID:31704910
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6841727/
Abstract

Accurate prediction of an individual's phenotype from their DNA sequence is one of the great promises of genomics and precision medicine. We extend a powerful individual-level data Bayesian multiple regression model (BayesR) to one that utilises summary statistics from genome-wide association studies (GWAS), SBayesR. In simulation and cross-validation using 12 real traits and 1.1 million variants on 350,000 individuals from the UK Biobank, SBayesR improves prediction accuracy relative to commonly used state-of-the-art summary statistics methods at a fraction of the computational resources. Furthermore, using summary statistics for variants from the largest GWAS meta-analysis (n ≈ 700,000) on height and BMI, we show that, on average across traits and two independent data sets, SBayesR improves prediction R² by 5.2% relative to LDpred and by 26.5% relative to clumping and p value thresholding.
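The abstract reports accuracy as prediction R² and as a percentage change against a baseline method. The sketch below illustrates those two quantities only. It is not SBayesR and not the authors' code. It assumes prediction R² is the squared Pearson correlation between a polygenic score and the observed phenotype, and it reads "improves by 5.2% relative to" as a relative change in R². The function names and the toy numbers are hypothetical.

```python
def polygenic_score(allele_counts, effect_sizes):
    """Weighted sum of one individual's allele counts (0, 1 or 2 per variant)."""
    return sum(g * b for g, b in zip(allele_counts, effect_sizes))


def prediction_r2(predicted, observed):
    """Squared Pearson correlation between predicted and observed values."""
    n = len(predicted)
    mean_p = sum(predicted) / n
    mean_o = sum(observed) / n
    cov = sum((p - mean_p) * (o - mean_o) for p, o in zip(predicted, observed))
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    var_o = sum((o - mean_o) ** 2 for o in observed)
    return cov * cov / (var_p * var_o)


def relative_improvement(r2_new, r2_baseline):
    """Relative change in R²; 0.052 would correspond to '5.2%'."""
    return (r2_new - r2_baseline) / r2_baseline


# Toy data, made up for illustration: 4 individuals, 3 variants.
genotypes = [[0, 1, 2], [1, 1, 0], [2, 0, 1], [0, 2, 2]]
phenotypes = [0.5, 0.1, 1.2, 0.4]

# Two hypothetical sets of per-variant effect estimates, standing in for
# the output of a baseline method and of an improved method.
baseline_effects = [0.4, -0.1, 0.1]
improved_effects = [0.5, -0.2, 0.3]

baseline_scores = [polygenic_score(g, baseline_effects) for g in genotypes]
improved_scores = [polygenic_score(g, improved_effects) for g in genotypes]

r2_baseline = prediction_r2(baseline_scores, phenotypes)
r2_improved = prediction_r2(improved_scores, phenotypes)
print(r2_baseline, r2_improved, relative_improvement(r2_improved, r2_baseline))
```

In a real evaluation the effect estimates come from the GWAS summary statistics method being compared, and R² is computed in an independent validation sample, as the abstract describes.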


Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fdb/6841727/38d11633f125/41467_2019_12653_Fig1_HTML.jpg
Figure 2: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fdb/6841727/ec163d389818/41467_2019_12653_Fig2_HTML.jpg
Figure 3: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5fdb/6841727/875b8a7eca90/41467_2019_12653_Fig3_HTML.jpg

Similar articles

1
Improved polygenic prediction by Bayesian multiple regression on summary statistics.
Nat Commun. 2019 Nov 8;10(1):5086. doi: 10.1038/s41467-019-12653-0.
2
Incorporating functional priors improves polygenic prediction accuracy in UK Biobank and 23andMe data sets.
Nat Commun. 2021 Oct 18;12(1):6052. doi: 10.1038/s41467-021-25171-9.
3
Fast and accurate Bayesian polygenic risk modeling with variational inference.
Am J Hum Genet. 2023 May 4;110(5):741-761. doi: 10.1016/j.ajhg.2023.03.009. Epub 2023 Apr 7.
4
Leveraging functional genomic annotations and genome coverage to improve polygenic prediction of complex traits within and between ancestries.
Nat Genet. 2024 May;56(5):767-777. doi: 10.1038/s41588-024-01704-y. Epub 2024 Apr 30.
5
Leveraging both individual-level genetic data and GWAS summary statistics increases polygenic prediction.
Am J Hum Genet. 2021 Jun 3;108(6):1001-1011. doi: 10.1016/j.ajhg.2021.04.014. Epub 2021 May 7.
6
Making the Most of Clumping and Thresholding for Polygenic Scores.
Am J Hum Genet. 2019 Dec 5;105(6):1213-1221. doi: 10.1016/j.ajhg.2019.11.001. Epub 2019 Nov 21.
7
Fine mapping and accurate prediction of complex traits using Bayesian Variable Selection models applied to biobank-size data.
Eur J Hum Genet. 2023 Mar;31(3):313-320. doi: 10.1038/s41431-022-01135-5. Epub 2022 Jul 19.
8
Improved genetic prediction of complex traits from individual-level data or summary statistics.
Nat Commun. 2021 Jul 7;12(1):4192. doi: 10.1038/s41467-021-24485-y.
9
Identity informative SNP associations in the UK Biobank.
Forensic Sci Int Genet. 2019 Sep;42:45-48. doi: 10.1016/j.fsigen.2019.06.007. Epub 2019 Jun 14.
10
Haplotype function score improves biological interpretation and cross-ancestry polygenic prediction of human complex traits.
Elife. 2024 Apr 19;12:RP92574. doi: 10.7554/eLife.92574.

Cited by

1
Genetics and Socioeconomic Status: Some Preliminary Evidence on Mechanisms.
J Polit Econ Microecon. 2025 Aug;3(3). doi: 10.1086/732835. Epub 2025 Jul 16.
2
Mixed modeling approach for characterizing the genetic effects in a longitudinal phenotype.
Ann Appl Stat. 2025 Sep;19(3):2070-2087. doi: 10.1214/25-aoas2033. Epub 2025 Aug 28.
3
Association between polygenic risk for Major Depression and brain structure in a mega-analysis of 50,975 participants across 11 studies.
Mol Psychiatry. 2025 Aug 19. doi: 10.1038/s41380-025-03136-4.
4
Genomic risk prediction for depression in a large prospective study of older adults of European descent.
Mol Psychiatry. 2025 Aug 6. doi: 10.1038/s41380-025-03145-3.
5
Uncovering the multivariate genetic architecture of frailty with genomic structural equation modeling.
Nat Genet. 2025 Aug 4. doi: 10.1038/s41588-025-02269-0.
6
Enhanced genetic fine mapping accuracy with Bayesian Linear Regression models in diverse genetic architectures.
PLoS Genet. 2025 Jul 30;21(7):e1011783. doi: 10.1371/journal.pgen.1011783. eCollection 2025 Jul.
7
Disentangling the comorbidity between allergic disease and type 1 diabetes using genetically informative designs.
J Allergy Clin Immunol Glob. 2025 Jun 23;4(4):100519. doi: 10.1016/j.jacig.2025.100519. eCollection 2025 Nov.
8
The association of a polygenic lifespan score with the risk of common age-related diseases and mortality.
J Gerontol A Biol Sci Med Sci. 2025 Aug 23;80(9). doi: 10.1093/gerona/glaf156.
9
Genome-wide association meta-regression identifies stem cell lineage orchestration as a key driver of acne risk.
medRxiv. 2025 Jun 28:2025.06.27.25330406. doi: 10.1101/2025.06.27.25330406.
10
PGSFusion streamlines polygenic score construction and epidemiological applications in biobank-scale cohorts.
Genome Med. 2025 Jul 14;17(1):77. doi: 10.1186/s13073-025-01505-w.

References

1
SumHer better estimates the SNP heritability of complex traits from summary statistics.
Nat Genet. 2019 Feb;51(2):277-284. doi: 10.1038/s41588-018-0279-5. Epub 2018 Dec 3.
2
The UK Biobank resource with deep phenotyping and genomic data.
Nature. 2018 Oct;562(7726):203-209. doi: 10.1038/s41586-018-0579-z. Epub 2018 Oct 10.
3
Accurate Genomic Prediction of Human Height.
Genetics. 2018 Oct;210(2):477-497. doi: 10.1534/genetics.118.301267. Epub 2018 Aug 27.
4
Meta-analysis of genome-wide association studies for height and body mass index in ∼700000 individuals of European ancestry.
Hum Mol Genet. 2018 Oct 15;27(20):3641-3649. doi: 10.1093/hmg/ddy271.
5
Estimation of complex effect-size distributions using summary-level statistics from genome-wide association studies across 32 complex traits.
Nat Genet. 2018 Sep;50(9):1318-1326. doi: 10.1038/s41588-018-0193-x. Epub 2018 Aug 13.
6
Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals.
Nat Genet. 2018 Jul 23;50(8):1112-1121. doi: 10.1038/s41588-018-0147-3.
7
Estimating SNP-Based Heritability and Genetic Correlation in Case-Control Studies Directly and with Summary Statistics.
Am J Hum Genet. 2018 Jul 5;103(1):89-99. doi: 10.1016/j.ajhg.2018.06.002.
8
Mixed-model association for biobank-scale datasets.
Nat Genet. 2018 Jul;50(7):906-908. doi: 10.1038/s41588-018-0144-6.
9
The personal and clinical utility of polygenic risk scores.
Nat Rev Genet. 2018 Sep;19(9):581-590. doi: 10.1038/s41576-018-0018-x.
10
Signatures of negative selection in the genetic architecture of human complex traits.
Nat Genet. 2018 May;50(5):746-753. doi: 10.1038/s41588-018-0101-4. Epub 2018 Apr 16.