• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用 GWAS 汇总统计数据优化和基准化多基因风险评分。

Optimizing and benchmarking polygenic risk scores with GWAS summary statistics.

机构信息

Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, USA.

Department of Statistics, University of Wisconsin-Madison, Madison, WI, USA.

出版信息

Genome Biol. 2024 Oct 8;25(1):260. doi: 10.1186/s13059-024-03400-w.

DOI:10.1186/s13059-024-03400-w
PMID:39379999
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11462675/
Abstract

BACKGROUND

Polygenic risk score (PRS) is a major research topic in human genetics. However, a significant gap exists between PRS methodology and applications in practice due to often unavailable individual-level data for various PRS tasks including model fine-tuning, benchmarking, and ensemble learning.

RESULTS

We introduce an innovative statistical framework to optimize and benchmark PRS models using summary statistics of genome-wide association studies. This framework builds upon our previous work and can fine-tune virtually all existing PRS models while accounting for linkage disequilibrium. In addition, we provide an ensemble learning strategy named PUMAS-ensemble to combine multiple PRS models into an ensemble score without requiring external data for model fitting. Through extensive simulations and analysis of many complex traits in the UK Biobank, we demonstrate that this approach closely approximates gold-standard analytical strategies based on external validation, and substantially outperforms state-of-the-art PRS methods.

CONCLUSIONS

Our method is a powerful and general modeling technique that can continue to combine the best-performing PRS methods out there through ensemble learning and could become an integral component for all future PRS applications.

摘要

背景

多基因风险评分(PRS)是人类遗传学的一个主要研究课题。然而,由于各种 PRS 任务(包括模型微调、基准测试和集成学习)通常无法获得个体水平的数据,因此 PRS 方法学与实践应用之间存在显著差距。

结果

我们引入了一种创新的统计框架,使用全基因组关联研究的汇总统计数据来优化和基准测试 PRS 模型。该框架建立在我们之前的工作基础上,可以微调几乎所有现有的 PRS 模型,同时考虑到连锁不平衡。此外,我们提供了一种名为 PUMAS-ensemble 的集成学习策略,用于将多个 PRS 模型组合成一个集成分数,而无需外部数据进行模型拟合。通过对 UK Biobank 中的许多复杂特征进行广泛的模拟和分析,我们证明了这种方法非常接近基于外部验证的黄金标准分析策略,并且大大优于最先进的 PRS 方法。

结论

我们的方法是一种强大而通用的建模技术,可以通过集成学习继续结合表现最佳的 PRS 方法,并且可能成为所有未来 PRS 应用的一个组成部分。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/691a/11462675/59880364866b/13059_2024_3400_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/691a/11462675/76a9db01ef23/13059_2024_3400_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/691a/11462675/8245c32e5fdd/13059_2024_3400_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/691a/11462675/e4d4f50a5ef5/13059_2024_3400_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/691a/11462675/30bf65f314b1/13059_2024_3400_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/691a/11462675/59880364866b/13059_2024_3400_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/691a/11462675/76a9db01ef23/13059_2024_3400_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/691a/11462675/8245c32e5fdd/13059_2024_3400_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/691a/11462675/e4d4f50a5ef5/13059_2024_3400_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/691a/11462675/30bf65f314b1/13059_2024_3400_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/691a/11462675/59880364866b/13059_2024_3400_Fig5_HTML.jpg

相似文献

1
Optimizing and benchmarking polygenic risk scores with GWAS summary statistics.利用 GWAS 汇总统计数据优化和基准化多基因风险评分。
Genome Biol. 2024 Oct 8;25(1):260. doi: 10.1186/s13059-024-03400-w.
2
PUMAS: fine-tuning polygenic risk scores with GWAS summary statistics.PUMAS:使用 GWAS 汇总统计数据调整多基因风险评分。
Genome Biol. 2021 Sep 6;22(1):257. doi: 10.1186/s13059-021-02479-9.
3
Fast and scalable ensemble learning method for versatile polygenic risk prediction.快速且可扩展的集成学习方法,用于多功能多基因风险预测。
Proc Natl Acad Sci U S A. 2024 Aug 13;121(33):e2403210121. doi: 10.1073/pnas.2403210121. Epub 2024 Aug 7.
4
Tuning parameters for polygenic risk score methods using GWAS summary statistics from training data.使用来自训练数据的 GWAS 汇总统计信息调整多基因风险评分方法的参数。
Nat Commun. 2024 Jan 2;15(1):24. doi: 10.1038/s41467-023-44009-0.
5
Applying polygenic risk score methods to pharmacogenomics GWAS: challenges and opportunities.将多基因风险评分方法应用于药物基因组学全基因组关联研究:挑战与机遇
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad470.
6
Leveraging both individual-level genetic data and GWAS summary statistics increases polygenic prediction.利用个体水平的遗传数据和 GWAS 汇总统计数据可以提高多基因预测。
Am J Hum Genet. 2021 Jun 3;108(6):1001-1011. doi: 10.1016/j.ajhg.2021.04.014. Epub 2021 May 7.
7
Benchmarking Alzheimer's disease prediction: personalised risk assessment using polygenic risk scores across various methodologies and genome-wide studies.阿尔茨海默病预测的基准测试:使用跨多种方法和全基因组研究的多基因风险评分进行个性化风险评估。
Alzheimers Res Ther. 2025 Jan 6;17(1):6. doi: 10.1186/s13195-024-01664-9.
8
Quantifying portable genetic effects and improving cross-ancestry genetic prediction with GWAS summary statistics.量化可移动遗传效应并利用 GWAS 汇总统计数据提高跨血统遗传预测。
Nat Commun. 2023 Feb 14;14(1):832. doi: 10.1038/s41467-023-36544-7.
9
Improving polygenic prediction in ancestrally diverse populations.提高在祖源多样化人群中的多基因预测能力。
Nat Genet. 2022 May;54(5):573-580. doi: 10.1038/s41588-022-01054-7. Epub 2022 May 5.
10
Cancer PRSweb: An Online Repository with Polygenic Risk Scores for Major Cancer Traits and Their Evaluation in Two Independent Biobanks.癌症 PRSweb:一个具有主要癌症特征多基因风险评分的在线知识库及其在两个独立生物库中的评估。
Am J Hum Genet. 2020 Nov 5;107(5):815-836. doi: 10.1016/j.ajhg.2020.08.025. Epub 2020 Sep 28.

引用本文的文献

1
Integrating Imaging-Derived Clinical Endotypes with Plasma Proteomics and External Polygenic Risk Scores Enhances Coronary Microvascular Disease Risk Prediction.将影像学衍生的临床内型与血浆蛋白质组学和外部多基因风险评分相结合可增强冠状动脉微血管疾病风险预测。
medRxiv. 2025 Aug 21:2025.08.18.25333844. doi: 10.1101/2025.08.18.25333844.
2
Almost Free Enhancement of Multi-Population PRS: From Data-Fission to Pseudo-GWAS Subsampling.多群体PRS的近乎免费增强:从数据裂变到伪全基因组关联研究子采样
bioRxiv. 2025 Jun 20:2025.06.16.659952. doi: 10.1101/2025.06.16.659952.
3
Enhancing the discriminatory power of polygenic scores for ADHD and autism in clinical and non-clinical samples.

本文引用的文献

1
An ensemble penalized regression method for multi-ancestry polygenic risk prediction.一种用于多祖裔多基因风险预测的集成惩罚回归方法。
Nat Commun. 2024 Apr 15;15(1):3238. doi: 10.1038/s41467-024-47357-7.
2
MUSSEL: Enhanced Bayesian polygenic risk prediction leveraging information across multiple ancestry groups.基于多祖先群体信息的贝类增强贝叶斯多基因风险预测
Cell Genom. 2024 Apr 10;4(4):100539. doi: 10.1016/j.xgen.2024.100539.
3
Inferring disease architecture and predictive ability with LDpred2-auto.利用 LDpred2-auto 推断疾病结构和预测能力。
增强多基因评分在临床和非临床样本中对注意力缺陷多动障碍和自闭症的鉴别能力。
J Neurodev Disord. 2025 Jun 9;17(1):32. doi: 10.1186/s11689-025-09620-w.
4
PennPRS: a centralized cloud computing platform for efficient polygenic risk score training in precision medicine.宾夕法尼亚多基因风险评分系统:一个用于精准医学中高效多基因风险评分训练的集中式云计算平台。
medRxiv. 2025 Feb 10:2025.02.07.25321875. doi: 10.1101/2025.02.07.25321875.
5
One score to rule them all: regularized ensemble polygenic risk prediction with GWAS summary statistics.一分数统御一切:利用全基因组关联研究汇总统计数据进行正则化集成多基因风险预测
bioRxiv. 2024 Dec 4:2024.11.27.625748. doi: 10.1101/2024.11.27.625748.
6
Leveraging genetic correlations and multiple populations to improve genetic risk prediction for non-European populations.利用遗传相关性和多个群体来改善非欧洲人群的遗传风险预测。
Res Sq. 2023 Dec 25:rs.3.rs-3741763. doi: 10.21203/rs.3.rs-3741763/v1.
7
Testing a Polygenic Risk Score for Morphological Microglial Activation in Alzheimer's Disease and Aging.测试阿尔茨海默病和衰老中形态小胶质细胞激活的多基因风险评分。
J Alzheimers Dis. 2023;94(4):1549-1561. doi: 10.3233/JAD-230434.
Am J Hum Genet. 2023 Dec 7;110(12):2042-2055. doi: 10.1016/j.ajhg.2023.10.010. Epub 2023 Nov 8.
4
A new method for multiancestry polygenic prediction improves performance across diverse populations.一种新的多祖先多基因预测方法可提高不同人群的性能。
Nat Genet. 2023 Oct;55(10):1757-1768. doi: 10.1038/s41588-023-01501-z. Epub 2023 Sep 25.
5
Multi-PGS enhances polygenic prediction by combining 937 polygenic scores.多基因评分聚合(Multi-PGS)通过整合 937 个多基因评分来增强多基因预测。
Nat Commun. 2023 Aug 5;14(1):4702. doi: 10.1038/s41467-023-40330-w.
6
Quantifying portable genetic effects and improving cross-ancestry genetic prediction with GWAS summary statistics.量化可移动遗传效应并利用 GWAS 汇总统计数据提高跨血统遗传预测。
Nat Commun. 2023 Feb 14;14(1):832. doi: 10.1038/s41467-023-36544-7.
7
A saturated map of common genetic variants associated with human height.与人类身高相关的常见遗传变异的饱和图谱。
Nature. 2022 Oct;610(7933):704-712. doi: 10.1038/s41586-022-05275-y. Epub 2022 Oct 12.
8
Identifying and correcting for misspecifications in GWAS summary statistics and polygenic scores.识别并校正全基因组关联研究汇总统计数据和多基因评分中的错误设定。
HGG Adv. 2022 Aug 18;3(4):100136. doi: 10.1016/j.xhgg.2022.100136. eCollection 2022 Oct 13.
9
Challenges and Opportunities for Developing More Generalizable Polygenic Risk Scores.开发更具泛化能力的多基因风险评分的挑战与机遇。
Annu Rev Biomed Data Sci. 2022 Aug 10;5:293-320. doi: 10.1146/annurev-biodatasci-111721-074830. Epub 2022 May 16.
10
Improving polygenic prediction in ancestrally diverse populations.提高在祖源多样化人群中的多基因预测能力。
Nat Genet. 2022 May;54(5):573-580. doi: 10.1038/s41588-022-01054-7. Epub 2022 May 5.