SparsePro：一种整合汇总统计数据和功能注释的高效精细映射方法。

SparsePro: An efficient fine-mapping method integrating summary statistics and functional annotations.

机构信息

Quantitative Life Sciences, McGill University, Montreal, Quebec, Canada.

Department of Human Genetics, McGill University, Montreal, Quebec, Canada.

出版信息

PLoS Genet. 2023 Dec 28;19(12):e1011104. doi: 10.1371/journal.pgen.1011104. eCollection 2023 Dec.

DOI:10.1371/journal.pgen.1011104

PMID:38153934

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10781022/

Abstract

Identifying causal variants from genome-wide association studies (GWAS) is challenging due to widespread linkage disequilibrium (LD) and the possible existence of multiple causal variants in the same genomic locus. Functional annotations of the genome may help to prioritize variants that are biologically relevant and thus improve fine-mapping of GWAS results. Classical fine-mapping methods conducting an exhaustive search of variant-level causal configurations have a high computational cost, especially when the underlying genetic architecture and LD patterns are complex. SuSiE provided an iterative Bayesian stepwise selection algorithm for efficient fine-mapping. In this work, we build connections between SuSiE and a paired mean field variational inference algorithm through the implementation of a sparse projection, and propose effective strategies for estimating hyperparameters and summarizing posterior probabilities. Moreover, we incorporate functional annotations into fine-mapping by jointly estimating enrichment weights to derive functionally-informed priors. We evaluate the performance of SparsePro through extensive simulations using resources from the UK Biobank. Compared to state-of-the-art methods, SparsePro achieved improved power for fine-mapping with reduced computation time. We demonstrate the utility of SparsePro through fine-mapping of five functional biomarkers of clinically relevant phenotypes. In summary, we have developed an efficient fine-mapping method for integrating summary statistics and functional annotations. Our method can have wide utility in understanding the genetics of complex traits and increasing the yield of functional follow-up studies of GWAS. SparsePro software is available on GitHub at https://github.com/zhwm/SparsePro.

摘要

从全基因组关联研究（GWAS）中识别因果变异是具有挑战性的，因为广泛存在连锁不平衡（LD），并且同一基因组位置可能存在多个因果变异。基因组的功能注释可以帮助优先考虑具有生物学相关性的变异，从而提高 GWAS 结果的精细映射。进行变异级因果结构详尽搜索的经典精细映射方法计算成本很高，尤其是当潜在的遗传结构和 LD 模式复杂时。SuSiE 提供了一种迭代贝叶斯逐步选择算法，用于有效的精细映射。在这项工作中，我们通过稀疏投影的实现，在 SuSiE 和配对均值场变分推断算法之间建立联系，并提出了用于估计超参数和总结后验概率的有效策略。此外，我们通过联合估计富集权重，将功能注释纳入精细映射中，以获得功能信息先验。我们使用 UK Biobank 的资源通过广泛的模拟来评估 SparsePro 的性能。与最先进的方法相比，SparsePro 实现了提高精细映射的功效，同时减少了计算时间。我们通过对五个具有临床相关表型的功能生物标志物的精细映射来展示 SparsePro 的实用性。总之，我们开发了一种有效的整合汇总统计和功能注释的精细映射方法。我们的方法在理解复杂性状的遗传学和增加 GWAS 功能后续研究的产量方面具有广泛的应用。SparsePro 软件可在 GitHub 上获得，网址为 https://github.com/zhwm/SparsePro。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c440/10781022/d6d3634e98d8/pgen.1011104.g001.jpg

相似文献

SparsePro: An efficient fine-mapping method integrating summary statistics and functional annotations.

PLoS Genet. 2023 Dec 28;19(12):e1011104. doi: 10.1371/journal.pgen.1011104. eCollection 2023 Dec.

Genetic fine-mapping from summary data using a nonlocal prior improves the detection of multiple causal variants.

Bioinformatics. 2023 Jul 1;39(7). doi: 10.1093/bioinformatics/btad396.

SharePro: an accurate and efficient genetic colocalization method accounting for multiple causal signals.

Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae295.

Integration of expression QTLs with fine mapping via SuSiE.

PLoS Genet. 2024 Jan 25;20(1):e1010929. doi: 10.1371/journal.pgen.1010929. eCollection 2024 Jan.

Improved methods for multi-trait fine mapping of pleiotropic risk loci.

Bioinformatics. 2017 Jan 15;33(2):248-255. doi: 10.1093/bioinformatics/btw615. Epub 2016 Sep 22.

Integrating functional data to prioritize causal variants in statistical fine-mapping studies.

PLoS Genet. 2014 Oct 30;10(10):e1004722. doi: 10.1371/journal.pgen.1004722. eCollection 2014 Oct.

Integrating molecular QTL data into genome-wide genetic association analysis: Probabilistic assessment of enrichment and colocalization.

PLoS Genet. 2017 Mar 9;13(3):e1006646. doi: 10.1371/journal.pgen.1006646. eCollection 2017 Mar.

CARMA is a new Bayesian model for fine-mapping in genome-wide association meta-analyses.

Nat Genet. 2023 Jun;55(6):1057-1065. doi: 10.1038/s41588-023-01392-0. Epub 2023 May 11.

Identification of potential genetic causal variants for obesity-related traits using statistical fine mapping.

Mol Genet Genomics. 2023 Nov;298(6):1309-1319. doi: 10.1007/s00438-023-02055-9. Epub 2023 Jul 27.

Finemap-MiXeR: A variational Bayesian approach for genetic finemapping.

PLoS Genet. 2024 Aug 15;20(8):e1011372. doi: 10.1371/journal.pgen.1011372. eCollection 2024 Aug.

引用本文的文献

Towards improved fine-mapping of candidate causal variants.

Nat Rev Genet. 2025 Jul 28. doi: 10.1038/s41576-025-00869-4.

Funmap: integrating high-dimensional functional annotations to improve fine-mapping.

Bioinformatics. 2024 Dec 26;41(1). doi: 10.1093/bioinformatics/btaf017.

Accounting for genetic effect heterogeneity in fine-mapping and improving power to detect gene-environment interactions with SharePro.

Nat Commun. 2024 Oct 30;15(1):9374. doi: 10.1038/s41467-024-53818-w.

SharePro: an accurate and efficient genetic colocalization method accounting for multiple causal signals.

Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae295.

Integration of expression QTLs with fine mapping via SuSiE.

PLoS Genet. 2024 Jan 25;20(1):e1010929. doi: 10.1371/journal.pgen.1010929. eCollection 2024 Jan.

MESuSiE enables scalable and powerful multi-ancestry fine-mapping of causal variants in genome-wide association studies.

Nat Genet. 2024 Jan;56(1):170-179. doi: 10.1038/s41588-023-01604-7. Epub 2024 Jan 2.

Integration of Expression QTLs with fine mapping via SuSiE.

medRxiv. 2023 Oct 6:2023.10.03.23294486. doi: 10.1101/2023.10.03.23294486.

Fast and accurate Bayesian polygenic risk modeling with variational inference.

Am J Hum Genet. 2023 May 4;110(5):741-761. doi: 10.1016/j.ajhg.2023.03.009. Epub 2023 Apr 7.

Considering strategies for SNP selection in genetic and polygenic risk scores.

Front Genet. 2022 Oct 25;13:900595. doi: 10.3389/fgene.2022.900595. eCollection 2022.

本文引用的文献

A simple new approach to variable selection in regression, with application to genetic fine mapping.

J R Stat Soc Series B Stat Methodol. 2020 Dec;82(5):1273-1300. doi: 10.1111/rssb.12388. Epub 2020 Jul 10.

Fine-mapping from summary data with the "Sum of Single Effects" model.

PLoS Genet. 2022 Jul 19;18(7):e1010299. doi: 10.1371/journal.pgen.1010299. eCollection 2022 Jul.

The trans-ancestral genomic architecture of glycemic traits.

Nat Genet. 2021 Jun;53(6):840-860. doi: 10.1038/s41588-021-00852-9. Epub 2021 May 31.

Genetic analysis in European ancestry individuals identifies 517 loci associated with liver enzymes.

Nat Commun. 2021 May 10;12(1):2579. doi: 10.1038/s41467-021-22338-2.

Genome-wide discovery of genetic loci that uncouple excess adiposity from its comorbidities.

Nat Metab. 2021 Feb;3(2):228-243. doi: 10.1038/s42255-021-00346-2. Epub 2021 Feb 22.

Genome-wide association study of serum liver enzymes implicates diverse metabolic and liver pathology.

Nat Commun. 2021 Feb 5;12(1):816. doi: 10.1038/s41467-020-20870-1.

A genome-wide meta-analysis yields 46 new loci associating with biomarkers of iron homeostasis.

Commun Biol. 2021 Feb 3;4(1):156. doi: 10.1038/s42003-020-01575-z.

Functionally informed fine-mapping and polygenic localization of complex trait heritability.

Nat Genet. 2020 Dec;52(12):1355-1363. doi: 10.1038/s41588-020-00735-5. Epub 2020 Nov 16.

The Polygenic and Monogenic Basis of Blood Traits and Diseases.

Cell. 2020 Sep 3;182(5):1214-1231.e11. doi: 10.1016/j.cell.2020.08.008.

A resource-efficient tool for mixed model association analysis of large-scale data.

Nat Genet. 2019 Dec;51(12):1749-1755. doi: 10.1038/s41588-019-0530-8. Epub 2019 Nov 25.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

SparsePro：一种整合汇总统计数据和功能注释的高效精细映射方法。

SparsePro: An efficient fine-mapping method integrating summary statistics and functional annotations.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献