使用组套索和分数重采样模型平均法对加性和显性单核苷酸多态性效应进行精细定位。

Fine-mapping additive and dominant SNP effects using group-LASSO and fractional resample model averaging.

作者信息

Sabourin Jeremy, Nobel Andrew B, Valdar William

机构信息

Department of Genetics, University of North Carolina at Chapel Hill, North Carolina, United States of America; Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, North Carolina, United States of America.

出版信息

Genet Epidemiol. 2015 Feb;39(2):77-88. doi: 10.1002/gepi.21869. Epub 2014 Nov 21.

DOI:10.1002/gepi.21869

PMID:25417853

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4314429/

Abstract

Genomewide association studies (GWAS) sometimes identify loci at which both the number and identities of the underlying causal variants are ambiguous. In such cases, statistical methods that model effects of multiple single-nucleotide polymorphisms (SNPs) simultaneously can help disentangle the observed patterns of association and provide information about how those SNPs could be prioritized for follow-up studies. Current multi-SNP methods, however, tend to assume that SNP effects are well captured by additive genetics; yet when genetic dominance is present, this assumption translates to reduced power and faulty prioritizations. We describe a statistical procedure for prioritizing SNPs at GWAS loci that efficiently models both additive and dominance effects. Our method, LLARRMA-dawg, combines a group LASSO procedure for sparse modeling of multiple SNP effects with a resampling procedure based on fractional observation weights. It estimates for each SNP the robustness of association with the phenotype both to sampling variation and to competing explanations from other SNPs. In producing an SNP prioritization that best identifies underlying true signals, we show the following: our method easily outperforms a single-marker analysis; when additive-only signals are present, our joint model for additive and dominance is equivalent to or only slightly less powerful than modeling additive-only effects; and when dominance signals are present, even in combination with substantial additive effects, our joint model is unequivocally more powerful than a model assuming additivity. We also describe how performance can be improved through calibrated randomized penalization, and discuss how dominance in ungenotyped SNPs can be incorporated through either heterozygote dosage or multiple imputation.

摘要

全基因组关联研究（GWAS）有时会识别出一些基因座，在这些基因座上，潜在因果变异的数量和身份都不明确。在这种情况下，同时对多个单核苷酸多态性（SNP）效应进行建模的统计方法有助于理清观察到的关联模式，并提供有关如何对这些SNP进行后续研究优先级排序的信息。然而，当前的多SNP方法往往假定SNP效应可以通过加性遗传学很好地捕捉；然而，当存在遗传显性时，这种假设会导致功效降低和错误的优先级排序。我们描述了一种在GWAS基因座上对SNP进行优先级排序的统计程序，该程序能有效地对加性和显性效应进行建模。我们的方法LLARRMA-dawg，将用于多个SNP效应稀疏建模的组套索程序与基于分数观察权重的重采样程序相结合。它为每个SNP估计与表型关联对抽样变异和来自其他SNP的竞争性解释的稳健性。在生成能最佳识别潜在真实信号的SNP优先级排序时，我们展示了以下内容：我们的方法轻松优于单标记分析；当仅存在加性信号时，我们的加性和显性联合模型等同于仅建模加性效应的模型，或者功效仅略低；当存在显性信号时，即使与大量加性效应相结合，我们的联合模型也明显比假定加性的模型更具功效。我们还描述了如何通过校准随机惩罚来提高性能，并讨论了如何通过杂合子剂量或多重填补将未分型SNP中的显性纳入其中。

相似文献

Fine-mapping additive and dominant SNP effects using group-LASSO and fractional resample model averaging.

Genet Epidemiol. 2015 Feb;39(2):77-88. doi: 10.1002/gepi.21869. Epub 2014 Nov 21.

Reprioritizing genetic associations in hit regions using LASSO-based resample model averaging.

Genet Epidemiol. 2012 Jul;36(5):451-62. doi: 10.1002/gepi.21639. Epub 2012 Apr 30.

Genome-wide prediction using Bayesian additive regression trees.

Genet Sel Evol. 2016 Jun 10;48(1):42. doi: 10.1186/s12711-016-0219-8.

Prioritizing genetic variants in GWAS with lasso using permutation-assisted tuning.

Bioinformatics. 2020 Jun 1;36(12):3811-3817. doi: 10.1093/bioinformatics/btaa229.

Accounting for dominance to improve genomic evaluations of dairy cows for fertility and milk production traits.

Genet Sel Evol. 2016 Feb 1;48:8. doi: 10.1186/s12711-016-0186-0.

Non-additive genetic variation in growth, carcass and fertility traits of beef cattle.

Genet Sel Evol. 2015 Apr 2;47(1):26. doi: 10.1186/s12711-015-0114-8.

A genome-wide association study reveals additive and dominance effects on growth and fatness traits in large white pigs.

Anim Genet. 2021 Oct;52(5):749-753. doi: 10.1111/age.13131. Epub 2021 Aug 17.

Validation of markers with non-additive effects on milk yield and fertility in Holstein and Jersey cows.

BMC Genet. 2015 Jul 22;16:89. doi: 10.1186/s12863-015-0241-9.

Efficient approximation of P-value of the maximum of correlated tests, with applications to genome-wide association studies.

Ann Hum Genet. 2008 May;72(Pt 3):397-406. doi: 10.1111/j.1469-1809.2008.00437.x. Epub 2008 Mar 3.

One degree of freedom for dominance in indirect association studies.

Genet Epidemiol. 2007 Apr;31(3):261-71. doi: 10.1002/gepi.20207.

引用本文的文献

BEATRICE: Bayesian fine-mapping from summary data using deep variational inference.

Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae590.

Genetic polymorphism of the Dab2 gene and its association with Type 2 Diabetes Mellitus in the Chinese Uyghur population.

PeerJ. 2023 Jun 21;11:e15536. doi: 10.7717/peerj.15536. eCollection 2023.

BEATRICE: Bayesian Fine-mapping from Summary Data using Deep Variational Inference.

bioRxiv. 2024 Sep 8:2023.03.24.534116. doi: 10.1101/2023.03.24.534116.

Genetic variations in relation to bleeding and pharmacodynamics of dabigatran in Chinese patients with nonvalvular atrial fibrillation: A nationwide multicentre prospective cohort study.

Clin Transl Med. 2022 Dec;12(12):e1104. doi: 10.1002/ctm2.1104.

Multiple-trait analyses improved the accuracy of genomic prediction and the power of genome-wide association of productivity and climate change-adaptive traits in lodgepole pine.

BMC Genomics. 2022 Jul 23;23(1):536. doi: 10.1186/s12864-022-08747-7.

A Bayesian model selection approach to mediation analysis.

PLoS Genet. 2022 May 9;18(5):e1010184. doi: 10.1371/journal.pgen.1010184. eCollection 2022 May.

SNP and Haplotype Interaction Models Reveal Association of Surfactant Protein Gene Polymorphisms With Hypersensitivity Pneumonitis of Mexican Population.

Front Med (Lausanne). 2021 Jan 5;7:588404. doi: 10.3389/fmed.2020.588404. eCollection 2020.

ComPaSS-GWAS: A method to reduce type I error in genome-wide association studies when replication data are not available.

Genet Epidemiol. 2019 Feb;43(1):102-111. doi: 10.1002/gepi.22168. Epub 2018 Oct 18.

An approximate Bayesian significance test for genomic evaluations.

Biom J. 2018 Nov;60(6):1096-1109. doi: 10.1002/bimj.201700219. Epub 2018 Aug 12.

Genome-Wide Association Study Implicates Atrial Natriuretic Peptide Rather Than B-Type Natriuretic Peptide in the Regulation of Blood Pressure in the General Population.

Circ Cardiovasc Genet. 2017 Dec;10(6):e001713. doi: 10.1161/CIRCGENETICS.117.001713.

本文引用的文献

Regularized rare variant enrichment analysis for case-control exome sequencing data.

Genet Epidemiol. 2014 Feb;38(2):104-13. doi: 10.1002/gepi.21783. Epub 2013 Dec 30.

Beyond GWASs: illuminating the dark road from association to function.

Am J Hum Genet. 2013 Nov 7;93(5):779-97. doi: 10.1016/j.ajhg.2013.10.012.

Identification of grouped rare and common variants via penalized logistic regression.

Genet Epidemiol. 2013 Sep;37(6):592-602. doi: 10.1002/gepi.21746. Epub 2013 Jul 8.

Evidence of inbreeding depression on human height.

PLoS Genet. 2012;8(7):e1002655. doi: 10.1371/journal.pgen.1002655. Epub 2012 Jul 19.

An efficient multi-locus mixed-model approach for genome-wide association studies in structured populations.

Nat Genet. 2012 Jun 17;44(7):825-30. doi: 10.1038/ng.2314.

Power of single- vs. multi-marker tests of association.

Genet Epidemiol. 2012 Jul;36(5):480-7. doi: 10.1002/gepi.21642. Epub 2012 May 30.

Reprioritizing genetic associations in hit regions using LASSO-based resample model averaging.

Genet Epidemiol. 2012 Jul;36(5):451-62. doi: 10.1002/gepi.21639. Epub 2012 Apr 30.

Novel loci for adiponectin levels and their influence on type 2 diabetes and metabolic traits: a multi-ethnic meta-analysis of 45,891 individuals.

PLoS Genet. 2012;8(3):e1002607. doi: 10.1371/journal.pgen.1002607. Epub 2012 Mar 29.

Deep resequencing unveils genetic architecture of ADIPOQ and identifies a novel low-frequency variant strongly associated with adiponectin variation.

Diabetes. 2012 May;61(5):1297-301. doi: 10.2337/db11-0985. Epub 2012 Mar 8.

Penalized-regression-based multimarker genotype analysis of Genetic Analysis Workshop 17 data.

BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S92. doi: 10.1186/1753-6561-5-S9-S92.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用组套索和分数重采样模型平均法对加性和显性单核苷酸多态性效应进行精细定位。

Fine-mapping additive and dominant SNP effects using group-LASSO and fractional resample model averaging.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献