Incorporating Functional Genomic Information in Genetic Association Studies Using an Empirical Bayes Approach.

Authors

Spencer Amy V, Cox Angela, Lin Wei-Yu, Easton Douglas F, Michailidou Kyriaki, Walters Kevin

Affiliations

Advanced Analytics Centre, Global Medicines Development, AstraZeneca, Alderley Park, Macclesfield, United Kingdom.

School of Mathematics and Statistics, University of Sheffield, Sheffield, United Kingdom.

Publication information

Genet Epidemiol. 2016 Apr;40(3):176-87. doi: 10.1002/gepi.21956. Epub 2016 Feb 1.

DOI:10.1002/gepi.21956

PMID:26833494

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4832271/

Abstract

There is a large amount of functional genetic data available, which can be used to inform fine-mapping association studies (in diseases with well-characterised disease pathways). Single nucleotide polymorphism (SNP) prioritization via Bayes factors is attractive because prior information can inform the effect size or the prior probability of causal association. This approach requires the specification of the effect size. If the information needed to estimate a priori the probability density for the effect sizes for causal SNPs in a genomic region is inconsistent or unavailable, then specifying a prior variance for the effect sizes is challenging. We propose both an empirical method to estimate this prior variance and a coherent approach to using SNP-level functional data to inform the prior probability of causal association. Through simulation we show that when ranking SNPs by our empirical Bayes factor in a fine-mapping study, the causal SNP rank is generally as high as or higher than the rank using Bayes factors with other plausible values of the prior variance. Importantly, we also show that assigning SNP-specific prior probabilities of association based on expert prior functional knowledge of the disease mechanism can lead to improved causal SNP ranks compared to ranking with identical prior probabilities of association. We demonstrate the use of our methods by applying them to the fine mapping of the CASP8 region of chromosome 2 using genotype data from the Collaborative Oncological Gene-Environment Study (COGS) Consortium. The data we analysed included approximately 46,000 breast cancer case and 43,000 healthy control samples.
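
The ranking idea described in the abstract can be illustrated with a minimal sketch. The abstract does not give the formulas, so everything here is an assumption made for illustration: a Wakefield-style approximate Bayes factor with a normal N(0, W) effect-size prior, a grid search for W that maximizes the prior-weighted marginal likelihood under a single-causal-SNP model, and made-up effect estimates, standard errors, and prior probabilities. It is not the authors' estimator.

```python
import math


def log_abf(beta_hat, se, w):
    """Log approximate Bayes factor (association vs. no association),
    assuming beta_hat ~ N(beta, se^2) and a N(0, w) prior on beta."""
    v = se * se
    z2 = (beta_hat / se) ** 2
    return 0.5 * math.log(v / (v + w)) + 0.5 * z2 * w / (v + w)


def _log_weights(betas, ses, priors, w):
    # log(prior_i * ABF_i) for each SNP; priors must be strictly positive.
    return [math.log(p) + log_abf(b, s, w) for b, s, p in zip(betas, ses, priors)]


def estimate_prior_variance(betas, ses, priors, grid):
    """Hypothetical empirical Bayes step: choose the w on the grid that
    maximizes the prior-weighted sum of Bayes factors for the region."""
    def log_marginal(w):
        terms = _log_weights(betas, ses, priors, w)
        m = max(terms)
        return m + math.log(sum(math.exp(t - m) for t in terms))
    return max(grid, key=log_marginal)


def posterior_probs(betas, ses, priors, w):
    """Posterior probability that each SNP is the causal one,
    assuming exactly one causal SNP in the region."""
    terms = _log_weights(betas, ses, priors, w)
    m = max(terms)
    weights = [math.exp(t - m) for t in terms]
    total = sum(weights)
    return [x / total for x in weights]


# Illustrative (made-up) log odds ratio estimates and standard errors.
betas = [0.02, 0.10, 0.09, -0.01]
ses = [0.02, 0.02, 0.02, 0.02]
flat_priors = [0.25, 0.25, 0.25, 0.25]
informed_priors = [0.1, 0.2, 0.6, 0.1]  # hypothetical functional evidence favours SNP 2
grid = [0.0005 * k for k in range(1, 201)]

w_hat = estimate_prior_variance(betas, ses, informed_priors, grid)
pp_flat = posterior_probs(betas, ses, flat_priors, w_hat)
pp_informed = posterior_probs(betas, ses, informed_priors, w_hat)
ranking = sorted(range(len(betas)), key=lambda i: pp_informed[i], reverse=True)
print(w_hat, pp_flat, pp_informed, ranking)
```

Working on the log scale keeps the calculation stable when z-scores are large. With identical prior probabilities the ranking depends on the Bayes factors alone; SNP-specific priors shift posterior weight towards SNPs with supporting functional evidence.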

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8be9/4832271/7f4451c46d5e/GEPI-40-176-g001.jpg

Similar articles

Incorporating Functional Genomic Information in Genetic Association Studies Using an Empirical Bayes Approach.

Genet Epidemiol. 2016 Apr;40(3):176-87. doi: 10.1002/gepi.21956. Epub 2016 Feb 1.

Novel bayes factors that capture expert uncertainty in prior density specification in genetic association studies.

Genet Epidemiol. 2015 May;39(4):239-48. doi: 10.1002/gepi.21891. Epub 2015 Feb 27.

Bayesian variable selection using partially observed categorical prior information in fine-mapping association studies.

Genet Epidemiol. 2019 Sep;43(6):690-703. doi: 10.1002/gepi.22213. Epub 2019 Jul 12.

The utility of the Laplace effect size prior distribution in Bayesian fine-mapping studies.

Genet Epidemiol. 2021 Jun;45(4):386-401. doi: 10.1002/gepi.22375. Epub 2021 Jan 6.

Bayesian multivariant fine mapping using the Laplace prior.

Genet Epidemiol. 2023 Apr;47(3):249-260. doi: 10.1002/gepi.22517. Epub 2023 Feb 5.

Simultaneous analysis of all SNPs in genome-wide and re-sequencing association studies.

PLoS Genet. 2008 Jul 25;4(7):e1000130. doi: 10.1371/journal.pgen.1000130.

Using GWAS top hits to inform priors in Bayesian fine-mapping association studies.

Genet Epidemiol. 2019 Sep;43(6):675-689. doi: 10.1002/gepi.22212. Epub 2019 Jul 9.

Fine scale mapping of the 17q22 breast cancer locus using dense SNPs, genotyped within the Collaborative Oncological Gene-Environment Study (COGs).

Sci Rep. 2016 Sep 7;6:32512. doi: 10.1038/srep32512.

A latent model for prioritization of SNPs for functional studies.

PLoS One. 2011;6(6):e20764. doi: 10.1371/journal.pone.0020764. Epub 2011 Jun 8.

Re-ranking sequencing variants in the post-GWAS era for accurate causal variant identification.

PLoS Genet. 2013;9(8):e1003609. doi: 10.1371/journal.pgen.1003609. Epub 2013 Aug 8.

Cited by

Replicability in cancer omics data analysis: measures and empirical explorations.

Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac304.

Probabilistic identification of bacterial essential genes via insertion density using TraDIS data with Tn5 libraries.

Bioinformatics. 2021 Dec 7;37(23):4343-4349. doi: 10.1093/bioinformatics/btab508.

Fine-mapping genetic associations.

Hum Mol Genet. 2020 Sep 30;29(R1):R81-R88. doi: 10.1093/hmg/ddaa148.

Quantifying posterior effect size distribution of susceptibility loci by common summary statistics.

Genet Epidemiol. 2020 Jun;44(4):339-351. doi: 10.1002/gepi.22286. Epub 2020 Feb 25.

Stepwise approach to SNP-set analysis illustrated with the Metabochip and colorectal cancer in Japanese Americans of the Multiethnic Cohort.

BMC Genomics. 2018 Jul 9;19(1):524. doi: 10.1186/s12864-018-4910-8.

Inclusion of biological knowledge in a Bayesian shrinkage model for joint estimation of SNP effects.

Genet Epidemiol. 2017 May;41(4):320-331. doi: 10.1002/gepi.22038. Epub 2017 Apr 10.

References

eQuIPS: eQTL Analysis Using Informed Partitioning of SNPs - A Fully Bayesian Approach.

Genet Epidemiol. 2016 May;40(4):273-83. doi: 10.1002/gepi.21961. Epub 2016 Mar 14.

Novel bayes factors that capture expert uncertainty in prior density specification in genetic association studies.

Genet Epidemiol. 2015 May;39(4):239-48. doi: 10.1002/gepi.21891. Epub 2015 Feb 27.

Joint analysis of functional genomic data and genome-wide association studies of 18 human traits.

Am J Hum Genet. 2014 Apr 3;94(4):559-73. doi: 10.1016/j.ajhg.2014.03.004.

Comparing the efficacy of SNP filtering methods for identifying a single causal SNP in a known association region.

Ann Hum Genet. 2014 Jan;78(1):50-61. doi: 10.1111/ahg.12043. Epub 2013 Nov 11.

All SNPs are not created equal: genome-wide association studies reveal a consistent pattern of enrichment among functionally annotated SNPs.

PLoS Genet. 2013 Apr;9(4):e1003449. doi: 10.1371/journal.pgen.1003449. Epub 2013 Apr 25.

Large-scale genotyping identifies 41 new loci associated with breast cancer risk.

Nat Genet. 2013 Apr;45(4):353-61, 361e1-2. doi: 10.1038/ng.2563.

Bayesian refinement of association signals for 14 loci in 3 common diseases.

Nat Genet. 2012 Dec;44(12):1294-301. doi: 10.1038/ng.2435. Epub 2012 Oct 28.

Annotation of functional variation in personal genomes using RegulomeDB.

Genome Res. 2012 Sep;22(9):1790-7. doi: 10.1101/gr.137323.112.

A latent model for prioritization of SNPs for functional studies.

PLoS One. 2011;6(6):e20764. doi: 10.1371/journal.pone.0020764. Epub 2011 Jun 8.

HAPGEN2: simulation of multiple disease SNPs.

Bioinformatics. 2011 Aug 15;27(16):2304-5. doi: 10.1093/bioinformatics/btr341. Epub 2011 Jun 8.
