School of Mathematics and Statistics, University of Sheffield, Sheffield, UK.
Department of Oncology, Sheffield Cancer Research Centre, University of Sheffield Medical School, Sheffield, UK.
Genet Epidemiol. 2019 Sep;43(6):675-689. doi: 10.1002/gepi.22212. Epub 2019 Jul 9.
The default causal single-nucleotide polymorphism (SNP) effect size prior in Bayesian fine-mapping studies is usually the Normal distribution. This choice is often based on computational convenience, rather than evidence that it is the most suitable prior distribution. The choice of prior is important because previous studies have shown considerable sensitivity of causal SNP Bayes factors to the form of the prior. In some well-studied diseases there are now considerable numbers of genome-wide association study (GWAS) top hits along with estimates of the number of yet-to-be-discovered causal SNPs. We show how the effect sizes of the top hits and estimates of the number of yet-to-be-discovered causal SNPs can be used to choose between the Laplace and Normal priors, to estimate the prior parameters and to quantify the uncertainty in this estimation. The methodology can readily be applied to other priors. We show that the top hits available from breast cancer GWAS provide overwhelming support for the Laplace over the Normal prior, which has important consequences for variant prioritisation. This work in this paper enables practitioners to derive more objective priors than are currently being used and could lead to prioritisation of different variants.
贝叶斯精细映射研究中默认的因果单核苷酸多态性 (SNP) 效应大小先验通常是正态分布。这种选择通常基于计算方便,而不是证据表明它是最合适的先验分布。先验的选择很重要,因为先前的研究表明,因果 SNP 贝叶斯因子对先验形式的敏感性相当大。在一些研究充分的疾病中,现在有相当数量的全基因组关联研究 (GWAS) 顶级命中以及尚未发现的因果 SNP 数量的估计。我们展示了如何使用顶级命中的效应大小和尚未发现的因果 SNP 数量的估计来在拉普拉斯和正态先验之间进行选择,以估计先验参数并量化这种估计的不确定性。该方法可以很容易地应用于其他先验。我们表明,来自乳腺癌 GWAS 的顶级命中压倒性地支持拉普拉斯先验而不是正态先验,这对变体优先级排序有重要影响。本文的工作使从业者能够得出比当前使用的更客观的先验,并且可能导致不同变体的优先级排序。