Zhang Yu, Ghosh Soumitra, Hakonarson Hakon
Department of Statistics, The Pennsylvania State University, University Park, Pennsylvania 16802
Division of Immuno-Inflammatory and Respiratory Genetics, GlaxoSmithKline, King of Prussia, Pennsylvania 19406.
Genetics. 2014 Nov;198(3):867-78. doi: 10.1534/genetics.114.167403. Epub 2014 Sep 11.
Rare genetic variants have recently been studied for genome-wide associations with human complex diseases. Existing rare variant methods are based on the hypothesis-testing framework that predefined variant sets need to be tested separately. The power of those methods is contingent upon accurate selection of variants for testing, and frequently, common variants are left out for separate testing. In this article, we present a novel Bayesian method for simultaneous testing of all genome-wide variants across the whole frequency range. The method allows for much more flexible grouping of variants and dynamically combines them for joint testing. The method accounts for correlation among variant sets, such that only direct associations with the disease are reported, whereas indirect associations due to linkage disequilibrium are not. Consequently, the method can obtain much improved power and flexibility and simultaneously pinpoint multiple disease variants with high resolution. Additional covariates of categorical, discrete, and continuous values can also be added. We compared our method with seven existing categories of approaches for rare variant mapping. We demonstrate that our method achieves similar power to the best methods available to date when testing very rare variants in small SNP sets. When moderately rare or common variants are included, or when testing a large collection of variants, however, our method significantly outperforms all existing methods evaluated in this study. We further demonstrate the power and the usage of our method in a whole-genome resequencing study of type 1 diabetes.
最近,人们对罕见基因变异与人类复杂疾病的全基因组关联进行了研究。现有的罕见变异方法基于假设检验框架,即需要分别对预定义的变异集进行测试。这些方法的效力取决于用于测试的变异的准确选择,而且常见变异常常被排除在外单独进行测试。在本文中,我们提出了一种新颖的贝叶斯方法,用于同时测试整个频率范围内的所有全基因组变异。该方法允许对变异进行更加灵活的分组,并动态地将它们组合起来进行联合测试。该方法考虑了变异集之间的相关性,因此只报告与疾病的直接关联,而不报告由于连锁不平衡导致的间接关联。因此,该方法可以获得显著提高的效力和灵活性,同时以高分辨率精确找出多个疾病变异。也可以添加分类、离散和连续值的其他协变量。我们将我们的方法与现有的七类罕见变异定位方法进行了比较。我们证明,在对小SNP集中的极罕见变异进行测试时,我们的方法与迄今为止可用的最佳方法具有相似的效力。然而,当纳入中度罕见或常见变异时,或者在测试大量变异时,我们的方法明显优于本研究中评估的所有现有方法。我们进一步在1型糖尿病的全基因组重测序研究中展示了我们方法的效力和用法。