Urrutia Eugene, Lee Seunggeun, Maity Arnab, Zhao Ni, Shen Judong, Li Yun, Wu Michael C
Department of Biostatistics, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.
Department of Biostatistics, University of Michigan, Ann Arbor, MI 48105, USA.
Stat Interface. 2015;8(4):495-505. doi: 10.4310/SII.2015.v8.n4.a8.
Analysis of rare genetic variants has focused on region-based analysis wherein a subset of the variants within a genomic region is tested for association with a complex trait. Two important practical challenges have emerged. First, it is difficult to choose which test to use. Second, it is unclear which group of variants within a region should be tested. Both depend on the unknown true state of nature. Therefore, we develop the Multi-Kernel SKAT (MK-SKAT) which tests across a range of rare variant tests and groupings. Specifically, we demonstrate that several popular rare variant tests are special cases of the sequence kernel association test which compares pair-wise similarity in trait value to similarity in the rare variant genotypes between subjects as measured through a kernel function. Choosing a particular test is equivalent to choosing a kernel. Similarly, choosing which group of variants to test also reduces to choosing a kernel. Thus, MK-SKAT uses perturbation to test across a range of kernels. Simulations and real data analyses show that our framework controls type I error while maintaining high power across settings: MK-SKAT loses power when compared to the kernel for a particular scenario but has much greater power than poor choices.
对罕见基因变异的分析主要集中在基于区域的分析上,即在一个基因组区域内的一部分变异被测试与复杂性状的关联性。出现了两个重要的实际挑战。首先,很难选择使用哪种测试。其次,不清楚在一个区域内应该测试哪一组变异。这两者都取决于未知的真实自然状态。因此,我们开发了多内核序列核关联检验(MK-SKAT),它能在一系列罕见变异测试和分组中进行检验。具体来说,我们证明了几种流行的罕见变异测试是序列核关联检验的特殊情况,该检验通过核函数测量,比较性状值的成对相似性与受试者之间罕见变异基因型的相似性。选择特定的测试等同于选择一个核。同样,选择测试哪一组变异也归结为选择一个核。因此,MK-SKAT使用扰动在一系列核上进行检验。模拟和实际数据分析表明,我们的框架在各种情况下都能控制第一类错误,同时保持高功效:与特定情况下的核相比,MK-SKAT会损失功效,但比糟糕的选择具有更高的功效。