Mallaney Cates, Sung Yun Ju
Division of Biostatistics, Washington University in St. Louis, School of Medicine, St. Louis, MO 63110, USA.
BMC Proc. 2014 Jun 17;8(Suppl 1 Genetic Analysis Workshop 18Vanessa Olmo):S10. doi: 10.1186/1753-6561-8-S1-S10. eCollection 2014.
Sequence kernel association test (SKAT) has become one of the most commonly used nonburden tests for analyzing rare variants. Performance of burden tests depends on the weighting of rare and common variants when collapsing them in a genomic region. Using the systolic and diastolic blood pressure phenotypes of 142 unrelated individuals in the Genetic Analysis Workshop 18 data, we investigated whether performance of SKAT also depends on the weighting scheme. We analyzed the entire sequencing data for all 200 replications using 3 weighting schemes: equal weighting, Madsen-Browning weighting, and SKAT default linear weighting. We considered two options: all single-nucleotide polymorphisms (SNPs) and only low-frequency SNPs. A SKAT default weighting scheme (which heavily downweights common variants) performed better for the genes in which causal SNPs are mostly rare. This SKAT default weighting scheme behaved similarly to other weighting schemes after eliminating all common SNPs. In contrast, the equal weighting scheme performed the best for MAP4 and FLT3, both of which included a common variant with a large effect. However, SKAT with all 3 weighting schemes performed poorly. Overall power across all causal genes was about 0.05, which was almost identical to the type I error rate. This poor performance is partly due to a small sample size because of the need to analyze only unrelated individuals. Because a half of causal SNPs were not found in the annotation file based on the 1000 Genomes Project, we suspect that performance was also affected by our use of incomplete annotation information.
序列核关联检验(SKAT)已成为分析罕见变异最常用的非加权检验方法之一。加权检验的性能取决于在基因组区域中将罕见变异和常见变异合并时的加权方式。利用遗传分析研讨会18数据中142名无亲属关系个体的收缩压和舒张压表型,我们研究了SKAT的性能是否也取决于加权方案。我们使用三种加权方案分析了所有200次重复的完整测序数据:等加权、Madsen-Browning加权和SKAT默认线性加权。我们考虑了两种选择:所有单核苷酸多态性(SNP)和仅低频SNP。对于因果SNP大多为罕见的基因,SKAT默认加权方案(该方案严重降低常见变异的权重)表现更好。在去除所有常见SNP后,这种SKAT默认加权方案的表现与其他加权方案类似。相比之下,等加权方案在MAP4和FLT3上表现最佳,这两个基因都包含一个具有较大效应的常见变异。然而,采用所有三种加权方案的SKAT表现都很差。所有因果基因的总体检验效能约为0.05,几乎与I型错误率相同。这种不佳表现部分归因于样本量小,因为只需要分析无亲属关系的个体。由于基于千人基因组计划的注释文件中未发现一半的因果SNP,我们怀疑性能也受到我们使用不完整注释信息的影响。