Chen Wenan, Gao Xi, Wang Jiexun, Sun Chuanyu, Wan Wen, Zhi Degui, Liu Nianjun, Chen Xiangning, Gao Guimin
Department of Biostatistics, Virginia Commonwealth University School of Medicine, 830 East Main Street, One Capitol Square, 7th Floor, Richmond, VA 23298-0032, USA.
Department of Computer Science, Virginia Commonwealth University, 401 West Main Street, Room E4225, PO Box 843019, Richmond, VA 23284-3019, USA.
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S86. doi: 10.1186/1753-6561-5-S9-S86. eCollection 2011.
We evaluate four association tests for rare variants-the combined multivariate and collapsing (CMC) method, two weighted-sum methods, and a variable threshold method-by applying them to the simulated data sets of unrelated individuals in the Genetic Analysis Workshop 17 (GAW17) data. The family-wise error rate (FWER) and average power are used as criteria for evaluation. Our results show that when all nonsynonymous SNPs (rare variants and common variants) in a gene are jointly analyzed, the CMC method fails to control the FWER; when only rare variants (single-nucleotide polymorphisms with minor allele frequency less than 0.05) are analyzed, all four methods can control FWER well. All four methods have comparable power, which is low for the analysis of the GAW17 data sets. Three of the methods (not including the CMC method) involve estimation of p-values using permutation procedures that either can be computationally intensive or generate inflated FWERs. We adapt a fast permutation procedure into these three methods. The results show that using the fast permutation procedure can produce FWERs and average powers close to the values obtained from the standard permutation procedure on the GAW17 data sets. The standard permutation procedure is computationally intensive.
我们通过将四种针对罕见变异的关联检验方法——联合多变量和压缩(CMC)法、两种加权和法以及一种可变阈值法——应用于遗传分析研讨会17(GAW17)数据中无关个体的模拟数据集,对其进行评估。以家族性错误率(FWER)和平均功效作为评估标准。我们的结果表明,当对一个基因中的所有非同义单核苷酸多态性(罕见变异和常见变异)进行联合分析时,CMC法无法控制FWER;当仅分析罕见变异(次要等位基因频率小于0.05的单核苷酸多态性)时,所有四种方法都能很好地控制FWER。所有四种方法的功效相当,对于GAW17数据集的分析而言功效较低。其中三种方法(不包括CMC法)涉及使用置换程序估计p值,这些程序要么计算量很大,要么会产生过高的FWER。我们将一种快速置换程序应用于这三种方法。结果表明,在GAW17数据集上,使用快速置换程序产生的FWER和平均功效接近从标准置换程序获得的值。标准置换程序计算量很大。