Bioinformatics Program, Taiwan International Graduate Program, Academia Sinica, Taipei, Taiwan; Institute of Biomedical Informatics, National Yang-Ming University, Taipei, Taiwan; Institute of Information Science, Academia Sinica, Taipei, Taiwan; Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan.
Gene. 2014 Jan 1;533(1):304-12. doi: 10.1016/j.gene.2013.09.041. Epub 2013 Sep 25.
Identifying susceptibility genes that influence complex diseases is extremely difficult because loci often influence the disease state through genetic interactions. Numerous approaches to detect disease-associated SNP-SNP interactions have been developed, but none consistently generates high-quality results under different disease scenarios. Using summarizing techniques to combine a number of existing methods may provide a solution to this problem. Here we used three popular non-parametric methods-Gini, absolute probability difference (APD), and entropy-to develop two novel summary scores, namely principle component score (PCS) and Z-sum score (ZSS), with which to predict disease-associated genetic interactions. We used a simulation study to compare performance of the non-parametric scores, the summary scores, the scaled-sum score (SSS; used in polymorphism interaction analysis (PIA)), and the multifactor dimensionality reduction (MDR). The non-parametric methods achieved high power, but no non-parametric method outperformed all others under a variety of epistatic scenarios. PCS and ZSS, however, outperformed MDR. PCS, ZSS and SSS displayed controlled type-I-errors (<0.05) compared to GS, APDS, ES (>0.05). A real data study using the genetic-analysis-workshop 16 (GAW 16) rheumatoid arthritis dataset identified a number of interesting SNP-SNP interactions.
确定影响复杂疾病的易感基因极其困难,因为这些基因通常通过遗传相互作用影响疾病状态。已经开发了许多用于检测与疾病相关的 SNP-SNP 相互作用的方法,但在不同的疾病情况下,没有一种方法能够始终产生高质量的结果。使用汇总技术结合许多现有的方法可能是解决这个问题的一种方法。在这里,我们使用了三种流行的非参数方法——基尼系数、绝对概率差 (APD) 和熵——来开发两种新的汇总评分,即主成分评分 (PCS) 和 Z 总和评分 (ZSS),用于预测与疾病相关的遗传相互作用。我们使用模拟研究比较了非参数评分、汇总评分、缩放总和评分 (SSS; 用于多态性相互作用分析 (PIA)) 和多因素维度减少 (MDR) 的性能。非参数方法的功效很高,但在各种上位性情况下,没有一种非参数方法优于所有其他方法。然而,PCS 和 ZSS 优于 MDR。与 GS、APDS、ES (>0.05) 相比,PCS、ZSS 和 SSS 与控制型错误率 (I 型错误率 <0.05) 相关。对遗传分析研讨会 16 (GAW 16) 类风湿关节炎数据集的真实数据研究确定了一些有趣的 SNP-SNP 相互作用。