Department of Psychiatry and State Key Laboratory for Cognitive and Brain Sciences, the University of Hong Kong, Pokfulam, Hong Kong.
Am J Hum Genet. 2011 Mar 11;88(3):283-93. doi: 10.1016/j.ajhg.2011.01.019.
The gene has been proposed as an attractive unit of analysis for association studies, but a simple yet valid, powerful, and sufficiently fast method of evaluating the statistical significance of all genes in large, genome-wide datasets has been lacking. Here we propose the use of an extended Simes test that integrates functional information and association evidence to combine the p values of the single nucleotide polymorphisms within a gene to obtain an overall p value for the association of the entire gene. Our computer simulations demonstrate that this test is more powerful than the SNP-based test, offers effective control of the type 1 error rate regardless of gene size and linkage-disequilibrium pattern among markers, and does not need permutation or simulation to evaluate empirical significance. Its statistical power in simulated data is at least comparable, and often superior, to that of several alternative gene-based tests. When applied to real genome-wide association study (GWAS) datasets on Crohn disease, the test detected more significant genes than SNP-based tests and alternative gene-based tests. The proposed test, implemented in an open-source package, has the potential to identify additional novel disease-susceptibility genes for complex diseases from large GWAS datasets.
该基因已被提议作为关联研究的一个有吸引力的分析单位,但缺乏一种简单、有效、强大且足够快速的方法来评估全基因组范围内大型数据集的所有基因的统计显著性。在这里,我们建议使用扩展的 Simes 检验,该检验整合了功能信息和关联证据,将基因内单个核苷酸多态性的 p 值组合起来,以获得整个基因关联的总体 p 值。我们的计算机模拟表明,与 SNP 为基础的检验相比,该检验具有更高的功效,无论基因大小和标记间的连锁不平衡模式如何,都能有效地控制第一类错误率,而且不需要置换或模拟来评估经验显著性。它在模拟数据中的统计功效至少与几种替代的基于基因的检验相当,而且通常更优越。当应用于克罗恩病的真实全基因组关联研究 (GWAS) 数据集时,该检验比 SNP 为基础的检验和替代的基于基因的检验检测到了更多显著的基因。该检验以开源软件包的形式实现,有可能从大型 GWAS 数据集中鉴定出更多与复杂疾病相关的新的易感基因。