Department of Hygiene and Epidemiology, University of Ioannina School of Medicine, Ioannina, Greece.
Int J Epidemiol. 2012 Feb;41(1):273-86. doi: 10.1093/ije/dyr178. Epub 2011 Dec 5.
Robust replication is a sine qua non for the rigorous documentation of proposed associations in the genome-wide association (GWA) setting. Currently, associations of common variants reaching P ≤ 5 × 10(-8) are considered replicated. However, there is some ambiguity about the most suitable threshold for claiming genome-wide significance.
We defined as 'borderline' associations those with P > 5 × 10(-8) and P ≤ 1 × 10(-7). The eligible associations were retrieved using the 'Catalog of Published Genome-Wide Association Studies'. For each association we assessed whether it reached P ≤ 5 × 10(-8) with inclusion of additional data from subsequent GWA studies.
Thirty-four eligible genotype-phenotype associations were evaluated with data and clarifications contributed from diverse investigators. Replication data from subsequent GWA studies could be obtained for 26 of them. Of those, 19 associations (73%) reached P ≤ 5 × 10(-8) for the same or a related trait implicating either the exact same allele or one in very high linkage disequilibrium and 17 reached P < 10(-8). If the seven associations that did not reach P ≤ 5 × 10(-8) when additional data were considered are assumed to have been false-positives, the false-discovery rate for borderline associations is estimated to be 27% [95% confidence interval (CI) 12-48%]. For five associations, the current P-value is > 10(-6) [corresponding false-discovery rate 19% (95% CI 7-39%)].
A substantial proportion, but not all, of the associations with borderline genome-wide significance represent replicable, possibly genuine associations. Our empirical evaluation suggests a possible relaxation in the current GWS threshold.
稳健的复制是在全基因组关联(GWA)研究中严格记录所提出关联的必要条件。目前,达到 P ≤ 5×10(-8)的常见变体关联被认为是可复制的。然而,对于声称全基因组显著性的最合适阈值仍存在一些模糊性。
我们将 P > 5×10(-8)且 P ≤ 1×10(-7)的关联定义为“边缘”关联。使用“已发表的全基因组关联研究目录”检索合格的关联。对于每个关联,我们评估了是否通过纳入来自后续 GWA 研究的附加数据达到 P ≤ 5×10(-8)。
使用来自不同研究者的数据和澄清,评估了 34 个合格的基因型-表型关联。可以获得其中 26 个关联的后续 GWA 研究的复制数据。其中,19 个关联(73%)达到 P ≤ 5×10(-8),涉及相同或相关特征,涉及相同的等位基因或高度连锁不平衡的等位基因,且 17 个关联达到 P < 10(-8)。如果假设在考虑附加数据时未达到 P ≤ 5×10(-8)的七个关联为假阳性,则边缘关联的假发现率估计为 27%(95%置信区间 [12-48%])。对于五个关联,当前 P 值 > 10(-6) [相应的假发现率为 19%(95%置信区间 [7-39%])]。
虽然有相当一部分,但不是全部,具有边缘全基因组显著性的关联代表可复制的、可能真实的关联。我们的实证评估表明,当前的 GWS 阈值可能会放宽。