Moskvina V, O'Donovan M C
Department of Psychological Medicine, Wales College of Medicine, Cardiff University, Cardiff, UK.
Hum Hered. 2007;64(1):63-73. doi: 10.1159/000101424. Epub 2007 Apr 27.
Genetic association studies are usually based upon restricted sets of 'tag' markers selected to represent the total sequence variation. Tag selection is often determined by some threshold for the r(2) coefficients of linkage disequilibrium (LD) between tag and untyped markers, it being widely assumed that power to detect an effect at the untyped sites is retained by typing the tag marker in a sample scaled by the inverse of the selected threshold (1/r(2)). However, unless only a single causal variant occurs at a locus, it has been shown [Eur J Hum Genet 2006;14:426-437] that significant power loss can occur if this principle is applied. We sought to investigate whether unexpected loss of power might be an exceptional case or more general concern. In the absence of detailed knowledge about the genetic architecture at complex disease loci, we developed a mathematical approach to test all possible situations.
We derived mathematical formulae allowing the calculation of all possible odds ratios (OR) at a tag marker locus given the effect size that would be observed by typing a second locus and the r(2) between the two loci. For a range of allele frequencies, r(2) between loci, and strengths of association at the causal locus (OR from 0.5 to 2) that we consider realistic for complex disease loci, we next determined the sample sizes that would be necessary to give equivalent power to detect association by genotyping tag and causal loci and compared these with the sample sizes predicted by applying 1/r(2).
Under most of the hypothetical scenarios we examined, the calculated sample sizes required to maintain power by typing markers that tag the causal locus at even moderately high r(2) (0.8) were greater than that calculated by applying 1/r(2). Even in populations with apparently similar measurements of allele frequency, LD structure, and effect size at the susceptibility allele, the required sample size to detect association with a tag marker can vary substantially. We also show that in apparently similar populations, associations to either allele at the tag site are possible.
Indirect tests of association are less powered than sizes predicted by applying 1/r(2) in the majority of hypothetical scenarios we examined. Our findings pertain even for what we consider likely to be larger than average effect sizes in complex diseases (OR = 1.5-2) and even for moderately high r(2) values between the markers. Until a substantial number of disease genes have been identified through methods that are not based on tagging, and therefore biased towards those situations most favourable to tagging, it is impossible to know how the true scenarios are distributed across the range of possible scenarios. Nevertheless, while association designs based upon tag marker selection by necessity are the tool of choice for de novo gene discovery, our data suggest power to initially detect association may often be less than assumed. Moreover, our data suggest that to avoid genuine findings being subsequently discarded by unpredictable losses of power, follow up studies in other samples should be based upon more detailed analyses of the gene rather than simply on the tag SNPs showing association in the discovery study.
基因关联研究通常基于一组经过筛选的“标签”标记,这些标记被选来代表整个序列变异。标签的选择通常由标签与未分型标记之间的连锁不平衡(LD)的r²系数的某个阈值决定,人们普遍认为,通过在一个样本中对标签标记进行分型,其样本量按所选阈值的倒数(1/r²)进行缩放,就能够保留检测未分型位点效应的能力。然而,除非一个基因座上只出现一个因果变异,否则已有研究表明[《欧洲人类遗传学杂志》2006年;14:426 - 437],如果应用这一原则,可能会出现显著的效能损失。我们试图研究这种意外的效能损失是个别情况还是更普遍的问题。在缺乏关于复杂疾病基因座遗传结构的详细知识的情况下,我们开发了一种数学方法来测试所有可能的情况。
我们推导了数学公式,给定在第二个基因座分型时观察到的效应大小以及两个基因座之间的r²,就可以计算标签标记基因座上所有可能的比值比(OR)。对于一系列我们认为在复杂疾病基因座中实际存在的等位基因频率、基因座之间的r²以及因果基因座的关联强度(OR从0.5到2),我们接下来确定通过对标签基因座和因果基因座进行基因分型来检测关联所需的等效效能的样本量,并将这些样本量与应用1/r²预测的样本量进行比较。
在我们研究的大多数假设情景下,通过对即使是中等偏高r²(0.8)的因果基因座进行标签标记分型来维持效能所需的计算样本量,大于应用1/r²计算的样本量。即使在等位基因频率、LD结构和易感等位基因效应大小的测量结果明显相似的人群中,检测与标签标记关联所需的样本量也可能有很大差异。我们还表明,在明显相似的人群中,标签位点上的任何一个等位基因都可能存在关联。
在我们研究的大多数假设情景下,关联的间接检测效能低于应用1/r²预测的大小。我们的研究结果甚至适用于我们认为在复杂疾病中可能大于平均效应大小的情况(OR = 1.5 - 2),甚至适用于标记之间中等偏高的r²值。在通过非基于标签的方法鉴定出大量疾病基因之前,因此偏向于那些最有利于标签法的情况,我们无法知道真实情景在所有可能情景范围内是如何分布的。然而,虽然基于标签标记选择的关联设计必然是从头发现基因的首选工具,但我们的数据表明,最初检测关联的效能可能常常低于预期。此外,我们的数据表明,为了避免真正的发现随后因不可预测的效能损失而被丢弃,在其他样本中的后续研究应该基于对基因更详细的分析,而不是仅仅基于在发现研究中显示关联的标签单核苷酸多态性。