Department of Gene Diagnostics and Therapeutics, Research Institute, National Center for Global Health and Medicine, Tokyo, Japan.
Genome Res. 2011 Jul;21(7):1122-30. doi: 10.1101/gr.115832.110. Epub 2011 Mar 25.
Genome-wide association (GWA) studies have identified hundreds of common (minor allele frequency ≥5%) single nucleotide polymorphisms (SNPs) associated with phenotype traits or diseases, yet causal variants accounting for the association signals have rarely been determined. A question then raised is whether a GWA signal represents an "indirect association" as a proxy of a strongly correlated causal variant with similar frequency, or a "synthetic association" of one or more rarer causal variants in linkage disequilibrium (D' ≈ 1, but r(2) not large); answering the question generally requires extensive resequencing and association analysis. Instead, we propose to test statistically whether a quantitative trait (QT) association of an SNP represents a synthetic association or not by inspecting the QT distribution at each genotype, not requiring the causal variant(s) to be known. We devised two test statistics and assessed the power by mathematical analysis and simulation. Testing the heterogeneity of variance was powerful when low-frequency causal alleles are linked mostly to one SNP allele, while testing the skewness outperformed when the causal alleles are linked evenly to either of the SNP alleles. By testing a statistic combining these two in 5000 individuals, we could detect synthetic association of a GWA signal when causal alleles sum up to 3% in frequency. Such signal only partially explains the heritability contributed by the whole locus. The proposed test is useful for designing fine mapping after studying association of common SNPs exhaustively; we can prioritize which GWA signal and which individuals to be resequenced, and identify the causal variants efficiently.
全基因组关联 (GWA) 研究已经确定了数百个与表型特征或疾病相关的常见(次要等位基因频率≥5%)单核苷酸多态性 (SNP),但很少确定导致关联信号的变体。然后提出了一个问题,即 GWA 信号是否代表“间接关联”,作为具有相似频率的强相关因果变体的代表,或者是连锁不平衡 (D'≈1,但 r(2) 不大) 中一个或多个罕见因果变体的“综合关联”;回答这个问题通常需要广泛的重测序和关联分析。相反,我们建议通过检查每个基因型的 QT 分布,从统计学上检验 SNP 的定量性状 (QT) 关联是否代表综合关联,而无需知道因果变体。我们设计了两个测试统计量,并通过数学分析和模拟评估了功效。当低频因果等位基因主要与一个 SNP 等位基因相关联时,检验方差异质性非常有效,而当因果等位基因均匀地与 SNP 等位基因中的任一个相关联时,检验偏度则表现出色。通过在 5000 个人中测试一个结合这两个统计量的统计量,我们可以在因果等位基因频率总计为 3%时检测到 GWA 信号的综合关联。这种信号仅部分解释了整个基因座所贡献的遗传力。该方法可用于在详细研究常见 SNP 的关联后设计精细图谱;我们可以优先考虑要重测序的 GWA 信号和个体,并有效地识别因果变体。