Division of Biomedical Statistics and Informatics, Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota, USA.
Genet Epidemiol. 2013 Apr;37(3):229-38. doi: 10.1002/gepi.21708. Epub 2013 Jan 24.
Genome-wide association studies (GWAS) of complex traits have generated many association signals for single nucleotide polymorphisms (SNPs). To understand the underlying causal genetic variant(s), focused DNA resequencing of targeted genomic regions is commonly used, yet the current cost of resequencing limits sample sizes for resequencing studies. Information from the large GWAS can be used to guide choice of samples for resequencing, such as the SNP genotypes in the targeted genomic region. Viewing the GWAS tag-SNPs as imperfect surrogates for the underlying causal variants, yet expecting that the tag-SNPs are correlated with the causal variants, a reasonable approach is a two-phase case-control design, with the GWAS serving as the first-phase and the resequencing study serving as the second-phase. Using stratified sampling based on both tag-SNP genotypes and case-control status, we explore the gains in power of a two-phase design relative to randomly sampling cases and controls for resequencing (i.e., ignoring tag-SNP genotypes). Simulation results show that stratified sampling based on both tag-SNP genotypes and case-control status is not likely to have lower power than stratified sampling based only on case-control status, and can sometimes have substantially greater power. The gain in power depends on the amount of linkage disequilibrium between the tag-SNP and causal variant alleles, as well as the effect size of the causal variant. Hence, the two-phase design provides an efficient approach to follow-up GWAS signals with DNA resequencing.
全基因组关联研究(GWAS)对复杂性状产生了许多单核苷酸多态性(SNP)的关联信号。为了了解潜在的因果遗传变异,通常会对目标基因组区域进行靶向 DNA 重测序,然而目前的重测序成本限制了重测序研究的样本量。可以利用来自大型 GWAS 的信息来指导重测序样本的选择,例如目标基因组区域中的 SNP 基因型。将 GWAS 标签-SNP 视为潜在因果变异的不完美替代物,但期望标签-SNP 与因果变异相关,因此可以采用两阶段病例对照设计,GWAS 作为第一阶段,重测序研究作为第二阶段。我们基于标签-SNP 基因型和病例对照状态进行分层抽样,探索两阶段设计相对于随机抽样病例和对照进行重测序(即忽略标签-SNP 基因型)的功效增益。模拟结果表明,基于标签-SNP 基因型和病例对照状态的分层抽样不太可能比仅基于病例对照状态的分层抽样具有更低的功效,并且有时可以具有更大的功效。功效增益取决于标签-SNP 和因果变异等位基因之间的连锁不平衡程度以及因果变异的效应大小。因此,两阶段设计为通过 DNA 重测序对 GWAS 信号进行后续研究提供了一种有效的方法。