Suppr超能文献

采用两阶段设计对全基因组关联信号进行 DNA 重测序研究的随访。

Two-phase designs to follow-up genome-wide association signals with DNA resequencing studies.

机构信息

Division of Biomedical Statistics and Informatics, Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota, USA.

出版信息

Genet Epidemiol. 2013 Apr;37(3):229-38. doi: 10.1002/gepi.21708. Epub 2013 Jan 24.

Abstract

Genome-wide association studies (GWAS) of complex traits have generated many association signals for single nucleotide polymorphisms (SNPs). To understand the underlying causal genetic variant(s), focused DNA resequencing of targeted genomic regions is commonly used, yet the current cost of resequencing limits sample sizes for resequencing studies. Information from the large GWAS can be used to guide choice of samples for resequencing, such as the SNP genotypes in the targeted genomic region. Viewing the GWAS tag-SNPs as imperfect surrogates for the underlying causal variants, yet expecting that the tag-SNPs are correlated with the causal variants, a reasonable approach is a two-phase case-control design, with the GWAS serving as the first-phase and the resequencing study serving as the second-phase. Using stratified sampling based on both tag-SNP genotypes and case-control status, we explore the gains in power of a two-phase design relative to randomly sampling cases and controls for resequencing (i.e., ignoring tag-SNP genotypes). Simulation results show that stratified sampling based on both tag-SNP genotypes and case-control status is not likely to have lower power than stratified sampling based only on case-control status, and can sometimes have substantially greater power. The gain in power depends on the amount of linkage disequilibrium between the tag-SNP and causal variant alleles, as well as the effect size of the causal variant. Hence, the two-phase design provides an efficient approach to follow-up GWAS signals with DNA resequencing.

摘要

全基因组关联研究(GWAS)对复杂性状产生了许多单核苷酸多态性(SNP)的关联信号。为了了解潜在的因果遗传变异,通常会对目标基因组区域进行靶向 DNA 重测序,然而目前的重测序成本限制了重测序研究的样本量。可以利用来自大型 GWAS 的信息来指导重测序样本的选择,例如目标基因组区域中的 SNP 基因型。将 GWAS 标签-SNP 视为潜在因果变异的不完美替代物,但期望标签-SNP 与因果变异相关,因此可以采用两阶段病例对照设计,GWAS 作为第一阶段,重测序研究作为第二阶段。我们基于标签-SNP 基因型和病例对照状态进行分层抽样,探索两阶段设计相对于随机抽样病例和对照进行重测序(即忽略标签-SNP 基因型)的功效增益。模拟结果表明,基于标签-SNP 基因型和病例对照状态的分层抽样不太可能比仅基于病例对照状态的分层抽样具有更低的功效,并且有时可以具有更大的功效。功效增益取决于标签-SNP 和因果变异等位基因之间的连锁不平衡程度以及因果变异的效应大小。因此,两阶段设计为通过 DNA 重测序对 GWAS 信号进行后续研究提供了一种有效的方法。

相似文献

1
Two-phase designs to follow-up genome-wide association signals with DNA resequencing studies.
Genet Epidemiol. 2013 Apr;37(3):229-38. doi: 10.1002/gepi.21708. Epub 2013 Jan 24.
2
Enriching targeted sequencing experiments for rare disease alleles.
Bioinformatics. 2011 Aug 1;27(15):2112-8. doi: 10.1093/bioinformatics/btr324. Epub 2011 Jun 23.
3
Two-phase stratified sampling designs for regional sequencing.
Genet Epidemiol. 2012 May;36(4):320-32. doi: 10.1002/gepi.21624. Epub 2012 Mar 28.
5
Re-ranking sequencing variants in the post-GWAS era for accurate causal variant identification.
PLoS Genet. 2013;9(8):e1003609. doi: 10.1371/journal.pgen.1003609. Epub 2013 Aug 8.
6
Design considerations for genetic linkage and association studies.
Methods Mol Biol. 2012;850:237-62. doi: 10.1007/978-1-61779-555-8_13.
7
Implication of next-generation sequencing on association studies.
BMC Genomics. 2011 Jun 17;12:322. doi: 10.1186/1471-2164-12-322.
8
Efficient association study design via power-optimized tag SNP selection.
Ann Hum Genet. 2008 Nov;72(Pt 6):834-47. doi: 10.1111/j.1469-1809.2008.00469.x. Epub 2008 Aug 13.
9
Detection of common single nucleotide polymorphisms synthesizing quantitative trait association of rarer causal variants.
Genome Res. 2011 Jul;21(7):1122-30. doi: 10.1101/gr.115832.110. Epub 2011 Mar 25.
10
Using the gene ontology to scan multilevel gene sets for associations in genome wide association studies.
Genet Epidemiol. 2012 Jan;36(1):3-16. doi: 10.1002/gepi.20632. Epub 2011 Dec 7.

引用本文的文献

1
Two-phase sample selection strategies for design and analysis in post-genome-wide association fine-mapping studies.
Stat Med. 2021 Dec 30;40(30):6792-6817. doi: 10.1002/sim.9211. Epub 2021 Oct 1.
2
Two-phase designs for joint quantitative-trait-dependent and genotype-dependent sampling in post-GWAS regional sequencing.
Genet Epidemiol. 2018 Feb;42(1):104-116. doi: 10.1002/gepi.22099. Epub 2017 Dec 14.
3
Breast cancer chemoprevention pharmacogenomics: Deep sequencing and functional genomics of the and genes.
NPJ Breast Cancer. 2017 Aug 21;3:30. doi: 10.1038/s41523-017-0036-4. eCollection 2017.
4
Complex pedigrees in the sequencing era: to track transmissions or decorrelate?
Genet Epidemiol. 2014 Sep;38 Suppl 1(0 1):S29-36. doi: 10.1002/gepi.21822.
5
Two-phase and family-based designs for next-generation sequencing studies.
Front Genet. 2013 Dec 13;4:276. doi: 10.3389/fgene.2013.00276.

本文引用的文献

2
Using the whole cohort in the analysis of case-cohort data.
Am J Epidemiol. 2009 Jun 1;169(11):1398-405. doi: 10.1093/aje/kwp055. Epub 2009 Apr 8.
3
Multistage sampling for genetic studies.
Annu Rev Genomics Hum Genet. 2007;8:327-42. doi: 10.1146/annurev.genom.8.080706.092357.
4
Weighted likelihood, pseudo-likelihood and maximum likelihood methods for logistic regression analysis of two-stage data.
Stat Med. 1997;16(1-3):103-16. doi: 10.1002/(sici)1097-0258(19970115)16:1<103::aid-sim474>3.0.co;2-p.
5
Optimal sampling strategies for two-stage studies.
Am J Epidemiol. 1996 Jan 1;143(1):92-100. doi: 10.1093/oxfordjournals.aje.a008662.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验