用于全基因组关联研究的最优两阶段基因分型设计

Optimal two-stage genotyping designs for genome-wide association scans.

作者信息

Wang Hansong, Thomas Duncan C, Pe'er Itsik, Stram Daniel O

机构信息

Division of Biostatistics and Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California, USA.

出版信息

Genet Epidemiol. 2006 May;30(4):356-68. doi: 10.1002/gepi.20150.

DOI:10.1002/gepi.20150

PMID:16607626

Abstract

The much-anticipated fixed-array, genome-wide SNP genotyping technologies make large-scale genome-wide association scans now possible for large numbers of subjects. In this paper we reconsider the problem (Satagopan and Elston [2003] Genet Epidemiol 25:149-157) of optimizing a two-stage genotyping design to deal with important new issues that are relevant when studies are expanded from candidate gene size to a genome-wide scale. We investigate how the basic two-stage genotyping approach, in which all markers are genotyped in an initial group of subjects (stage I) and only the promising markers are genotyped in additional subjects (stage II), can be used to reduce genotyping cost in a genome-wide case-control association study even after allowing for much higher per genotype costs using specially designed assays in stage II, compared to the fixed array of SNPs used in stage I. In addition, we consider the problem of using measured SNPs to make (imperfect) prediction of unmeasured SNPs for association tests of all SNPs (measured or unmeasured) genome wide and the utility of expanding genotyping densities in stage II in the regions where significant associations were detected in stage I. Under a set of reasonable but conservative assumptions, we derive optimal two-stage design configurations (sample sizes and the thresholds of significance in both stages) with these optimal designs depending both on the total number of markers tested and upon the ratios of cost in stage II versus stage I. In addition we show how existing software for power and sample size calculations can be used for the purpose of designing two-stage studies, for a wide range of assumptions about the number of markers genotyped and the costs of genotyping in each stage of the study.

摘要

备受期待的固定阵列全基因组单核苷酸多态性（SNP）基因分型技术，使得对大量受试者进行大规模全基因组关联扫描成为可能。在本文中，我们重新审视了一个问题（Satagopan和Elston [2003]《遗传流行病学》25:149 - 157），即优化两阶段基因分型设计，以应对当研究从候选基因规模扩展到全基因组规模时出现的重要新问题。我们研究了基本的两阶段基因分型方法，即在初始受试者组（第一阶段）中对所有标记进行基因分型，而仅在额外受试者（第二阶段）中对有前景的标记进行基因分型，如何用于降低全基因组病例对照关联研究中的基因分型成本，即便在第二阶段使用专门设计的检测方法时每个基因型的成本比第一阶段使用的固定SNP阵列要高得多。此外，我们考虑了利用已测量的SNP对未测量的SNP进行（不完美）预测，以用于全基因组所有SNP（已测量或未测量）的关联测试的问题，以及在第一阶段检测到显著关联的区域中增加第二阶段基因分型密度的效用。在一组合理但保守的假设下，我们得出了最优的两阶段设计配置（样本量和两个阶段的显著性阈值），这些最优设计既取决于所测试标记的总数，也取决于第二阶段与第一阶段的成本比。此外，我们展示了现有的功效和样本量计算软件如何用于设计两阶段研究，适用于关于基因分型标记数量和研究每个阶段基因分型成本的广泛假设。

相似文献

Optimal two-stage genotyping designs for genome-wide association scans.

Genet Epidemiol. 2006 May;30(4):356-68. doi: 10.1002/gepi.20150.

Optimal robust two-stage designs for genome-wide association studies.

Ann Hum Genet. 2009 Nov;73(Pt 6):638-51. doi: 10.1111/j.1469-1809.2009.00544.x.

Optimal designs for two-stage genome-wide association studies.

Genet Epidemiol. 2007 Nov;31(7):776-88. doi: 10.1002/gepi.20240.

Including sampling and phenotyping costs into the optimization of two stage designs for genomewide association studies.

Genet Epidemiol. 2007 Dec;31(8):844-52. doi: 10.1002/gepi.20245.

Optimal multistage designs--a general framework for efficient genome-wide association studies.

Biostatistics. 2009 Apr;10(2):297-309. doi: 10.1093/biostatistics/kxn036. Epub 2008 Dec 15.

Toward genome-wide SNP genotyping.

Nat Genet. 2005 Jun;37 Suppl:S5-10. doi: 10.1038/ng1558.

Joint analysis is more efficient than replication-based analysis for two-stage genome-wide association studies.

Nat Genet. 2006 Feb;38(2):209-13. doi: 10.1038/ng1706. Epub 2006 Jan 15.

Study designs for genome-wide association studies.

Adv Genet. 2008;60:465-504. doi: 10.1016/S0065-2660(07)00417-8.

The TaqMan method for SNP genotyping.

Methods Mol Biol. 2009;578:293-306. doi: 10.1007/978-1-60327-411-1_19.

Application of the stepwise focusing method to optimize the cost-effectiveness of genome-wide association studies with limited research budgets for genotyping and phenotyping.

Ann Hum Genet. 2005 May;69(Pt 3):323-8. doi: 10.1046/j.1529-8817.2005.00157.x.

引用本文的文献

Genome-wide association analysis identifies a susceptibility locus for sporadic vestibular schwannoma at 9p21.

Brain. 2023 Jul 3;146(7):2861-2868. doi: 10.1093/brain/awac478.

Inherited DNA-Repair Defects in Colorectal Cancer.

Am J Hum Genet. 2018 Mar 1;102(3):401-414. doi: 10.1016/j.ajhg.2018.01.018. Epub 2018 Feb 22.

Two-phase designs for joint quantitative-trait-dependent and genotype-dependent sampling in post-GWAS regional sequencing.

Genet Epidemiol. 2018 Feb;42(1):104-116. doi: 10.1002/gepi.22099. Epub 2017 Dec 14.

Identifying Potential Regions of Copy Number Variation for Bipolar Disorder.

Microarrays (Basel). 2014 Feb 28;3(1):52-71. doi: 10.3390/microarrays3010052.

Genome-Wide Association Study for Autism Spectrum Disorder in Taiwanese Han Population.

PLoS One. 2015 Sep 23;10(9):e0138695. doi: 10.1371/journal.pone.0138695. eCollection 2015.

A guide to genome-wide association analysis and post-analytic interrogation.

Stat Med. 2015 Dec 10;34(28):3769-92. doi: 10.1002/sim.6605. Epub 2015 Sep 6.

Two-stage family-based designs for sequencing studies.

BMC Proc. 2014 Jun 17;8(Suppl 1):S32. doi: 10.1186/1753-6561-8-S1-S32. eCollection 2014.

Two-phase and family-based designs for next-generation sequencing studies.

Front Genet. 2013 Dec 13;4:276. doi: 10.3389/fgene.2013.00276.

Association of CASP9, CASP10 gene polymorphisms and tea drinking with colorectal cancer risk in the Han Chinese population.

J Zhejiang Univ Sci B. 2013 Jan;14(1):47-57. doi: 10.1631/jzus.B1200218.

BCL2 genetic variants are associated with acute kidney injury in septic shock*.

Crit Care Med. 2012 Jul;40(7):2116-23. doi: 10.1097/CCM.0b013e3182514bca.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于全基因组关联研究的最优两阶段基因分型设计

Optimal two-stage genotyping designs for genome-wide association scans.

作者信息

Wang Hansong, Thomas Duncan C, Pe'er Itsik, Stram Daniel O

机构信息

Division of Biostatistics and Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California, USA.

出版信息

Genet Epidemiol. 2006 May;30(4):356-68. doi: 10.1002/gepi.20150.

DOI:10.1002/gepi.20150

PMID:16607626

Abstract

摘要

用于全基因组关联研究的最优两阶段基因分型设计

Optimal two-stage genotyping designs for genome-wide association scans.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

用于全基因组关联研究的最优两阶段基因分型设计

Optimal two-stage genotyping designs for genome-wide association scans.

作者信息

机构信息

出版信息

相似文献

引用本文的文献