Fondazione Bruno Kessler, Trento, Italy.
Pharmacogenomics J. 2010 Aug;10(4):355-63. doi: 10.1038/tpj.2010.47.
The discordance in results between independent genome-wide association studies (GWAS) indicates the potential for Type I and Type II errors. To identify the causes of variability underlying lack of reproducibility, here we present the results of a repeatability experiment on GWAS on a cohort of 1991 coronary artery disease individuals and 1500 controls (National Blood Service) provided by the Wellcome Trust Case Control Consortium. As part of the MicroArray Quality Control project, we identified quality control (QC) and association analysis steps with a major impact on the identification of candidate markers for possible classifiers. Different experimental conditions were used with the CHIAMO calling algorithm to assess the effects of batch size and case-control composition on downstream association analysis. Results showed that both composition and size create discordant single-nucleotide polymorphism (SNP) results for QC and statistical analysis and may contribute to the lack of reproducibility in GWAS. An interactive effect of batch size and composition contributes to discordant results in significantly associated loci. About 800 significant SNPs (Cochran-Armitage trend test, P<5.0 x 10(-7)) were found for batches of 2000 samples with separated cases and controls, whereas only 14 significant markers were found with one batch of all samples.
独立全基因组关联研究(GWAS)结果之间的不一致表明存在 I 型和 II 型错误的可能性。为了确定导致重现性缺乏的可变性的原因,我们在此介绍了对 1991 名冠心病个体和 1500 名对照(国家血液服务)的 GWAS 进行的重复性实验的结果,这些个体和对照由威康信托基金会病例对照联盟提供。作为 MicroArray Quality Control 项目的一部分,我们确定了质量控制(QC)和关联分析步骤,这些步骤对识别可能的分类器候选标记有重大影响。使用 CHIAMO 调用算法进行不同的实验条件评估,以评估批次大小和病例对照组成对下游关联分析的影响。结果表明,组成和大小都会对 QC 和统计分析产生不一致的单核苷酸多态性(SNP)结果,这可能导致 GWAS 缺乏重现性。批次大小和组成的交互作用会导致显着相关基因座的不一致结果。对于 2000 个样本的批次,分离病例和对照,发现了约 800 个显著 SNP(Cochran-Armitage 趋势检验,P<5.0 x 10(-7)),而对于一个批次的所有样本,仅发现了 14 个显著标记。