Begum Ferdouse, Sharker Monir H, Sherman Stephanie L, Tseng George C, Feingold Eleanor
Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America.
Department of Information Science and Technology, University of Pittsburgh, Pennsylvania, United States of America.
Genet Epidemiol. 2016 Feb;40(2):154-60. doi: 10.1002/gepi.21949. Epub 2015 Dec 28.
Genome-wide association studies are proven tools for finding disease genes, but it is often necessary to combine many cohorts into a meta-analysis to detect statistically significant genetic effects. Often the component studies are performed by different investigators on different populations, using different chips with minimal SNPs overlap. In some cases, raw data are not available for imputation so that only the genotyped single nucleotide polymorphisms (SNPs) results can be used in meta-analysis. Even when SNP sets are comparable, different cohorts may have peak association signals at different SNPs within the same gene due to population differences in linkage disequilibrium or environmental interactions. We hypothesize that the power to detect statistical signals in these situations will improve by using a method that simultaneously meta-analyzes and smooths the signal over nearby markers. In this study, we propose regionally smoothed meta-analysis methods and compare their performance on real and simulated data.
全基因组关联研究是寻找疾病基因的经证实的工具,但通常有必要将多个队列合并进行荟萃分析,以检测具有统计学意义的基因效应。通常,组成研究由不同的研究者在不同的人群中进行,使用不同的芯片,单核苷酸多态性(SNP)重叠最少。在某些情况下,原始数据不可用于归因,因此在荟萃分析中只能使用基因分型的单核苷酸多态性(SNP)结果。即使SNP集具有可比性,由于连锁不平衡或环境相互作用的人群差异,不同队列在同一基因内的不同SNP处可能有峰值关联信号。我们假设,通过使用一种同时对附近标记进行荟萃分析和平滑信号的方法,在这些情况下检测统计信号的能力将得到提高。在本研究中,我们提出了区域平滑荟萃分析方法,并在真实数据和模拟数据上比较了它们的性能。