Murk William, DeWan Andrew T
Department of Chronic Disease Epidemiology, Yale School of Public Health, New Haven, Connecticut 06510.
Department of Chronic Disease Epidemiology, Yale School of Public Health, New Haven, Connecticut 06510
G3 (Bethesda). 2016 Jul 7;6(7):2043-50. doi: 10.1534/g3.116.028563.
The identification of statistical SNP-SNP interactions may help explain the genetic etiology of many human diseases, but exhaustive genome-wide searches for these interactions have been difficult, due to a lack of power in most datasets. We aimed to use data from the Resource for Genetic Epidemiology Research on Adult Health and Aging (GERA) study to search for SNP-SNP interactions associated with 10 common diseases. FastEpistasis and BOOST were used to evaluate all pairwise interactions among approximately N = 300,000 single nucleotide polymorphisms (SNPs) with minor allele frequency (MAF) ≥ 0.15, for the dichotomous outcomes of allergic rhinitis, asthma, cardiac disease, depression, dermatophytosis, type 2 diabetes, dyslipidemia, hemorrhoids, hypertensive disease, and osteoarthritis. A total of N = 45,171 subjects were included after quality control steps were applied. These data were divided into discovery and replication subsets; the discovery subset had > 80% power, under selected models, to detect genome-wide significant interactions (P < 10(-12)). Interactions were also evaluated for enrichment in particular SNP features, including functionality, prior disease relevancy, and marginal effects. No interaction in any disease was significant in both the discovery and replication subsets. Enrichment analysis suggested that, for some outcomes, interactions involving SNPs with marginal effects were more likely to be nominally replicated, compared to interactions without marginal effects. If SNP-SNP interactions play a role in the etiology of the studied conditions, they likely have weak effect sizes, involve lower-frequency variants, and/or involve complex models of interaction that are not captured well by the methods that were utilized.
统计性单核苷酸多态性(SNP)-SNP相互作用的识别可能有助于解释许多人类疾病的遗传病因,但由于大多数数据集缺乏检验效能,对这些相互作用进行全基因组范围的详尽搜索一直很困难。我们旨在利用成人健康与衰老遗传流行病学研究资源(GERA)的数据,搜索与10种常见疾病相关的SNP-SNP相互作用。使用FastEpistasis和BOOST软件,针对变应性鼻炎、哮喘、心脏病、抑郁症、皮肤癣菌病、2型糖尿病、血脂异常、痔疮、高血压病和骨关节炎的二分结局,评估了次要等位基因频率(MAF)≥0.15的约N = 300,000个单核苷酸多态性(SNP)之间的所有成对相互作用。在应用质量控制步骤后,共纳入了N = 45,171名受试者。这些数据被分为发现集和验证集;在选定模型下,发现集有超过80%的检验效能来检测全基因组显著的相互作用(P < 10^(-12))。还评估了相互作用在特定SNP特征(包括功能、既往疾病相关性和边际效应)方面的富集情况。在发现集和验证集中,任何疾病的相互作用均无显著意义。富集分析表明,对于某些结局,与无边际效应的相互作用相比,涉及有边际效应SNP的相互作用更有可能被名义上验证。如果SNP-SNP相互作用在所研究疾病的病因中起作用,它们可能效应大小较弱,涉及低频变异,和/或涉及所使用方法未能很好捕捉的复杂相互作用模型。