Fish Alexandra E, Capra John A, Bush William S
Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN 37235, USA.
Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN 37235, USA; Departments of Biological Sciences, Biomedical Informatics, and Computer Science, Vanderbilt University, Nashville, TN 37235, USA.
Am J Hum Genet. 2016 Oct 6;99(4):817-830. doi: 10.1016/j.ajhg.2016.07.022. Epub 2016 Sep 15.
The importance of epistasis-or statistical interactions between genetic variants-to the development of complex disease in humans has been controversial. Genome-wide association studies of statistical interactions influencing human traits have recently become computationally feasible and have identified many putative interactions. However, statistical models used to detect interactions can be confounded, which makes it difficult to be certain that observed statistical interactions are evidence for true molecular epistasis. In this study, we investigate whether there is evidence for epistatic interactions between genetic variants within the cis-regulatory region that influence gene expression after accounting for technical, statistical, and biological confounding factors. We identified 1,119 (FDR = 5%) interactions that appear to regulate gene expression in human lymphoblastoid cell lines, a tightly controlled, largely genetically determined phenotype. Many of these interactions replicated in an independent dataset (90 of 803 tested, Bonferroni threshold). We then performed an exhaustive analysis of both known and novel confounders, including ceiling/floor effects, missing genotype combinations, haplotype effects, single variants tagged through linkage disequilibrium, and population stratification. Every interaction could be explained by at least one of these confounders, and replication in independent datasets did not protect against some confounders. Assuming that the confounding factors provide a more parsimonious explanation for each interaction, we find it unlikely that cis-regulatory interactions contribute strongly to human gene expression, which calls into question the relevance of cis-regulatory interactions for other human phenotypes. We additionally propose several best practices for epistasis testing to protect future studies from confounding.
上位性(即基因变异之间的统计相互作用)对人类复杂疾病发展的重要性一直存在争议。最近,对影响人类性状的统计相互作用进行全基因组关联研究在计算上已变得可行,并已确定了许多假定的相互作用。然而,用于检测相互作用的统计模型可能会受到混淆,这使得难以确定观察到的统计相互作用是否是真正分子上位性的证据。在本研究中,我们调查在考虑技术、统计和生物学混杂因素后,顺式调控区域内影响基因表达的基因变异之间是否存在上位性相互作用的证据。我们在人类淋巴母细胞系中确定了1119个(错误发现率=5%)似乎调控基因表达的相互作用,人类淋巴母细胞系是一种受到严格控制、很大程度上由基因决定的表型。其中许多相互作用在独立数据集中得到了重复验证(803个测试中有90个,采用Bonferroni阈值)。然后,我们对已知和新出现的混杂因素进行了详尽分析,包括天花板/地板效应、缺失的基因型组合、单倍型效应、通过连锁不平衡标记的单个变异以及群体分层。每一种相互作用都可以由这些混杂因素中的至少一种来解释,并且在独立数据集中的重复验证并不能防止某些混杂因素的影响。假设混杂因素为每种相互作用提供了更简洁的解释,我们发现顺式调控相互作用对人类基因表达的贡献不大可能很大,这对顺式调控相互作用与其他人类表型的相关性提出了质疑。我们还提出了几个上位性检验的最佳实践,以保护未来的研究免受混杂因素的影响。