Zhang Jie, Wei Zhi, Cardinale Christopher J, Gusareva Elena S, Van Steen Kristel, Sleiman Patrick, Hakonarson Hakon
Department of Computer Science, New Jersey Institute of Technology, Newark, NJ, United States.
Adobe Inc., San Jose, CA, United States.
Front Genet. 2019 Apr 3;10:257. doi: 10.3389/fgene.2019.00257. eCollection 2019.
Successful searching for epistasis is much challenging, which generally requires very large sample sizes and/or very dense marker information. We exploited the largest Crohn's disease (CD) dataset (18,000 cases + 34,000 controls) and ulcerative colitis (UC) dataset (14,000 cases + 34,000 controls) to date. Leveraging its dense marker information and the large sample size of this IBD dataset, we employed a two-step approach to exhaustively search for epistasis. We detected abundant genome-wide significant ( < 1 × 10) epistatic signals, all within the MHC region. These signals were reduced substantially when conditional on the additive background, but still nine pairs remained significant at the Immunochip-wide level ( < 1.1 × 10) in conditional tests for UC. All these nine epistatic interactions come from the MHC region, and each explains on average 0.15% of the phenotypic variance. Eight of them were replicated in a replication cohort. There are multiple but relatively weak interactions independent of the additive effects within the MHC region for UC. Our promising results warrant the search for epistasis in large data sets with dense markers, exploiting dependencies between markers.
成功搜寻上位效应极具挑战性,这通常需要非常大的样本量和/或非常密集的标记信息。我们利用了迄今为止最大的克罗恩病(CD)数据集(18000例病例 + 34000例对照)和溃疡性结肠炎(UC)数据集(14000例病例 + 34000例对照)。借助该炎症性肠病数据集密集的标记信息和大样本量,我们采用两步法来彻底搜寻上位效应。我们检测到大量全基因组显著(< 1×10)的上位信号,均位于MHC区域内。当以加性背景为条件时,这些信号大幅减少,但在UC的条件性检验中,仍有9对在免疫芯片水平(< 1.1×10)显著。所有这9种上位相互作用均来自MHC区域,且每种平均解释0.15%的表型变异。其中8种在一个重复队列中得到重复验证。对于UC,在MHC区域内存在多个但相对较弱的独立于加性效应的相互作用。我们的这些有前景的结果表明,利用标记之间的相关性,在具有密集标记的大数据集中搜寻上位效应是可行的。