Hou Lin, Chen Min, Zhang Clarence K, Cho Judy, Zhao Hongyu
Department of Biostatistics, Yale School of Public Health, New Haven, CT 06510, USA.
Hum Mol Genet. 2014 May 15;23(10):2780-90. doi: 10.1093/hmg/ddt668. Epub 2013 Dec 30.
Although Genome Wide Association Studies (GWAS) have identified many susceptibility loci for common diseases, they only explain a small portion of heritability. It is challenging to identify the remaining disease loci because their association signals are likely weak and difficult to identify among millions of candidates. One potentially useful direction to increase statistical power is to incorporate functional genomics information, especially gene expression networks, to prioritize GWAS signals. Most current methods utilizing network information to prioritize disease genes are based on the 'guilt by association' principle, in which networks are treated as static, and disease-associated genes are assumed to locate closer with each other than random pairs in the network. In contrast, we propose a novel 'guilt by rewiring' principle. Studying the dynamics of gene networks between controls and patients, this principle assumes that disease genes more likely undergo rewiring in patients, whereas most of the network remains unaffected in disease condition. To demonstrate this principle, we consider the changes of co-expression networks in Crohn's disease patients and controls, and how network dynamics reveals information on disease associations. Our results demonstrate that network rewiring is abundant in the immune system, and disease-associated genes are more likely to be rewired in patients. To integrate this network rewiring feature and GWAS signals, we propose to use the Markov random field framework to integrate network information to prioritize genes. Applications in Crohn's disease and Parkinson's disease show that this framework leads to more replicable results, and implicates potentially disease-associated pathways.
尽管全基因组关联研究(GWAS)已经确定了许多常见疾病的易感基因座,但它们仅解释了遗传力的一小部分。识别其余的疾病基因座具有挑战性,因为它们的关联信号可能很微弱,并且在数百万个候选基因中难以识别。提高统计效力的一个潜在有用方向是纳入功能基因组学信息,尤其是基因表达网络,以便对GWAS信号进行优先级排序。目前大多数利用网络信息对疾病基因进行优先级排序的方法都基于“关联有罪”原则,即网络被视为静态的,并且假定与疾病相关的基因在网络中比随机配对的基因彼此定位得更近。相比之下,我们提出了一种新颖的“重连有罪”原则。通过研究对照组和患者之间基因网络的动态变化,该原则假定疾病基因在患者中更有可能发生重连,而在疾病状态下大多数网络保持不变。为了证明这一原则,我们考虑了克罗恩病患者和对照组中共表达网络的变化,以及网络动态如何揭示疾病关联信息。我们的结果表明,网络重连在免疫系统中很常见,并且与疾病相关的基因在患者中更有可能发生重连。为了整合这种网络重连特征和GWAS信号,我们建议使用马尔可夫随机场框架来整合网络信息以对基因进行优先级排序。在克罗恩病和帕金森病中的应用表明,该框架能产生更具可重复性的结果,并暗示潜在的疾病相关途径。