Luan Yi-Zhao, Zuo Xiao-Yu, Liu Ke, Li Gu, Rao Shao-Qi
Yi Chuan. 2013 Dec;35(12):1331-9. doi: 10.3724/sp.j.1005.2013.01331.
The SNP-based association analysis has become one of the most important approaches to interpret the underlying molecular mechanisms for human complex diseases. Nevertheless, the widely-used singe-locus analysis is only capable of capturing a small portion of susceptible SNPs with prominent marginal effects, leaving the important genetic component, epistasis or joint effects, to be undetectable. Identifying the complex interplays among multiple genes in the genome-wide context is an essential task for systematically unraveling the molecular mechanisms for complex diseases. Many approaches have been used to detect genome-wide gene-gene interactions and provided new insights into the genetic basis of complex diseases. This paper reviewed recent advances of the methods for detecting gene-gene interaction, categorized into three types, model-based and model-free statistical methods, and data mining methods, based on their characteristics in theory and numerical algorithm. In particular, the basic principle, numerical implementation and cautions for application for each method were elucidated. In addition, this paper briefly discussed the limitations and challenges associated with detecting genome-wide epistasis, in order to provide some methodological consultancies for scientists in the related fields.
基于单核苷酸多态性(SNP)的关联分析已成为阐释人类复杂疾病潜在分子机制的最重要方法之一。然而,广泛使用的单基因座分析仅能捕捉到一小部分具有显著边际效应的易感SNP,而重要的遗传成分——上位性或基因间联合效应则无法被检测到。在全基因组范围内识别多个基因之间复杂的相互作用,是系统揭示复杂疾病分子机制的一项重要任务。许多方法已被用于检测全基因组范围内的基因-基因相互作用,并为复杂疾病的遗传基础提供了新的见解。本文回顾了检测基因-基因相互作用方法的最新进展,根据其理论和数值算法特点将其分为三类:基于模型和无模型的统计方法以及数据挖掘方法。特别地,阐述了每种方法的基本原理、数值实现及应用注意事项。此外,本文简要讨论了检测全基因组上位性相关的局限性和挑战,以便为相关领域的科学家提供一些方法学参考。