Zhang Li, Liu Ruitao, Wang Zhong, Culver Daniel A, Wu Rongling
Department of Quantitative Health Sciences, Cleveland Clinic Cleveland, OH, USA.
Front Genet. 2012 Jan 18;3:2. doi: 10.3389/fgene.2012.00002. eCollection 2012.
Haplotype analysis has been increasingly used to study the genetic basis of human diseases, but models for characterizing genetic interactions between haplotypes from different chromosomal regions have not been well developed in the current literature. In this article, we describe a statistical model for testing haplotype-haplotype interactions for human diseases with a case-control genetic association design. The model is formulated on a contingency table in which cases and controls are typed for the same set of molecular markers. By integrating well-established quantitative genetic principles, the model is equipped with a capacity to characterize physiologically meaningful epistasis arising from interactions between haplotypes from different chromosomal regions. The model allows the partition of epistasis into different components due to additive × additive, additive × dominance, dominance × additive, and dominance × dominance interactions. We derive the EM algorithm to estimate and test the effects of each of these components on differences in the pattern of genetic variation between cases and controls and, therefore, examine their role in the pathogenesis of human diseases. The method was further extended to investigate gene-environment interactions expressed at the haplotype level. The statistical properties of the models were investigated through simulation studies and its usefulness and utilization validated by analyzing the genetic association of sarcoidosis from a human genetics project.
单倍型分析已越来越多地用于研究人类疾病的遗传基础,但目前文献中尚未很好地建立用于表征来自不同染色体区域的单倍型之间遗传相互作用的模型。在本文中,我们描述了一种用于在病例对照遗传关联设计中测试人类疾病单倍型 - 单倍型相互作用的统计模型。该模型建立在一个列联表上,其中病例和对照针对同一组分子标记进行分型。通过整合成熟的数量遗传原理,该模型具备表征因来自不同染色体区域的单倍型之间相互作用而产生的具有生理意义的上位性的能力。该模型允许将上位性因加性×加性、加性×显性、显性×加性和显性×显性相互作用而划分为不同成分。我们推导了期望最大化(EM)算法来估计和检验这些成分中每一个对病例和对照之间遗传变异模式差异的影响,从而研究它们在人类疾病发病机制中的作用。该方法进一步扩展以研究在单倍型水平表达的基因 - 环境相互作用。通过模拟研究考察了模型的统计特性,并通过分析一个人类遗传学项目中结节病的遗传关联来验证其有用性和实用性。