Cattaert Tom, Calle M Luz, Dudek Scott M, Mahachie John Jestinah M, Van Lishout François, Urrea Victor, Ritchie Marylyn D, Van Steen Kristel
Montefiore Institute, University of Liege, Belgium.
Ann Hum Genet. 2011 Jan;75(1):78-89. doi: 10.1111/j.1469-1809.2010.00604.x. Epub 2010 Sep 8.
Analyzing the combined effects of genes and/or environmental factors on the development of complex diseases is a great challenge from both the statistical and computational perspective, even using a relatively small number of genetic and nongenetic exposures. Several data-mining methods have been proposed for interaction analysis, among them, the Multifactor Dimensionality Reduction Method (MDR) has proven its utility in a variety of theoretical and practical settings. Model-Based Multifactor Dimensionality Reduction (MB-MDR), a relatively new MDR-based technique that is able to unify the best of both nonparametric and parametric worlds, was developed to address some of the remaining concerns that go along with an MDR analysis. These include the restriction to univariate, dichotomous traits, the absence of flexible ways to adjust for lower order effects and important confounders, and the difficulty in highlighting epistatic effects when too many multilocus genotype cells are pooled into two new genotype groups. We investigate the empirical power of MB-MDR to detect gene-gene interactions in the absence of any noise and in the presence of genotyping error, missing data, phenocopy, and genetic heterogeneity. Power is generally higher for MB-MDR than for MDR, in particular in the presence of genetic heterogeneity, phenocopy, or low minor allele frequencies.
从统计学和计算角度来看,分析基因和/或环境因素对复杂疾病发展的综合影响是一项巨大挑战,即便使用相对少量的遗传和非遗传暴露因素亦是如此。已经提出了几种用于相互作用分析的数据挖掘方法,其中,多因素降维法(MDR)已在各种理论和实际场景中证明了其效用。基于模型的多因素降维法(MB-MDR)是一种相对较新的基于MDR的技术,它能够融合非参数和参数世界的最佳之处,旨在解决MDR分析中一些尚存的问题。这些问题包括对单变量二分性状的限制、缺乏灵活调整低阶效应和重要混杂因素的方法,以及当过多的多位点基因型细胞被合并为两个新的基因型组时难以突出上位性效应。我们研究了MB-MDR在无任何噪声以及存在基因分型错误、缺失数据、拟表型和遗传异质性情况下检测基因-基因相互作用的实证效能。MB-MDR的效能通常高于MDR,尤其是在存在遗传异质性、拟表型或低次要等位基因频率的情况下。